An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "corpus-data" keyword

View the packages on the pypi.org package registry that are tagged with the "corpus-data" keyword.

canto-filter 1.1.4
粵文分類篩選器 Cantonese text filter
10 versions - Latest release: 19 days ago - 570 downloads last month - 33 stars on GitHub - 1 maintainer
cantonesedetect 1.1.2
A minimal package that detect Cantonese sentences in Traditional Chinese text.
4 versions - Latest release: 4 months ago - 171 downloads last month - 33 stars on GitHub - 1 maintainer
ms3 2.6.0
A parser for MuseScore files, serving as data factory for annotated music corpora.
62 versions - Latest release: about 1 month ago - 2 dependent packages - 1 dependent repositories - 1.08 thousand downloads last month - 28 stars on GitHub - 1 maintainer
ua-gec 2.1.3
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian language
9 versions - Latest release: about 1 year ago - 1 dependent package - 1 dependent repositories - 361 downloads last month - 260 stars on GitHub - 1 maintainer
bengalinlp 2.0.0
BengaliNLP is a natural language processing toolkit for Bengali Language
2 versions - Latest release: 7 months ago - 86 downloads last month - 2 stars on GitHub - 1 maintainer