Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "corpora" keyword

Top 6.6% on pypi.org
pycantonese 3.4.0 💰
Cantonese Linguistics and NLP in Python
24 versions - Latest release: over 2 years ago - 1 dependent package - 6 dependent repositories - 1.5 thousand downloads last month - 335 stars on GitHub - 1 maintainer
demeuk 4.2.0
CLI tool to remove invalid chars from a corpus.
12 versions - Latest release: 5 months ago - 1 dependent repository - 78 downloads last month - 14 stars on GitHub - 1 maintainer
lcpcli 0.1.4
Helper for converting CONLLU files and uploading the corpus to LiRI Corpus Platform (LCP)
4 versions - Latest release: 6 days ago - 101 downloads last month - 0 stars on GitHub - 1 maintainer
corpona 1.0.1
A library for reading corpora.
2 versions - Latest release: about 3 years ago - 1 dependent repository - 25 downloads last month - 5 stars on GitHub - 1 maintainer
acqdiv 1.1.0
Pipeline for the ACQDIV database
5 versions - Latest release: over 3 years ago - 1 dependent repository - 42 downloads last month - 2 stars on GitHub - 2 maintainers
Top 6.2% on pypi.org
corus 0.10.0
Links to Russian corpora, functions for loading and parsing
9 versions - Latest release: 10 months ago - 12 dependent repositories - 638 downloads last month - 268 stars on GitHub - 1 maintainer
corpuscula 1.0.56
Toolkit that simplifies corpus processing
51 versions - Latest release: over 2 years ago - 1 dependent package - 3 dependent repositories - 644 downloads last month - 3 stars on GitHub - 1 maintainer
kollo 1.0.1
Extract collocations from VERT data
2 versions - Latest release: about 1 year ago - 8 downloads last month - 1 maintainer
kep 1.0.0
NLTK Data
3 versions - Latest release: almost 2 years ago - 9 downloads last month - 1,333 stars on GitHub - 1 maintainer
wiki-dump-reader 0.0.4
Extract corpora from Wikipedia dumps
4 versions - Latest release: over 5 years ago - 1 dependent package - 4 dependent repositories - 139 downloads last month - 23 stars on GitHub - 1 maintainer
vrt-spacy 0.0.1
Creating VRT corpora
1 version - Latest release: over 4 years ago - 1 dependent repository - 20 downloads last month - 0 stars on GitHub - 1 maintainer
vrt-generator 0.0.6
Creating VRT corpora
6 versions - Latest release: over 4 years ago - 1 dependent repository - 56 downloads last month - 0 stars on GitHub - 1 maintainer
ruscorpora 0.10.0 💰
Links to https://github.com/kunansy/rnc
5 versions - Latest release: almost 2 years ago - 1 dependent repository - 44 downloads last month - 6 stars on GitHub - 1 maintainer
rnc 0.10.0 💰
API for Russian National Corpus
20 versions - Latest release: almost 2 years ago - 2 dependent repositories - 93 downloads last month - 6 stars on GitHub - 1 maintainer
pavis 0.0.3
Parallel corpora version control engine
3 versions - Latest release: over 4 years ago - 1 dependent repository - 21 downloads last month - 1 maintainer
opus-api 0.6.2
OPUS (opus.nlpl.eu) Python API
19 versions - Latest release: almost 6 years ago - 6 dependent repositories - 137 downloads last month - 14 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
nltkdata 0.0.1
NLTK Data
1 version - Latest release: over 4 years ago - 5 dependent repositories - 4.81 thousand downloads last month - 1,333 stars on GitHub - 1 maintainer
magnet4c 0.0.2
Corpora operator for multiple file formats.
2 versions - Latest release: over 4 years ago - 1 dependent repository - 20 downloads last month - 1 maintainer
lyricscorpora 1.0.0
Lyrics API
10 versions - Latest release: about 6 years ago - 1 dependent repository - 17 downloads last month - 18 stars on GitHub - 1 maintainer
linguistica 5.2.1
Linguistica 5: Unsupervised Learning of Linguistic Structure
4 versions - Latest release: over 5 years ago - 1 dependent repository - 47 downloads last month - 29 stars on GitHub - 1 maintainer
lingcorpora 2.0rc0
API for text corpora
7 versions - Latest release: almost 5 years ago - 4 dependent repositories - 37 downloads last month - 7 stars on GitHub - 3 maintainers
Top 8.6% on pypi.org
corpora 1.0
Lightweight, fast and scalable text corpus library.
1 version - Latest release: over 12 years ago - 9 dependent repositories - 744 downloads last month - 1 maintainer
coquery 0.10.0
Coquery: A free corpus query tool
5 versions - Latest release: about 7 years ago - 2 dependent repositories - 61 downloads last month - 1 maintainer
buzzword 1.4.0
Web-app for corpus linguistics
19 versions - Latest release: about 4 years ago - 195 downloads last month - 37,327 stars on GitHub - 1 maintainer
corpus-downloader 0.1.11
A downloader for textual corpora, for use in digital humanities, corpus linguistics, and natural ...
10 versions - Latest release: over 7 years ago - 2 dependent repositories - 50 downloads last month - 33 stars on GitHub - 1 maintainer
wordseg 0.0.5 💰
Word segmentation models
5 versions - Latest release: about 1 year ago - 1 dependent package - 1 dependent repository - 1.52 thousand downloads last month - 3 stars on GitHub - 1 maintainer
search-kwic 0.2
Tool for KWIC representation of parallel text. Find a word in the parallel text corresponding to q...
1 version - Latest release: almost 6 years ago - 1 dependent repository - 12 downloads last month - 0 stars on GitHub - 1 maintainer
corpus-similarity 1.0.1
Measuring corpus similarity in Python
3 versions - Latest release: almost 3 years ago - 1 dependent repository - 41 downloads last month - 10 stars on GitHub - 1 maintainer
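
Each entry in the listing above ends with a statistics line whose fields are separated by " - ". As a rough illustration only, the Python sketch below turns one such line into a dictionary. The function names and dictionary keys (parse_stats, parse_count, downloads_last_month, and so on) are hypothetical and not part of the Ecosyste.ms API; the line format is assumed from the entries shown here, including counts written as "1.5 thousand" or "1,333".

```python
import re

# Hypothetical helpers for the statistics lines in this listing.
# They are not part of any Ecosyste.ms client library.

_MULTIPLIERS = {"thousand": 1_000, "million": 1_000_000}

# Map the labels seen in the listing (singular or plural) to dictionary keys.
_KEYS = {
    "version": "versions",
    "versions": "versions",
    "dependent package": "dependent_packages",
    "dependent packages": "dependent_packages",
    "dependent repository": "dependent_repositories",
    "dependent repositories": "dependent_repositories",
    "downloads last month": "downloads_last_month",
    "stars on GitHub": "stars_on_github",
    "maintainer": "maintainers",
    "maintainers": "maintainers",
}

# A count such as "24", "1,333" or "4.81 thousand", followed by its label.
_FIELD = re.compile(
    r"^(?P<count>[\d.,]+(?: (?:thousand|million))?) (?P<label>.+)$"
)


def parse_count(text):
    """Convert '78', '1,333' or '4.81 thousand' to an integer."""
    parts = text.replace(",", "").split()
    value = float(parts[0])
    if len(parts) > 1:
        value *= _MULTIPLIERS[parts[1]]
    return round(value)


def parse_stats(line):
    """Parse one ' - '-separated statistics line into a dictionary."""
    stats = {}
    for field in line.split(" - "):
        field = field.strip()
        if field.startswith("Latest release:"):
            # Keep the relative date as text; it cannot be made exact.
            stats["latest_release"] = field.split(":", 1)[1].strip()
            continue
        match = _FIELD.match(field)
        if match is None or match.group("label") not in _KEYS:
            continue  # Skip fields this sketch does not recognise.
        stats[_KEYS[match.group("label")]] = parse_count(match.group("count"))
    return stats
```

For example, passing the statistics line of the pycantonese entry to parse_stats gives a dictionary with one key per field. Fields that an entry omits, such as the star count for kollo, are simply absent from the result.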