Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "corpora" keyword
Top 6.6% on pypi.org
24 versions - Latest release: over 2 years ago - 1 dependent package - 6 dependent repositories - 1.5 thousand downloads last month - 335 stars on GitHub - 1 maintainer
pycantonese 3.4.0 💰
Cantonese Linguistics and NLP in Python24 versions - Latest release: over 2 years ago - 1 dependent package - 6 dependent repositories - 1.5 thousand downloads last month - 335 stars on GitHub - 1 maintainer
demeuk 4.2.0
CLI tool to remove invalid chars from a corpus.12 versions - Latest release: 5 months ago - 1 dependent repositories - 78 downloads last month - 14 stars on GitHub - 1 maintainer
lcpcli 0.1.4
Helper for converting CONLLU files and uploading the corpus to LiRI Corpus Platform (LCP)4 versions - Latest release: 6 days ago - 101 downloads last month - 0 stars on GitHub - 1 maintainer
corpona 1.0.1
A library for reading corpora.2 versions - Latest release: about 3 years ago - 1 dependent repositories - 25 downloads last month - 5 stars on GitHub - 1 maintainer
acqdiv 1.1.0
Pipeline for the ACQDIV database5 versions - Latest release: over 3 years ago - 1 dependent repositories - 42 downloads last month - 2 stars on GitHub - 2 maintainers
Top 6.2% on pypi.org
9 versions - Latest release: 10 months ago - 12 dependent repositories - 638 downloads last month - 268 stars on GitHub - 1 maintainer
corus 0.10.0
Links to russian corpora, functions for loading and parsing9 versions - Latest release: 10 months ago - 12 dependent repositories - 638 downloads last month - 268 stars on GitHub - 1 maintainer
corpuscula 1.0.56
Toolkit that simplifies corpus processing51 versions - Latest release: over 2 years ago - 1 dependent package - 3 dependent repositories - 644 downloads last month - 3 stars on GitHub - 1 maintainer
kollo 1.0.1
Extract collocations from VERT data2 versions - Latest release: about 1 year ago - 8 downloads last month - 1 maintainer
kep 1.0.0
NLTK Data3 versions - Latest release: almost 2 years ago - 9 downloads last month - 1,333 stars on GitHub - 1 maintainer
wiki-dump-reader 0.0.4
Extract corpora from Wikipedia dumps4 versions - Latest release: over 5 years ago - 1 dependent package - 4 dependent repositories - 139 downloads last month - 23 stars on GitHub - 1 maintainer
vrt-spacy 0.0.1
creating vrt corpora1 version - Latest release: over 4 years ago - 1 dependent repositories - 20 downloads last month - 0 stars on GitHub - 1 maintainer
vrt-generator 0.0.6
creating vrt corpora6 versions - Latest release: over 4 years ago - 1 dependent repositories - 56 downloads last month - 0 stars on GitHub - 1 maintainer
ruscorpora 0.10.0 💰
Links to https://github.com/kunansy/rnc5 versions - Latest release: almost 2 years ago - 1 dependent repositories - 44 downloads last month - 6 stars on GitHub - 1 maintainer
rnc 0.10.0 💰
API for Russian National Corpus20 versions - Latest release: almost 2 years ago - 2 dependent repositories - 93 downloads last month - 6 stars on GitHub - 1 maintainer
pavis 0.0.3
Parallel corpora version control engine3 versions - Latest release: over 4 years ago - 1 dependent repositories - 21 downloads last month - 1 maintainer
opus-api 0.6.2
OPUS (opus.nlpl.eu) Python API19 versions - Latest release: almost 6 years ago - 6 dependent repositories - 137 downloads last month - 14 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
1 version - Latest release: over 4 years ago - 5 dependent repositories - 4.81 thousand downloads last month - 1,333 stars on GitHub - 1 maintainer
nltkdata 0.0.1
NLTK Data1 version - Latest release: over 4 years ago - 5 dependent repositories - 4.81 thousand downloads last month - 1,333 stars on GitHub - 1 maintainer
magnet4c 0.0.2
Corpora operator for multiple file formats.2 versions - Latest release: over 4 years ago - 1 dependent repositories - 20 downloads last month - 1 maintainer
lyricscorpora 1.0.0
Lyrics API10 versions - Latest release: about 6 years ago - 1 dependent repositories - 17 downloads last month - 18 stars on GitHub - 1 maintainer
linguistica 5.2.1
Linguistica 5: Unsupervised Learning of Linguistic Structure4 versions - Latest release: over 5 years ago - 1 dependent repositories - 47 downloads last month - 29 stars on GitHub - 1 maintainer
lingcorpora 2.0rc0
API for text corpora7 versions - Latest release: almost 5 years ago - 4 dependent repositories - 37 downloads last month - 7 stars on GitHub - 3 maintainers
Top 8.6% on pypi.org
1 version - Latest release: over 12 years ago - 9 dependent repositories - 744 downloads last month - 1 maintainer
corpora 1.0
Lightweight, fast and scalable text corpus library.1 version - Latest release: over 12 years ago - 9 dependent repositories - 744 downloads last month - 1 maintainer
coquery 0.10.0
Coquery: A free corpus query tool5 versions - Latest release: about 7 years ago - 2 dependent repositories - 61 downloads last month - 1 maintainer
buzzword 1.4.0
Web-app for corpus linguistics19 versions - Latest release: about 4 years ago - 195 downloads last month - 37,327 stars on GitHub - 1 maintainer
corpus-downloader 0.1.11
A downloader for textual corpora, for use in digital humanities, corpus linguistics, and natural ...10 versions - Latest release: over 7 years ago - 2 dependent repositories - 50 downloads last month - 33 stars on GitHub - 1 maintainer
wordseg 0.0.5 💰
Word segmentation models5 versions - Latest release: about 1 year ago - 1 dependent package - 1 dependent repositories - 1.52 thousand downloads last month - 3 stars on GitHub - 1 maintainer
search-kwic 0.2
Tool for KWIC representation of paralleltext. Find a word in the parallel text corresponding to q...1 version - Latest release: almost 6 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 1 maintainer
corpus-similarity 1.0.1
Measuring corpus similarity in Python3 versions - Latest release: almost 3 years ago - 1 dependent repositories - 41 downloads last month - 10 stars on GitHub - 1 maintainer
Related Keywords
linguistics
14
nlp
12
corpus
11
natural-language-processing
7
python
6
language
5
computational linguistics
4
natural language processing
4
NLP
4
parallel
4
api
4
corpus-linguistics
3
computational-linguistics
3
speech
3
russian-national-corpus
2
ruscorpora
2
rnc
2
corpus-tools
2
wrapper
2
vrt
2
linguistic-corpora
2
nltk
2
text
2
hacktoberfest
2
word-segmentation
2
corpus-processing
1
unsupervised learning
1
machine learning
1
scrapper
1
scraping-websites
1
python-api
1
Chinese
1
billboard-charts
1
artists
1
scrape
1
songs
1
billboard
1
music
1
LyricWikia
1
lyrics
1
replace
1
extract
1
file
1
parallel-corpus
1
parallel-corpora
1
opus
1
Named Entity Recognition
1
similarity
1
text analytics
1
alignment
1
kwic
1
word segmentation
1
text-analysis
1
yapf
1
pre-commit-hook
1
gofmt
1
formatter
1
codeformatter
1
code
1
autopep8
1
visualization
1
analysis
1
query
1
toolkit
1
utf
1
Cantonese
1
unsupervised-learning
1
linguistica
1
data visualization
1
Natural Language Processing
1
COVID-19 Publications
1
vert
1
collocation
1
universal-dependencies
1
conllu
1
datasets
1
russian
1
typology
1
linguistics-databases
1
language-acquisition
1
databases
1
cross-linguistic-data
1
child-language
1
data
1
reading
1
processing
1
VERT
1
TEI
1
CONLL
1
passwords
1
machine-learning
1
language-model
1
corporate
1
mmt
1
opus_api
1
control
1
version
1
Jyutping
1
cantonese
1
jyutping
1