Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "corpus-tools" keyword

ms3 2.4.4
A parser for MuseScore files, serving as data factory for annotated music corpora.
56 versions - Latest release: 4 months ago - 2 dependent packages - 1 dependent repositories - 384 downloads last month - 28 stars on GitHub - 1 maintainer
Top 5.5% on pypi.org
simplemma 0.9.1
A simple multilingual lemmatizer for Python.
14 versions - Latest release: over 1 year ago - 6 dependent packages - 25 dependent repositories - 10.8 thousand downloads last month - 129 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
trafilatura 1.9.0 💰
Python package and command-line tool designed to gather text on the Web, includes all necessary d...
44 versions - Latest release: 21 days ago - 71 dependent packages - 63 dependent repositories - 484 thousand downloads last month - 2,688 stars on GitHub - 1 maintainer
mfte 1.6.1
MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE writt...
3 versions - Latest release: 5 months ago - 22 downloads last month - 14 stars on GitHub - 1 maintainer
ua-gec 2.1.3
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian language
9 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repositories - 166 downloads last month - 254 stars on GitHub - 1 maintainer
opusfilter 3.0.0
Toolbox for filtering parallel corpora
14 versions - Latest release: 7 months ago - 1 dependent package - 2 dependent repositories - 171 downloads last month - 92 stars on GitHub - 2 maintainers
openpecha 0.11.10
OpenPecha Toolkit allows state of the art for distributed standoff annotations on moving texts
218 versions - Latest release: 3 months ago - 1 dependent package - 4 dependent repositories - 1.75 thousand downloads last month - 7 stars on GitHub - 2 maintainers
lyricscorpora 1.0.0
Lyrics API
10 versions - Latest release: about 6 years ago - 1 dependent repositories - 17 downloads last month - 18 stars on GitHub - 1 maintainer
concordancer 0.1.14
Extract concordance lines from corpus with CQL
15 versions - Latest release: over 2 years ago - 1 dependent repositories - 101 downloads last month - 17 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
align 0.1.1
Analyzing Linguistic Interaction with Generalizable techNiques. Read the latest ALIGN tutorials.
10 versions - Latest release: over 1 year ago - 8 dependent repositories - 538 downloads last month - 38 stars on GitHub - 2 maintainers
korp 1.0.3 💰
Korp API library for Python
4 versions - Latest release: over 5 years ago - 1 dependent repositories - 36 downloads last month - 8 stars on GitHub - 1 maintainer
audiomate 6.0.0
Audiomate is a library for working with audio datasets.
10 versions - Latest release: almost 4 years ago - 3 dependent repositories - 185 downloads last month - 128 stars on GitHub - 1 maintainer
corpus-similarity 1.0.1
Measuring corpus similarity in Python
3 versions - Latest release: almost 3 years ago - 1 dependent repositories - 41 downloads last month - 10 stars on GitHub - 1 maintainer
Related Keywords
corpus 6 python 4 corpus-linguistics 4 natural-language-processing 4 nlp 4 corpus-processing 3 corpus-data 2 dataset 2 corpora 2 music 2 similarity 1 scrapper 1 scraping-websites 1 python-api 1 billboard-charts 1 artists 1 scrape 1 songs 1 billboard 1 LyricWikia 1 lyrics 1 layered-text 1 annotations 1 parallel-corpus 1 machine-translation 1 ukrainian-language 1 nlp-datasets 1 grammatical-error-correction 1 grammarly 1 correction 1 error 1 grammatical 1 text 1 ukrainian 1 gec 1 computational linguistics 1 natural language processing 1 text analytics 1 speech-recognition 1 speech 1 noise 1 dataset-manager 1 dataset-filtering 1 dataset-creation 1 data-loader 1 audio-datasets 1 sound 1 audio 1 language 1 korp-api 1 API 1 Korp 1 notebooks 1 word2vec 1 nltk 1 text-analysis 1 ngram-analysis 1 conversation-analysis 1 linguistic-alignment 1 linguistic-analysis 1 python3 1 corpus-query-language 1 concordancer 1 html2text 1 wordlist 1 morphological-analysis 1 low-resource-nlp 1 lemmatizer 1 language-identification 1 language-detection 1 tokenizer 1 tokenization 1 lemmatiser 1 lemmatisation 1 lemmatization 1 xml-parsing 1 xml-parser-library 1 xml-parser 1 tsv-format 1 tsv-files 1 tsv 1 sheet-music-parser 1 sheet-music 1 parser 1 music-scores 1 music-score 1 musescore4 1 musescore3 1 musescore2 1 musescore 1 corpus-generator 1 multivariate-analysis 1 multidimensional-analysis 1 gui 1 corpus-analysis 1 Multifeature tagging 1 Register variation 1 MD analysis 1 Multidimensional analysis 1 Grammatical tagging 1