Ecosyste.ms: Packages
An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.
pypi.org "nlp-datasets" keyword
nlpaeg 0.1.1
Artificial Error Generation (AEG) for Natural Language Processing
11 versions - Latest release: over 3 years ago - 1 dependent repository - 65 downloads last month - 1 star on GitHub - 1 maintainer
pylines 0.0.4
Work with large JSON Lines files with ease
4 versions - Latest release: over 2 years ago - 1 dependent repository - 46 downloads last month - 11 stars on GitHub - 1 maintainer
ua-gec 2.1.3
UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian language
9 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repository - 166 downloads last month - 254 stars on GitHub - 1 maintainer
webcorpus 0.2
Generate large textual corpora for almost any language by crawling the web
1 version - Latest release: about 3 years ago - 1 dependent repository - 21 downloads last month - 7 stars on GitHub - 1 maintainer
texttaglib 0.1.1
Python library for managing and annotating text corpora in different formats (ELAN, TIG, TTL, et...
13 versions - Latest release: about 3 years ago - 3 dependent repositories - 61 downloads last month - 0 stars on GitHub - 1 maintainer
scpscraper 1.0.1
A Python library designed for scraping data from the SCP wiki.
21 versions - Latest release: over 3 years ago - 1 dependent repository - 130 downloads last month - 13 stars on GitHub - 1 maintainer
russian-names 0.1.2
Russian names generator
1 version - Latest release: about 5 years ago - 2 dependent repositories - 1.33 thousand downloads last month - 24 stars on GitHub - 1 maintainer
ua-datasets 0.1.1
A collection of Ukrainian language datasets
11 versions - Latest release: 7 months ago - 1 dependent repository - 43 downloads last month - 49 stars on GitHub - 2 maintainers
Related Keywords
nlp (4)
dataset (4)
corpus (3)
pypi (2)
natural-language-processing (2)
ukrainian-language (2)
pypi-package (1)
nlp-dataset-creation (1)
dataset-generation (1)
dataset-creation (1)
data-collection (1)
tensorflow (1)
webscraper (1)
foundation (1)
scp (1)
elan (1)
annotations (1)
python (1)
python3 (1)
scp-foundation (1)
training-data-generation (1)
webscraping (1)
russian (1)
names (1)
generator (1)
text-generation (1)
text-processing (1)
ua-datasets (1)
question-answering (1)
text-classification (1)
token-classification (1)
artificial-error-generation (1)
grammatical-error-detection (1)
nlp-grammar (1)
nlp-machine-learning (1)
json (1)
json lines (1)
jsonlines (1)
jsonlines-data (1)
gec (1)
ukrainian (1)
grammatical (1)
error (1)
correction (1)
grammarly (1)
corpus-data (1)
corpus-tools (1)
grammatical-error-correction (1)
datasets (1)
indic-languages (1)
multilingual (1)
news-crawler (1)
annotation (1)
text (1)
linguistics (1)
ELAN (1)
transcription (1)