Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org : webcorpus

Generate large textual corpora for almost any language by crawling the web

Registry - Source - Documentation - JSON
purl: pkg:pypi/webcorpus
Keywords: dataset, corpus, datasets, indic-languages, multilingual, news-crawler, nlp, nlp-datasets
License: GPL-2.0,GPL-2.0+
Latest release: about 3 years ago
First release: about 3 years ago
Dependent repositories: 1
Downloads: 21 last month
Stars: 7 on GitHub
Forks: 8 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 17 days ago

    Loading...
    Readme
    Loading...