Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

Top 4.2% forks on rubygems.org

rubygems.org : wp2txt

WP2TXT extracts text and category data from Wikipedia dump files (encoded in XML / compressed with Bzip2), removing MediaWiki markup and other metadata.

Registry - Source - Documentation - JSON
purl: pkg:gem/wp2txt
Keywords: corpus, machine-learning, nlp, ruby, wikipedia, wikipedia-dump
License: MIT
Latest release: about 1 year ago
First release: over 11 years ago
Dependent repositories: 2
Downloads: 64,326 total
Stars: 167 on GitHub
Forks: 39 on GitHub
Total Commits: 130
Committers: 6
Average commits per author: 21.667
Development Distribution Score (DDS): 0.054
More commit stats: commits.ecosyste.ms
See more repository details: repos.ecosyste.ms
Last synced: 23 days ago

    Loading...
    Readme
    Loading...