Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

crates.io "corpus" keyword

corpus-preproc 0.1.0
A preprocessor for text and HTML corpora
1 version - Latest release: over 2 years ago - 487 downloads total - 2 stars on GitHub - 1 maintainer
opus-parse 0.0.3
Library to parse OPUS
3 versions - Latest release: over 6 years ago - 1.97 thousand downloads total - 1 star on GitHub - 1 maintainer
ptb-reader 0.9.1
Simple parsing of the merged Penn Treebank format.
10 versions - Latest release: about 7 years ago - 6.63 thousand downloads total - 2 stars on GitHub - 1 maintainer
corpus-count 0.1.1
Util to count words and character ngrams in a corpus.
2 versions - Latest release: over 4 years ago - 1.17 thousand downloads total - 0 stars on GitHub - 1 maintainer
tanaka 0.1.0
A Rust interface to the Tanaka Corpus of parallel Japanese-English sentences
1 version - Latest release: 6 months ago - 298 downloads total - 0 stars on GitLab.com - 1 maintainer
conllx 0.12.1
Readers/writers for the CoNLL-X dependency format
25 versions - Latest release: over 4 years ago - 7 dependent packages - 4 dependent repositories - 30.4 thousand downloads total - 7 stars on GitHub - 1 maintainer