Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "Tokenizer" keyword

bleuscore 0.1.2
A fast bleu score calculator
4 versions - Latest release: 29 days ago - 601 downloads last month - 0 stars on GitHub - 1 maintainer
zicutter 0.0.10
ZiCutter: cut character smaller
11 versions - Latest release: over 1 year ago - 108 downloads last month - 1 stars on GitHub - 1 maintainer
unicodetokenizer 0.2.2
UnicodeTokenizer: tokenize all Unicode text
25 versions - Latest release: 7 months ago - 270 downloads last month - 0 stars on GitHub - 1 maintainer
sentencex 0.6.1
Sentence segmenter that supports ~300 languages
7 versions - Latest release: 7 months ago - 615 downloads last month - 21 stars on GitHub - 1 maintainer
pykomoran 0.1.6
PyKomoran is Python wrapper for KOMORAN, KOrean MORphical ANalyzer.
7 versions - Latest release: about 3 years ago - 1 dependent repositories - 290 downloads last month - 41 stars on GitHub - 1 maintainer
texo 0.0.4
Sentiment Analysis Multiple language and for all products
4 versions - Latest release: 11 months ago - 46 downloads last month - 1 stars on GitHub - 1 maintainer
zh-sentence 0.0.5
Light-weight sentence tokenizer for Chinese languages.
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 167 downloads last month - 1 stars on GitHub - 1 maintainer
pytokenizer 1.1.4
A streaming tokenizer.
6 versions - Latest release: over 3 years ago - 1 dependent repositories - 72 downloads last month - 0 stars on GitHub - 1 maintainer
kr-sentence 0.0.3
Light-weight sentence tokenizer for Korean.
3 versions - Latest release: over 2 years ago - 1 dependent repositories - 112 downloads last month - 0 stars on GitHub - 1 maintainer
ja-sentence 0.0.5
Light-weight sentence tokenizer for Japanese.
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 50 downloads last month - 1 stars on GitHub - 1 maintainer
atma 0.4.0
Commonly-used & tested NLP tools, include bleu, tokenizer and so on
1 version - Latest release: about 7 years ago - 1 dependent repositories - 13 downloads last month - 6 stars on GitHub - 1 maintainer
sumire 1.0.2
Scikit-learn compatible Japanese text vectorizer for CPU-based Japanese natural language processing.
2 versions - Latest release: 4 months ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
zitokenizer 0.0.8
ZiTokenizer: tokenize world text as Zi
8 versions - Latest release: about 1 year ago - 74 downloads last month - 1 stars on GitHub - 1 maintainer