Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "tokenisation" keyword
example990420 1.1.2
Taiwanese Hokkien Transliterator and Tokeniser9 versions - Latest release: 9 days ago - 307 downloads last month - 10 stars on GitHub - 1 maintainer
taibun 1.1.2
Taiwanese Hokkien Transliterator and Tokeniser10 versions - Latest release: 9 days ago - 419 downloads last month - 10 stars on GitHub - 1 maintainer
Top 9.8% on pypi.org
15 versions - Latest release: 9 months ago - 2 dependent packages - 1 dependent repositories - 1.4 thousand downloads last month - 485 stars on GitHub - 1 maintainer
tokenmonster 1.1.12
Tokenize and decode text with TokenMonster vocabularies.15 versions - Latest release: 9 months ago - 2 dependent packages - 1 dependent repositories - 1.4 thousand downloads last month - 485 stars on GitHub - 1 maintainer
python-ucto 0.6.7
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost a...22 versions - Latest release: 7 months ago - 1 dependent package - 4 dependent repositories - 856 downloads last month - 29 stars on GitHub - 1 maintainer
omorfi 0.9.10 💰
Open morphology for Finnish, python bindings4 versions - Latest release: 4 months ago - 1 dependent repositories - 50 downloads last month - 82 stars on GitHub - 1 maintainer
ffast 0.4.10
FFAST: Fast Fourier Analysis for Sentence embeddings and Tokenisation50 versions - Latest release: over 1 year ago - 1 dependent repositories - 10 downloads last month - 1 maintainer
Related Keywords
python
5
nlp
5
tokenization
4
tokenizer
4
tokeniser
3
nlp-library
3
zhuyin
2
tl
2
romanisation
2
poj
2
natural-language-processing
2
transliterator
2
transliteration
2
romanization
2
hokkien
2
taigi
2
taiwanese
2
taiwan
2
morphology
1
analysis
1
morphological-analysis
1
python-bindings
1
spell-check
1
embedding
1
fast fourier
1
nlu
1
poincare
1
wordnet
1
lite
1
fast
1
sentence encoder
1
finnish
1
text-processing
1
folia
1
computational-linguistics
1
ucto
1
computational_linguistics
1
vocabulary-generator
1
vocabulary-builder
1
vocabulary
1
tokenizing
1
tokenize
1
text-tokenization
1