An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "sentencepiece" keyword

View the packages on the pypi.org package registry that are tagged with the "sentencepiece" keyword.

Top 1.7% on pypi.org
konoha 5.5.6 💰
Add your description here
28 versions - Latest release: 11 months ago - 3 dependent packages - 134 dependent repositories - 101 thousand downloads last month - 241 stars on GitHub - 1 maintainer
kitoken 0.10.1 💰
Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization
2 versions - Latest release: 4 months ago - 533 downloads last month - 16 stars on GitHub - 1 maintainer
escape-unk 1.0.1
Escape unknown symbols in SentecePiece vocabularies
8 versions - Latest release: over 2 years ago - 315 downloads last month - 0 stars on GitHub - 2 maintainers
Top 2.4% on pypi.org
tf-sentencepiece 0.1.92
SentencePiece Encode/Decode ops for TensorFlow
15 versions - Latest release: almost 5 years ago - 1 dependent package - 30 dependent repositories - 5.39 thousand downloads last month - 9,462 stars on GitHub - 1 maintainer
Top 3.6% on pypi.org
pyonmttok 1.37.1
Fast and customizable text tokenization library with BPE and SentencePiece support
66 versions - Latest release: about 2 years ago - 3 dependent packages - 103 dependent repositories - 28.8 thousand downloads last month - 302 stars on GitHub - 4 maintainers
Top 8.0% on pypi.org
tiny-tokenizer 3.4.0 💰
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with...
19 versions - Latest release: over 4 years ago - 16 dependent repositories - 428 downloads last month - 214 stars on GitHub - 1 maintainer
rs-bpe 0.1.0
A ridiculously fast Python BPE (Byte Pair Encoder) implementation written in Rust
1 version - Latest release: about 1 month ago - 1.91 thousand downloads last month - 1 stars on GitHub - 1 maintainer
nepalitokenizers 0.0.2
Pre-trained Tokenizers for the Nepali language with an interface to HuggingFace's tokenizers libr...
2 versions - Latest release: almost 2 years ago - 123 downloads last month - 2 stars on GitHub - 1 maintainer