pypi.org "word-segmentation" keyword
View the packages on the pypi.org package registry that are tagged with the "word-segmentation" keyword.
Top 4.4% on pypi.org
16 versions - Latest release: over 2 years ago - 2 dependent packages - 14 dependent repositories - 2.19 thousand downloads last month - 730 stars on GitHub - 1 maintainer
ckip-transformers 0.3.4
CKIP Transformers16 versions - Latest release: over 2 years ago - 2 dependent packages - 14 dependent repositories - 2.19 thousand downloads last month - 730 stars on GitHub - 1 maintainer
Top 2.4% on pypi.org
15 versions - Latest release: about 5 years ago - 1 dependent package - 30 dependent repositories - 1.22 thousand downloads last month - 9,462 stars on GitHub - 1 maintainer
tf-sentencepiece 0.1.92
SentencePiece Encode/Decode ops for TensorFlow15 versions - Latest release: about 5 years ago - 1 dependent package - 30 dependent repositories - 1.22 thousand downloads last month - 9,462 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
114 versions - Latest release: 4 months ago - 37 dependent packages - 183 dependent repositories - 611 thousand downloads last month - 1,065 stars on GitHub - 2 maintainers
pythainlp 5.1.2
Thai Natural Language Processing library114 versions - Latest release: 4 months ago - 37 dependent packages - 183 dependent repositories - 611 thousand downloads last month - 1,065 stars on GitHub - 2 maintainers
thainlp 0.4.2
Thai NLP library3 versions - Latest release: over 6 years ago - 1 dependent repositories - 92 downloads last month - 1,065 stars on GitHub - 1 maintainer
ckip-classic 1.2.3
CKIP Classic NLP Tools12 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 50 downloads last month - 8 stars on GitHub - 1 maintainer
bert-multitask-learning 0.7.0
BERT for Multi-task Learning64 versions - Latest release: over 4 years ago - 563 downloads last month - 549 stars on GitHub - 1 maintainer
Top 0.5% on pypi.org
33 versions - Latest release: 26 days ago - 802 dependent packages - 18,074 dependent repositories - 30.5 million downloads last month - 9,462 stars on GitHub - 1 maintainer
sentencepiece 0.2.1
Unsupervised text tokenizer and detokenizer.33 versions - Latest release: 26 days ago - 802 dependent packages - 18,074 dependent repositories - 30.5 million downloads last month - 9,462 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
10 versions - Latest release: 10 months ago - 1 dependent package - 3 dependent repositories - 2.19 thousand downloads last month - 35 stars on GitHub - 1 maintainer
nlpo3 1.3.1
Python binding for nlpO3 Thai language processing library in Rust10 versions - Latest release: 10 months ago - 1 dependent package - 3 dependent repositories - 2.19 thousand downloads last month - 35 stars on GitHub - 1 maintainer
giganticode-dataprep 1.0.0a12
A toolkit for pre-processing large source code corpora6 versions - Latest release: over 5 years ago - 1 dependent repositories - 18 downloads last month - 47 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
11 versions - Latest release: almost 2 years ago - 1 dependent repositories - 2.98 thousand downloads last month - 389 stars on GitHub - 1 maintainer
adaseq 0.6.6
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models11 versions - Latest release: almost 2 years ago - 1 dependent repositories - 2.98 thousand downloads last month - 389 stars on GitHub - 1 maintainer
nokcut 0.4 💰
Thai Word Segmentation using TCC + Bidirectional RNNs1 version - Latest release: over 6 years ago - 1 dependent repositories - 6 downloads last month - 8 stars on GitHub - 1 maintainer
trtokenizer 0.0.3
Sentence and word tokenizers for the Turkish language3 versions - Latest release: about 4 years ago - 2 dependent repositories - 112 downloads last month - 20 stars on GitHub - 1 maintainer
toiro 0.0.9
A comparison tool of Japanese tokenizers8 versions - Latest release: about 2 years ago - 1 dependent repositories - 42 downloads last month - 112 stars on GitHub - 1 maintainer
vietokenizer 1.0.3
Vietnamese Tokenizer package based on deep learning method3 versions - Latest release: almost 3 years ago - 27 downloads last month - 2 stars on GitHub - 1 maintainer
python-vncorenlp 0.1.8
python_vncorenlp9 versions - Latest release: about 5 years ago - 1 dependent repositories - 27 downloads last month - 2 stars on GitHub - 1 maintainer
Top 3.1% on pypi.org
15 versions - Latest release: over 4 years ago - 3 dependent packages - 21 dependent repositories - 712 downloads last month - 3,965 stars on GitHub - 1 maintainer
lac 2.1.2
A chinese lexical analysis tool by Baidu NLP.15 versions - Latest release: over 4 years ago - 3 dependent packages - 21 dependent repositories - 712 downloads last month - 3,965 stars on GitHub - 1 maintainer
vgram 0.4.2
V-gram builder library15 versions - Latest release: about 6 years ago - 1 dependent repositories - 38 downloads last month - 7 stars on GitHub - 1 maintainer
pytorch-nlu 0.0.2
Pytorch-NLU2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 8 downloads last month - 349 stars on GitHub - 1 maintainer
cjieba 0.4.4 💰
Python cffi binding for cjieba11 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repositories - 96 downloads last month - 15 stars on GitHub - 1 maintainer
ckipnlp 1.0.3
CKIP CoreNLP25 versions - Latest release: over 2 years ago - 1 dependent repositories - 101 downloads last month - 124 stars on GitHub - 1 maintainer
wordseg 0.0.5 💰
Word segmentation models5 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 4.63 thousand downloads last month - 3 stars on GitHub - 1 maintainer
hellonlp 0.2.41
NLP tools31 versions - Latest release: almost 4 years ago - 1 dependent repositories - 13 downloads last month - 25 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
10 versions - Latest release: about 3 years ago - 2 dependent repositories - 220 downloads last month - 246 stars on GitHub - 2 maintainers
monpa 0.3.3
MONPA is an end-to-end model to jointly conduct Chinese word segmentation, POS and NE labeling10 versions - Latest release: about 3 years ago - 2 dependent repositories - 220 downloads last month - 246 stars on GitHub - 2 maintainers
khmersegment 0.1.2
A Khmer word segmentation tool built for NIPTICT Khmer Word Segmentation CRF model.3 versions - Latest release: over 1 year ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
kitoken 0.10.1 💰
Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization2 versions - Latest release: 9 months ago - 95 downloads last month - 29 stars on GitHub - 1 maintainer
rakutenma 0.3.3 💰
morphological analyzer (word segmentor + PoS Tagger) for Chinese and Japanese5 versions - Latest release: over 8 years ago - 4 dependent repositories - 8 downloads last month - 20 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
54 versions - Latest release: over 3 years ago - 48 dependent repositories - 1.9 thousand downloads last month - 670 stars on GitHub - 1 maintainer
ekphrasis 0.5.4
Text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekph...54 versions - Latest release: over 3 years ago - 48 dependent repositories - 1.9 thousand downloads last month - 670 stars on GitHub - 1 maintainer
speliuk 0.0.2
Speliuk is a spell checker for the Ukrainian language based on SymSpell and Language Models.2 versions - Latest release: 12 months ago - 110 downloads last month - 3,226 stars on GitHub - 1 maintainer
python-rdrsegmenter 0.1.1
python_rdrsegmenter2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 56 downloads last month - 1 stars on GitHub - 1 maintainer
Top 5.1% on pypi.org
2 versions - Latest release: about 7 years ago - 4 dependent packages - 31 dependent repositories - 2.03 thousand downloads last month - 56 stars on GitHub - 1 maintainer
vncorenlp 1.0.3
A Python wrapper for VnCoreNLP using a bidirectional communication channel.2 versions - Latest release: about 7 years ago - 4 dependent packages - 31 dependent repositories - 2.03 thousand downloads last month - 56 stars on GitHub - 1 maintainer
simjb 0.2.0
A simple version of jieba.3 versions - Latest release: about 3 years ago - 1 dependent repositories - 20 downloads last month - 4 stars on GitHub - 1 maintainer
giganticode-codeprep 1.0.0
A toolkit for pre-processing large source code corpora1 version - Latest release: over 5 years ago - 1 dependent repositories - 13 downloads last month - 47 stars on GitHub - 1 maintainer
thongna 0.2.4
Blazing-fast Thai text processing library powered by Rust7 versions - Latest release: about 1 year ago - 806 downloads last month - 3 stars on GitHub - 1 maintainer
iparser 0.1.8
Integrated and Industrial Strength Dependency Parser1 version - Latest release: over 7 years ago - 1 dependent repositories - 9 downloads last month - 10 stars on GitHub - 1 maintainer
Top 2.4% on pypi.org
25 versions - Latest release: 6 months ago - 13 dependent packages - 118 dependent repositories - 208 thousand downloads last month - 839 stars on GitHub - 1 maintainer
symspellpy 6.9.0 💰
Python SymSpell25 versions - Latest release: 6 months ago - 13 dependent packages - 118 dependent repositories - 208 thousand downloads last month - 839 stars on GitHub - 1 maintainer
Top 5.7% on pypi.org
13 versions - Latest release: 4 months ago - 1 dependent package - 3 dependent repositories - 66.1 thousand downloads last month - 319 stars on GitHub - 1 maintainer
kiwipiepy-model 0.21.0 💰
Model for kiwipiepy13 versions - Latest release: 4 months ago - 1 dependent package - 3 dependent repositories - 66.1 thousand downloads last month - 319 stars on GitHub - 1 maintainer
m3tl 0.7.0
BERT for Multi-task Learning1 version - Latest release: almost 4 years ago - 10 downloads last month - 549 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
51 versions - Latest release: 4 months ago - 6 dependent packages - 10 dependent repositories - 103 thousand downloads last month - 319 stars on GitHub - 1 maintainer
kiwipiepy 0.21.0 💰
Kiwi, the Korean Tokenizer for Python51 versions - Latest release: 4 months ago - 6 dependent packages - 10 dependent repositories - 103 thousand downloads last month - 319 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
24 versions - Latest release: over 3 years ago - 1 dependent package - 6 dependent repositories - 2.45 thousand downloads last month - 386 stars on GitHub - 1 maintainer
pycantonese 3.4.0 💰
Cantonese Linguistics and NLP in Python24 versions - Latest release: over 3 years ago - 1 dependent package - 6 dependent repositories - 2.45 thousand downloads last month - 386 stars on GitHub - 1 maintainer
hanlperceptron 0.2.0
Native Python HanLP Perceptron Model: HanLPerceptron3 versions - Latest release: almost 4 years ago - 1 dependent repositories - 22 downloads last month - 7 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
8 versions - Latest release: over 5 years ago - 8 dependent packages - 228 dependent repositories - 93.6 thousand downloads last month - 968 stars on GitHub - 3 maintainers
youtokentome 1.0.6
Unsupervised text tokenizer focused on computational efficiency8 versions - Latest release: over 5 years ago - 8 dependent packages - 228 dependent repositories - 93.6 thousand downloads last month - 968 stars on GitHub - 3 maintainers
politely 4.1.0 💰
An explainable styler for the Korean language22 versions - Latest release: over 1 year ago - 67 downloads last month - 312 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
25 versions - Latest release: over 1 year ago - 5 dependent packages - 28 dependent repositories - 170 thousand downloads last month - 400 stars on GitHub - 1 maintainer
nagisa 0.2.11
A Japanese tokenizer based on recurrent neural networks25 versions - Latest release: over 1 year ago - 5 dependent packages - 28 dependent repositories - 170 thousand downloads last month - 400 stars on GitHub - 1 maintainer
bi-lstm-crf 0.2.1
A PyTorch implementation of the BI-LSTM-CRF model5 versions - Latest release: almost 5 years ago - 1 dependent repositories - 610 downloads last month - 237 stars on GitHub - 1 maintainer
vistickedword 0.9.5
Library to split sticked Vietnamese words7 versions - Latest release: about 5 years ago - 2 dependent repositories - 61 downloads last month - 2 stars on GitHub - 1 maintainer
symspellcpppy 0.0.18
A Fast SymSpell port for python written in C++ using pybind11.18 versions - Latest release: over 2 years ago - 1 dependent repositories - 152 downloads last month - 43 stars on GitHub - 1 maintainer
codeprep 1.0.5
A toolkit for pre-processing large source code corpora4 versions - Latest release: over 4 years ago - 1 dependent repositories - 40 downloads last month - 47 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
3 versions - Latest release: over 6 years ago - 19 dependent repositories - 192 stars on GitHub
mecab-python-windows 0.996.3 💰
Python wrapper for CaboCha: Japanese Dependency Structure Analyzer3 versions - Latest release: over 6 years ago - 19 dependent repositories - 192 stars on GitHub
Related Keywords
nlp
27
natural-language-processing
17
python
11
named-entity-recognition
10
tokenizer
9
ner
7
pos-tagging
7
NLP
7
nlp-library
6
chinese-word-segmentation
6
bert
6
text-processing
4
thai
4
thai-language
4
parser
4
computational-linguistics
4
vietnamese-nlp
4
sequence-labeling
4
morphological-analysis
4
corpus
4
machine
4
learning
4
text-classification
4
natural language processing
4
text-segmentation
4
spelling-correction
4
chinese-nlp
3
language-modeling
3
pre-processing
3
source-code-analysis
3
mining-software-repositories
3
crf
3
pytorch
3
code
3
source
3
data
3
dependency-parser
3
analysis
3
part-of-speech-tagger
3
chinese-text-segmentation
3
morphological
3
fuzzy-matching
3
fuzzy-search
3
spell-check
3
spellcheck
3
spelling
3
symspell
3
korean
3
korean-nlp
3
korean-tokenizer
3
python-library
3
large
3
big
3
computational linguistics
3
pretrained-models
3
thai-nlp
3
hacktoberfest
3
ckip
3
part-of-speech-tagging
3
sentence-segmentation
2
jieba-chinese
2
jieba
2
Korean
2
tensorflow
2
transformers
2
soundex
2
chinese
2
thai-nlp-library
2
pos-tagger
2
thai-soundex
2
sentence-parsing
2
word segmentation
2
linguistics
2
corpora
2
speech
2
language
2
sentencepiece
2
neural-machine-translation
2
pythainlp
2
levenshtein-distance
2
bpe
2
levenshtein
2
edit-distance
2
damerau-levenshtein
2
approximate-string-matching
2
tokenization
2
cws
2
encoder-decoder
2
multi-task-learning
2
multitask-learning
2
japanese
2
part-of-speech
2
transformer
2
nodejs
2
vncorenlp
2
rust
2
python-vncorenlp
2
stop-words
1
word
1
word-split
1