Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "word-segmentation" keyword
adaseq 0.6.6
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
Top 9.0% on pypi.org - 11 versions - Latest release: 7 months ago - 1 dependent repository - 334 downloads last month - 367 stars on GitHub - 1 maintainer
sentencepiece 0.2.0
SentencePiece python wrapper
Top 0.5% on pypi.org - 32 versions - Latest release: 4 months ago - 802 dependent packages - 18,074 dependent repositories - 20.8 million downloads last month - 9,462 stars on GitHub - 1 maintainer
hellonlp 0.2.41
NLP tools
31 versions - Latest release: over 2 years ago - 1 dependent repository - 59 downloads last month - 22 stars on GitHub - 1 maintainer
cjieba 0.4.4 💰
Python cffi binding for cjieba
11 versions - Latest release: over 3 years ago - 1 dependent package - 1 dependent repository - 204 downloads last month - 15 stars on GitHub - 1 maintainer
toiro 0.0.9
A comparison tool of Japanese tokenizers
8 versions - Latest release: 11 months ago - 1 dependent repository - 103 downloads last month - 111 stars on GitHub - 1 maintainer
trtokenizer 0.0.3
Sentence and word tokenizers for the Turkish language
3 versions - Latest release: almost 3 years ago - 2 dependent repositories - 222 downloads last month - 20 stars on GitHub - 1 maintainer
pytorch-nlu 0.0.2
Pytorch-NLU
2 versions - Latest release: over 2 years ago - 1 dependent repository - 25 downloads last month - 291 stars on GitHub - 1 maintainer
simjb 0.2.0
A simple version of jieba.
3 versions - Latest release: almost 2 years ago - 1 dependent repository - 19 downloads last month - 3 stars on GitHub - 1 maintainer
bi-lstm-crf 0.2.1
A PyTorch implementation of the BI-LSTM-CRF model
5 versions - Latest release: over 3 years ago - 1 dependent repository - 33 downloads last month - 224 stars on GitHub - 1 maintainer
iparser 0.1.8
Integrated and Industrial Strength Dependency Parser
1 version - Latest release: about 6 years ago - 1 dependent repository - 14 downloads last month - 10 stars on GitHub - 1 maintainer
wordseg 0.0.5 💰
Word segmentation models
5 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 1.54 thousand downloads last month - 3 stars on GitHub - 1 maintainer
m3tl 0.7.0
BERT for Multi-task Learning
1 version - Latest release: over 2 years ago - 10 downloads last month - 544 stars on GitHub - 1 maintainer
bert-multitask-learning 0.7.0
BERT for Multi-task Learning
64 versions - Latest release: over 3 years ago - 320 downloads last month - 544 stars on GitHub - 1 maintainer
rakutenma 0.3.3 💰
morphological analyzer (word segmentor + PoS Tagger) for Chinese and Japanese
5 versions - Latest release: about 7 years ago - 4 dependent repositories - 26 downloads last month - 20 stars on GitHub - 1 maintainer
vncorenlp 1.0.3
A Python wrapper for VnCoreNLP using a bidirectional communication channel.
Top 5.1% on pypi.org - 2 versions - Latest release: almost 6 years ago - 4 dependent packages - 31 dependent repositories - 3.22 thousand downloads last month - 55 stars on GitHub - 1 maintainer
giganticode-dataprep 1.0.0a12
A toolkit for pre-processing large source code corpora
6 versions - Latest release: over 4 years ago - 1 dependent repository - 19 downloads last month - 45 stars on GitHub - 1 maintainer
pythainlp 5.0.3
Thai Natural Language Processing library
Top 1.9% on pypi.org - 108 versions - Latest release: about 1 month ago - 37 dependent packages - 183 dependent repositories - 224 thousand downloads last month - 926 stars on GitHub - 2 maintainers
thainlp 0.4.2
Thai NLP library
3 versions - Latest release: about 5 years ago - 1 dependent repository - 28 downloads last month - 926 stars on GitHub - 1 maintainer
nagisa 0.2.11
A Japanese tokenizer based on recurrent neural networks
Top 3.3% on pypi.org - 25 versions - Latest release: 4 months ago - 5 dependent packages - 28 dependent repositories - 261 thousand downloads last month - 371 stars on GitHub - 1 maintainer
ckip-transformers 0.3.4
CKIP Transformers
Top 4.4% on pypi.org - 16 versions - Latest release: about 1 year ago - 2 dependent packages - 14 dependent repositories - 3.77 thousand downloads last month - 649 stars on GitHub - 1 maintainer
ckipnlp 1.0.3
CKIP CoreNLP
25 versions - Latest release: about 1 year ago - 1 dependent repository - 186 downloads last month - 113 stars on GitHub - 1 maintainer
fidel 0.0.10 💰
Python package that can change Amharic language that written in English alphabet to Amharic alpha...
10 versions - Latest release: 11 months ago - 1 dependent repository - 49 downloads last month - 772 stars on GitHub - 1 maintainer
giganticode-codeprep 1.0.0
A toolkit for pre-processing large source code corpora
1 version - Latest release: over 4 years ago - 1 dependent repository - 8 downloads last month - 45 stars on GitHub - 1 maintainer
symspellpy 6.7.7 💰
Python SymSpell
Top 2.4% on pypi.org - 22 versions - Latest release: over 1 year ago - 13 dependent packages - 118 dependent repositories - 641 thousand downloads last month - 766 stars on GitHub - 1 maintainer
codeprep 1.0.5
A toolkit for pre-processing large source code corpora
4 versions - Latest release: about 3 years ago - 1 dependent repository - 52 downloads last month - 45 stars on GitHub - 1 maintainer
vistickedword 0.9.5
Library to split sticked Vietnamese words
7 versions - Latest release: almost 4 years ago - 2 dependent repositories - 59 downloads last month - 2 stars on GitHub - 1 maintainer
vgram 0.4.2
V-gram builder library
15 versions - Latest release: almost 5 years ago - 1 dependent repository - 54 downloads last month - 7 stars on GitHub - 1 maintainer
pycantonese 3.4.0 💰
Cantonese Linguistics and NLP in Python
Top 6.6% on pypi.org - 24 versions - Latest release: over 2 years ago - 1 dependent package - 6 dependent repositories - 1.5 thousand downloads last month - 335 stars on GitHub - 1 maintainer
tf-sentencepiece 0.1.92
SentencePiece Encode/Decode ops for TensorFlow
Top 2.4% on pypi.org - 15 versions - Latest release: about 4 years ago - 1 dependent package - 30 dependent repositories - 5.79 thousand downloads last month - 9,462 stars on GitHub - 1 maintainer
nokcut 0.4 💰
Thai Word Segmentation using TCC + Bidirectional RNNs
1 version - Latest release: over 5 years ago - 1 dependent repository - 13 downloads last month - 8 stars on GitHub - 1 maintainer
ckip-classic 1.2.3
CKIP Classic NLP Tools
12 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 61 downloads last month - 8 stars on GitHub - 1 maintainer
python-vncorenlp 0.1.8
python_vncorenlp
9 versions - Latest release: almost 4 years ago - 1 dependent repository - 67 downloads last month - 2 stars on GitHub - 1 maintainer
python-rdrsegmenter 0.1.1
python_rdrsegmenter
2 versions - Latest release: almost 4 years ago - 1 dependent repository - 96 downloads last month - 1 star on GitHub - 1 maintainer
monpa 0.3.3
MONPA is an end-to-end model to jointly conduct Chinese word segmentation, POS and NE labeling
Top 9.9% on pypi.org - 10 versions - Latest release: almost 2 years ago - 2 dependent repositories - 1.78 thousand downloads last month - 244 stars on GitHub - 2 maintainers
ekphrasis 0.5.4
Text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekph...
Top 4.5% on pypi.org - 54 versions - Latest release: about 2 years ago - 48 dependent repositories - 1.71 thousand downloads last month - 656 stars on GitHub - 1 maintainer
hanlperceptron 0.2.0
Native Python HanLP Perceptron Model: HanLPerceptron
3 versions - Latest release: over 2 years ago - 1 dependent repository - 51 downloads last month - 7 stars on GitHub - 1 maintainer
youtokentome 1.0.6
Unsupervised text tokenizer focused on computational efficiency
Top 2.2% on pypi.org - 8 versions - Latest release: over 4 years ago - 8 dependent packages - 228 dependent repositories - 43.4 thousand downloads last month - 941 stars on GitHub - 3 maintainers
kiwipiepy 0.17.1 💰
Kiwi, the Korean Tokenizer for Python
Top 5.0% on pypi.org - 41 versions - Latest release: about 2 months ago - 6 dependent packages - 10 dependent repositories - 27.8 thousand downloads last month - 189 stars on GitHub - 1 maintainer
politely 4.1.0 💰
An explainable styler for the Korean language
22 versions - Latest release: 2 months ago - 200 downloads last month - 223 stars on GitHub - 1 maintainer
mecab-python-windows 0.996.3 (removed) 💰
Python wrapper for CaboCha: Japanese Dependency Structure Analyzer
Top 5.2% on pypi.org - 3 versions - Latest release: over 5 years ago - 19 dependent repositories - 192 stars on GitHub
mecab 0.996.3 💰
MeCab binding for many OSs (Windows, macOS, and Linux)
Top 4.8% on pypi.org - 3 versions - Latest release: about 3 years ago - 3 dependent packages - 27 dependent repositories - 3.87 thousand downloads last month - 228 stars on GitHub - 1 maintainer
lac 2.1.2
A chinese lexical analysis tool by Baidu NLP.
Top 3.1% on pypi.org - 15 versions - Latest release: about 3 years ago - 3 dependent packages - 21 dependent repositories - 1.45 thousand downloads last month - 3,765 stars on GitHub - 1 maintainer
kiwipiepy-model 0.17.0 💰
Model for kiwipiepy
Top 5.7% on pypi.org - 9 versions - Latest release: 3 months ago - 1 dependent package - 3 dependent repositories - 18.5 thousand downloads last month - 189 stars on GitHub - 1 maintainer
symspellcpppy 0.0.18
A Fast SymSpell port for python written in C++ using pybind11.
18 versions - Latest release: about 1 year ago - 1 dependent repository - 133 downloads last month - 38 stars on GitHub - 1 maintainer
vietokenizer 1.0.3
Vietnamese Tokenizer package based on deep learning method
3 versions - Latest release: over 1 year ago - 49 downloads last month - 2 stars on GitHub - 1 maintainer
nlpo3 1.3.0
Python binding for nlpO3 Thai language processing library in Rust
Top 9.3% on pypi.org - 9 versions - Latest release: about 1 year ago - 1 dependent package - 3 dependent repositories - 1.08 thousand downloads last month - 30 stars on GitHub - 2 maintainers
Related Keywords
nlp: 25
natural-language-processing: 17
python: 11
named-entity-recognition: 10
pos-tagging: 8
nlp-library: 7
NLP: 7
tokenizer: 7
ner: 7
chinese-word-segmentation: 6
bert: 6
morphological-analysis: 5
thai-language: 4
sequence-labeling: 4
spelling-correction: 4
corpus: 4
machine: 4
text-classification: 4
vietnamese-nlp: 4
learning: 4
natural language processing: 4
text-segmentation: 4
parser: 4
morphological: 3
source-code-analysis: 3
hacktoberfest: 3
pretrained-models: 3
chinese-text-segmentation: 3
fuzzy-search: 3
mining-software-repositories: 3
language-modeling: 3
pre-processing: 3
dependency-parser: 3
part-of-speech-tagger: 3
computational linguistics: 3
code: 3
source: 3
data: 3
large: 3
big: 3
fuzzy-matching: 3
spell-check: 3
pytorch: 3
spellcheck: 3
part-of-speech-tagging: 3
ckip: 3
spelling: 3
symspell: 3
thai-nlp: 3
python-library: 3
chinese-nlp: 3
korean-tokenizer: 3
korean-nlp: 3
korean: 3
analysis: 3
thai: 3
vncorenlp: 2
python-vncorenlp: 2
pos-tagger: 2
chinese: 2
levenshtein-distance: 2
levenshtein: 2
edit-distance: 2
damerau-levenshtein: 2
approximate-string-matching: 2
sentence-parsing: 2
thai-soundex: 2
thai-nlp-library: 2
pythainlp: 2
tensorflow: 2
hacktoberfest-accepted: 2
soundex: 2
transformers: 2
sentence-segmentation: 2
Korean: 2
neural-machine-translation: 2
mecab: 2
japanese: 2
tokenization: 2
text-processing: 2
crf: 2
word segmentation: 2
linguistics: 2
corpora: 2
jieba: 2
jieba-chinese: 2
transformer: 2
part-of-speech: 2
multitask-learning: 2
multi-task-learning: 2
encoder-decoder: 2
cws: 2
computational-linguistics: 2
language: 2
speech: 2
vgram: 1
sequential-data: 1
feature-extraction: 1
Chinese: 1
Cantonese: 1