An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "word-segmentation" keyword

View the packages on the pypi.org package registry that are tagged with the "word-segmentation" keyword.

Top 4.4% on pypi.org
ckip-transformers 0.3.4
CKIP Transformers
16 versions - Latest release: over 2 years ago - 2 dependent packages - 14 dependent repositories - 2.19 thousand downloads last month - 730 stars on GitHub - 1 maintainer
Top 2.4% on pypi.org
tf-sentencepiece 0.1.92
SentencePiece Encode/Decode ops for TensorFlow
15 versions - Latest release: about 5 years ago - 1 dependent package - 30 dependent repositories - 1.22 thousand downloads last month - 9,462 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
pythainlp 5.1.2
Thai Natural Language Processing library
114 versions - Latest release: 4 months ago - 37 dependent packages - 183 dependent repositories - 611 thousand downloads last month - 1,065 stars on GitHub - 2 maintainers
thainlp 0.4.2
Thai NLP library
3 versions - Latest release: over 6 years ago - 1 dependent repository - 92 downloads last month - 1,065 stars on GitHub - 1 maintainer
ckip-classic 1.2.3
CKIP Classic NLP Tools
12 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repository - 50 downloads last month - 8 stars on GitHub - 1 maintainer
bert-multitask-learning 0.7.0
BERT for Multi-task Learning
64 versions - Latest release: over 4 years ago - 563 downloads last month - 549 stars on GitHub - 1 maintainer
Top 0.5% on pypi.org
sentencepiece 0.2.1
Unsupervised text tokenizer and detokenizer.
33 versions - Latest release: 26 days ago - 802 dependent packages - 18,074 dependent repositories - 30.5 million downloads last month - 9,462 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
nlpo3 1.3.1
Python binding for nlpO3 Thai language processing library in Rust
10 versions - Latest release: 10 months ago - 1 dependent package - 3 dependent repositories - 2.19 thousand downloads last month - 35 stars on GitHub - 1 maintainer
giganticode-dataprep 1.0.0a12
A toolkit for pre-processing large source code corpora
6 versions - Latest release: over 5 years ago - 1 dependent repository - 18 downloads last month - 47 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
adaseq 0.6.6
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
11 versions - Latest release: almost 2 years ago - 1 dependent repository - 2.98 thousand downloads last month - 389 stars on GitHub - 1 maintainer
nokcut 0.4 💰
Thai Word Segmentation using TCC + Bidirectional RNNs
1 version - Latest release: over 6 years ago - 1 dependent repository - 6 downloads last month - 8 stars on GitHub - 1 maintainer
trtokenizer 0.0.3
Sentence and word tokenizers for the Turkish language
3 versions - Latest release: about 4 years ago - 2 dependent repositories - 112 downloads last month - 20 stars on GitHub - 1 maintainer
toiro 0.0.9
A comparison tool of Japanese tokenizers
8 versions - Latest release: about 2 years ago - 1 dependent repository - 42 downloads last month - 112 stars on GitHub - 1 maintainer
vietokenizer 1.0.3
Vietnamese Tokenizer package based on deep learning method
3 versions - Latest release: almost 3 years ago - 27 downloads last month - 2 stars on GitHub - 1 maintainer
python-vncorenlp 0.1.8
python_vncorenlp
9 versions - Latest release: about 5 years ago - 1 dependent repository - 27 downloads last month - 2 stars on GitHub - 1 maintainer
Top 3.1% on pypi.org
lac 2.1.2
A Chinese lexical analysis tool by Baidu NLP.
15 versions - Latest release: over 4 years ago - 3 dependent packages - 21 dependent repositories - 712 downloads last month - 3,965 stars on GitHub - 1 maintainer
vgram 0.4.2
V-gram builder library
15 versions - Latest release: about 6 years ago - 1 dependent repository - 38 downloads last month - 7 stars on GitHub - 1 maintainer
pytorch-nlu 0.0.2
Pytorch-NLU
2 versions - Latest release: almost 4 years ago - 1 dependent repository - 8 downloads last month - 349 stars on GitHub - 1 maintainer
cjieba 0.4.4 💰
Python cffi binding for cjieba
11 versions - Latest release: almost 5 years ago - 1 dependent package - 1 dependent repository - 96 downloads last month - 15 stars on GitHub - 1 maintainer
ckipnlp 1.0.3
CKIP CoreNLP
25 versions - Latest release: over 2 years ago - 1 dependent repository - 101 downloads last month - 124 stars on GitHub - 1 maintainer
wordseg 0.0.5 💰
Word segmentation models
5 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repository - 4.63 thousand downloads last month - 3 stars on GitHub - 1 maintainer
hellonlp 0.2.41
NLP tools
31 versions - Latest release: almost 4 years ago - 1 dependent repository - 13 downloads last month - 25 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
monpa 0.3.3
MONPA is an end-to-end model to jointly conduct Chinese word segmentation, POS and NE labeling
10 versions - Latest release: about 3 years ago - 2 dependent repositories - 220 downloads last month - 246 stars on GitHub - 2 maintainers
khmersegment 0.1.2
A Khmer word segmentation tool built for NIPTICT Khmer Word Segmentation CRF model.
3 versions - Latest release: over 1 year ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
kitoken 0.10.1 💰
Fast and versatile tokenizer for language models, supporting BPE, Unigram and WordPiece tokenization
2 versions - Latest release: 9 months ago - 95 downloads last month - 29 stars on GitHub - 1 maintainer
rakutenma 0.3.3 💰
morphological analyzer (word segmentor + PoS Tagger) for Chinese and Japanese
5 versions - Latest release: over 8 years ago - 4 dependent repositories - 8 downloads last month - 20 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
ekphrasis 0.5.4
Text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekph...
54 versions - Latest release: over 3 years ago - 48 dependent repositories - 1.9 thousand downloads last month - 670 stars on GitHub - 1 maintainer
speliuk 0.0.2
Speliuk is a spell checker for the Ukrainian language based on SymSpell and Language Models.
2 versions - Latest release: 12 months ago - 110 downloads last month - 3,226 stars on GitHub - 1 maintainer
python-rdrsegmenter 0.1.1
python_rdrsegmenter
2 versions - Latest release: almost 5 years ago - 1 dependent repository - 56 downloads last month - 1 star on GitHub - 1 maintainer
Top 5.1% on pypi.org
vncorenlp 1.0.3
A Python wrapper for VnCoreNLP using a bidirectional communication channel.
2 versions - Latest release: about 7 years ago - 4 dependent packages - 31 dependent repositories - 2.03 thousand downloads last month - 56 stars on GitHub - 1 maintainer
simjb 0.2.0
A simple version of jieba.
3 versions - Latest release: about 3 years ago - 1 dependent repository - 20 downloads last month - 4 stars on GitHub - 1 maintainer
giganticode-codeprep 1.0.0
A toolkit for pre-processing large source code corpora
1 version - Latest release: over 5 years ago - 1 dependent repository - 13 downloads last month - 47 stars on GitHub - 1 maintainer
thongna 0.2.4
Blazing-fast Thai text processing library powered by Rust
7 versions - Latest release: about 1 year ago - 806 downloads last month - 3 stars on GitHub - 1 maintainer
iparser 0.1.8
Integrated and Industrial Strength Dependency Parser
1 version - Latest release: over 7 years ago - 1 dependent repository - 9 downloads last month - 10 stars on GitHub - 1 maintainer
Top 2.4% on pypi.org
symspellpy 6.9.0 💰
Python SymSpell
25 versions - Latest release: 6 months ago - 13 dependent packages - 118 dependent repositories - 208 thousand downloads last month - 839 stars on GitHub - 1 maintainer
Top 5.7% on pypi.org
kiwipiepy-model 0.21.0 💰
Model for kiwipiepy
13 versions - Latest release: 4 months ago - 1 dependent package - 3 dependent repositories - 66.1 thousand downloads last month - 319 stars on GitHub - 1 maintainer
m3tl 0.7.0
BERT for Multi-task Learning
1 version - Latest release: almost 4 years ago - 10 downloads last month - 549 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
kiwipiepy 0.21.0 💰
Kiwi, the Korean Tokenizer for Python
51 versions - Latest release: 4 months ago - 6 dependent packages - 10 dependent repositories - 103 thousand downloads last month - 319 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
pycantonese 3.4.0 💰
Cantonese Linguistics and NLP in Python
24 versions - Latest release: over 3 years ago - 1 dependent package - 6 dependent repositories - 2.45 thousand downloads last month - 386 stars on GitHub - 1 maintainer
hanlperceptron 0.2.0
Native Python HanLP Perceptron Model: HanLPerceptron
3 versions - Latest release: almost 4 years ago - 1 dependent repository - 22 downloads last month - 7 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
youtokentome 1.0.6
Unsupervised text tokenizer focused on computational efficiency
8 versions - Latest release: over 5 years ago - 8 dependent packages - 228 dependent repositories - 93.6 thousand downloads last month - 968 stars on GitHub - 3 maintainers
politely 4.1.0 💰
An explainable styler for the Korean language
22 versions - Latest release: over 1 year ago - 67 downloads last month - 312 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
nagisa 0.2.11
A Japanese tokenizer based on recurrent neural networks
25 versions - Latest release: over 1 year ago - 5 dependent packages - 28 dependent repositories - 170 thousand downloads last month - 400 stars on GitHub - 1 maintainer
bi-lstm-crf 0.2.1
A PyTorch implementation of the BI-LSTM-CRF model
5 versions - Latest release: almost 5 years ago - 1 dependent repository - 610 downloads last month - 237 stars on GitHub - 1 maintainer
vistickedword 0.9.5
Library to split sticked Vietnamese words
7 versions - Latest release: about 5 years ago - 2 dependent repositories - 61 downloads last month - 2 stars on GitHub - 1 maintainer
symspellcpppy 0.0.18
A fast SymSpell port for Python written in C++ using pybind11.
18 versions - Latest release: over 2 years ago - 1 dependent repository - 152 downloads last month - 43 stars on GitHub - 1 maintainer
codeprep 1.0.5
A toolkit for pre-processing large source code corpora
4 versions - Latest release: over 4 years ago - 1 dependent repository - 40 downloads last month - 47 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
mecab-python-windows 0.996.3 💰
Python wrapper for CaboCha: Japanese Dependency Structure Analyzer
3 versions - Latest release: over 6 years ago - 19 dependent repositories - 192 stars on GitHub
Related Keywords
nlp (27), natural-language-processing (17), python (11), named-entity-recognition (10), tokenizer (9), ner (7), pos-tagging (7), NLP (7), nlp-library (6), chinese-word-segmentation (6), bert (6), text-processing (4), thai (4), thai-language (4), parser (4), computational-linguistics (4), vietnamese-nlp (4), sequence-labeling (4), morphological-analysis (4), corpus (4), machine (4), learning (4), text-classification (4), natural language processing (4), text-segmentation (4), spelling-correction (4), chinese-nlp (3), language-modeling (3), pre-processing (3), source-code-analysis (3), mining-software-repositories (3), crf (3), pytorch (3), code (3), source (3), data (3), dependency-parser (3), analysis (3), part-of-speech-tagger (3), chinese-text-segmentation (3), morphological (3), fuzzy-matching (3), fuzzy-search (3), spell-check (3), spellcheck (3), spelling (3), symspell (3), korean (3), korean-nlp (3), korean-tokenizer (3), python-library (3), large (3), big (3), computational linguistics (3), pretrained-models (3), thai-nlp (3), hacktoberfest (3), ckip (3), part-of-speech-tagging (3), sentence-segmentation (2), jieba-chinese (2), jieba (2), Korean (2), tensorflow (2), transformers (2), soundex (2), chinese (2), thai-nlp-library (2), pos-tagger (2), thai-soundex (2), sentence-parsing (2), word segmentation (2), linguistics (2), corpora (2), speech (2), language (2), sentencepiece (2), neural-machine-translation (2), pythainlp (2), levenshtein-distance (2), bpe (2), levenshtein (2), edit-distance (2), damerau-levenshtein (2), approximate-string-matching (2), tokenization (2), cws (2), encoder-decoder (2), multi-task-learning (2), multitask-learning (2), japanese (2), part-of-speech (2), transformer (2), nodejs (2), vncorenlp (2), rust (2), python-vncorenlp (2), stop-words (1), word (1), word-split (1)