Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "word-segmentation" keyword

Top 9.0% on pypi.org
adaseq 0.6.6
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
11 versions - Latest release: 7 months ago - 1 dependent repositories - 334 downloads last month - 367 stars on GitHub - 1 maintainer
Top 0.5% on pypi.org
sentencepiece 0.2.0
SentencePiece python wrapper
32 versions - Latest release: 4 months ago - 802 dependent packages - 18,074 dependent repositories - 20.8 million downloads last month - 9,462 stars on GitHub - 1 maintainer
hellonlp 0.2.41
NLP tools
31 versions - Latest release: over 2 years ago - 1 dependent repositories - 59 downloads last month - 22 stars on GitHub - 1 maintainer
cjieba 0.4.4 💰
Python cffi binding for cjieba
11 versions - Latest release: over 3 years ago - 1 dependent package - 1 dependent repositories - 204 downloads last month - 15 stars on GitHub - 1 maintainer
toiro 0.0.9
A comparison tool of Japanese tokenizers
8 versions - Latest release: 11 months ago - 1 dependent repositories - 103 downloads last month - 111 stars on GitHub - 1 maintainer
trtokenizer 0.0.3
Sentence and word tokenizers for the Turkish language
3 versions - Latest release: almost 3 years ago - 2 dependent repositories - 222 downloads last month - 20 stars on GitHub - 1 maintainer
pytorch-nlu 0.0.2
Pytorch-NLU
2 versions - Latest release: over 2 years ago - 1 dependent repositories - 25 downloads last month - 291 stars on GitHub - 1 maintainer
simjb 0.2.0
A simple version of jieba.
3 versions - Latest release: almost 2 years ago - 1 dependent repositories - 19 downloads last month - 3 stars on GitHub - 1 maintainer
bi-lstm-crf 0.2.1
A PyTorch implementation of the BI-LSTM-CRF model
5 versions - Latest release: over 3 years ago - 1 dependent repositories - 33 downloads last month - 224 stars on GitHub - 1 maintainer
iparser 0.1.8
Integrated and Industrial Strength Dependency Parser
1 version - Latest release: about 6 years ago - 1 dependent repositories - 14 downloads last month - 10 stars on GitHub - 1 maintainer
wordseg 0.0.5 💰
Word segmentation models
5 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 1.54 thousand downloads last month - 3 stars on GitHub - 1 maintainer
m3tl 0.7.0
BERT for Multi-task Learning
1 version - Latest release: over 2 years ago - 10 downloads last month - 544 stars on GitHub - 1 maintainer
bert-multitask-learning 0.7.0
BERT for Multi-task Learning
64 versions - Latest release: over 3 years ago - 320 downloads last month - 544 stars on GitHub - 1 maintainer
rakutenma 0.3.3 💰
morphological analyzer (word segmentor + PoS Tagger) for Chinese and Japanese
5 versions - Latest release: about 7 years ago - 4 dependent repositories - 26 downloads last month - 20 stars on GitHub - 1 maintainer
Top 5.1% on pypi.org
vncorenlp 1.0.3
A Python wrapper for VnCoreNLP using a bidirectional communication channel.
2 versions - Latest release: almost 6 years ago - 4 dependent packages - 31 dependent repositories - 3.22 thousand downloads last month - 55 stars on GitHub - 1 maintainer
giganticode-dataprep 1.0.0a12
A toolkit for pre-processing large source code corpora
6 versions - Latest release: over 4 years ago - 1 dependent repositories - 19 downloads last month - 45 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
pythainlp 5.0.3
Thai Natural Language Processing library
108 versions - Latest release: about 1 month ago - 37 dependent packages - 183 dependent repositories - 224 thousand downloads last month - 926 stars on GitHub - 2 maintainers
thainlp 0.4.2
Thai NLP library
3 versions - Latest release: about 5 years ago - 1 dependent repositories - 28 downloads last month - 926 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
nagisa 0.2.11
A Japanese tokenizer based on recurrent neural networks
25 versions - Latest release: 4 months ago - 5 dependent packages - 28 dependent repositories - 261 thousand downloads last month - 371 stars on GitHub - 1 maintainer
Top 4.4% on pypi.org
ckip-transformers 0.3.4
CKIP Transformers
16 versions - Latest release: about 1 year ago - 2 dependent packages - 14 dependent repositories - 3.77 thousand downloads last month - 649 stars on GitHub - 1 maintainer
ckipnlp 1.0.3
CKIP CoreNLP
25 versions - Latest release: about 1 year ago - 1 dependent repositories - 186 downloads last month - 113 stars on GitHub - 1 maintainer
fidel 0.0.10 💰
Python package that can change Amharic language that written in English alphabet to Amharic alpha...
10 versions - Latest release: 11 months ago - 1 dependent repositories - 49 downloads last month - 772 stars on GitHub - 1 maintainer
giganticode-codeprep 1.0.0
A toolkit for pre-processing large source code corpora
1 version - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 45 stars on GitHub - 1 maintainer
Top 2.4% on pypi.org
symspellpy 6.7.7 💰
Python SymSpell
22 versions - Latest release: over 1 year ago - 13 dependent packages - 118 dependent repositories - 641 thousand downloads last month - 766 stars on GitHub - 1 maintainer
codeprep 1.0.5
A toolkit for pre-processing large source code corpora
4 versions - Latest release: about 3 years ago - 1 dependent repositories - 52 downloads last month - 45 stars on GitHub - 1 maintainer
vistickedword 0.9.5
Library to split sticked Vietnamese words
7 versions - Latest release: almost 4 years ago - 2 dependent repositories - 59 downloads last month - 2 stars on GitHub - 1 maintainer
vgram 0.4.2
V-gram builder library
15 versions - Latest release: almost 5 years ago - 1 dependent repositories - 54 downloads last month - 7 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
pycantonese 3.4.0 💰
Cantonese Linguistics and NLP in Python
24 versions - Latest release: over 2 years ago - 1 dependent package - 6 dependent repositories - 1.5 thousand downloads last month - 335 stars on GitHub - 1 maintainer
Top 2.4% on pypi.org
tf-sentencepiece 0.1.92
SentencePiece Encode/Decode ops for TensorFlow
15 versions - Latest release: about 4 years ago - 1 dependent package - 30 dependent repositories - 5.79 thousand downloads last month - 9,462 stars on GitHub - 1 maintainer
nokcut 0.4 💰
Thai Word Segmentation using TCC + Bidirectional RNNs
1 version - Latest release: over 5 years ago - 1 dependent repositories - 13 downloads last month - 8 stars on GitHub - 1 maintainer
ckip-classic 1.2.3
CKIP Classic NLP Tools
12 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 61 downloads last month - 8 stars on GitHub - 1 maintainer
python-vncorenlp 0.1.8
python_vncorenlp
9 versions - Latest release: almost 4 years ago - 1 dependent repositories - 67 downloads last month - 2 stars on GitHub - 1 maintainer
python-rdrsegmenter 0.1.1
python_rdrsegmenter
2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 96 downloads last month - 1 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
monpa 0.3.3
MONPA is an end-to-end model to jointly conduct Chinese word segmentation, POS and NE labeling
10 versions - Latest release: almost 2 years ago - 2 dependent repositories - 1.78 thousand downloads last month - 244 stars on GitHub - 2 maintainers
Top 4.5% on pypi.org
ekphrasis 0.5.4
Text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekph...
54 versions - Latest release: about 2 years ago - 48 dependent repositories - 1.71 thousand downloads last month - 656 stars on GitHub - 1 maintainer
hanlperceptron 0.2.0
Native Python HanLP Perceptron Model: HanLPerceptron
3 versions - Latest release: over 2 years ago - 1 dependent repositories - 51 downloads last month - 7 stars on GitHub - 1 maintainer
Top 2.2% on pypi.org
youtokentome 1.0.6
Unsupervised text tokenizer focused on computational efficiency
8 versions - Latest release: over 4 years ago - 8 dependent packages - 228 dependent repositories - 43.4 thousand downloads last month - 941 stars on GitHub - 3 maintainers
Top 5.0% on pypi.org
kiwipiepy 0.17.1 💰
Kiwi, the Korean Tokenizer for Python
41 versions - Latest release: about 2 months ago - 6 dependent packages - 10 dependent repositories - 27.8 thousand downloads last month - 189 stars on GitHub - 1 maintainer
politely 4.1.0 💰
An explainable styler for the Korean language
22 versions - Latest release: 2 months ago - 200 downloads last month - 223 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
mecab-python-windows 0.996.3 removed 💰
Python wrapper for CaboCha: Japanese Dependency Structure Analyzer
3 versions - Latest release: over 5 years ago - 19 dependent repositories - 192 stars on GitHub
Top 4.8% on pypi.org
mecab 0.996.3 💰
MeCab binding for many OSs (Windows, macOS, and Linux)
3 versions - Latest release: about 3 years ago - 3 dependent packages - 27 dependent repositories - 3.87 thousand downloads last month - 228 stars on GitHub - 1 maintainer
Top 3.1% on pypi.org
lac 2.1.2
A chinese lexical analysis tool by Baidu NLP.
15 versions - Latest release: about 3 years ago - 3 dependent packages - 21 dependent repositories - 1.45 thousand downloads last month - 3,765 stars on GitHub - 1 maintainer
Top 5.7% on pypi.org
kiwipiepy-model 0.17.0 💰
Model for kiwipiepy
9 versions - Latest release: 3 months ago - 1 dependent package - 3 dependent repositories - 18.5 thousand downloads last month - 189 stars on GitHub - 1 maintainer
symspellcpppy 0.0.18
A Fast SymSpell port for python written in C++ using pybind11.
18 versions - Latest release: about 1 year ago - 1 dependent repositories - 133 downloads last month - 38 stars on GitHub - 1 maintainer
vietokenizer 1.0.3
Vietnamese Tokenizer package based on deep learning method
3 versions - Latest release: over 1 year ago - 49 downloads last month - 2 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
nlpo3 1.3.0
Python binding for nlpO3 Thai language processing library in Rust
9 versions - Latest release: about 1 year ago - 1 dependent package - 3 dependent repositories - 1.08 thousand downloads last month - 30 stars on GitHub - 2 maintainers
Related Keywords
nlp 25 natural-language-processing 17 python 11 named-entity-recognition 10 pos-tagging 8 nlp-library 7 NLP 7 tokenizer 7 ner 7 chinese-word-segmentation 6 bert 6 morphological-analysis 5 thai-language 4 sequence-labeling 4 spelling-correction 4 corpus 4 machine 4 text-classification 4 vietnamese-nlp 4 learning 4 natural language processing 4 text-segmentation 4 parser 4 morphological 3 source-code-analysis 3 hacktoberfest 3 pretrained-models 3 chinese-text-segmentation 3 fuzzy-search 3 mining-software-repositories 3 language-modeling 3 pre-processing 3 dependency-parser 3 part-of-speech-tagger 3 computational linguistics 3 code 3 source 3 data 3 large 3 big 3 fuzzy-matching 3 spell-check 3 pytorch 3 spellcheck 3 part-of-speech-tagging 3 ckip 3 spelling 3 symspell 3 thai-nlp 3 python-library 3 chinese-nlp 3 korean-tokenizer 3 korean-nlp 3 korean 3 analysis 3 thai 3 vncorenlp 2 python-vncorenlp 2 pos-tagger 2 chinese 2 levenshtein-distance 2 levenshtein 2 edit-distance 2 damerau-levenshtein 2 approximate-string-matching 2 sentence-parsing 2 thai-soundex 2 thai-nlp-library 2 pythainlp 2 tensorflow 2 hacktoberfest-accepted 2 soundex 2 transformers 2 sentence-segmentation 2 Korean 2 neural-machine-translation 2 mecab 2 japanese 2 tokenization 2 text-processing 2 crf 2 word segmentation 2 linguistics 2 corpora 2 jieba 2 jieba-chinese 2 transformer 2 part-of-speech 2 multitask-learning 2 multi-task-learning 2 encoder-decoder 2 cws 2 computational-linguistics 2 language 2 speech 2 vgram 1 sequential-data 1 feature-extraction 1 Chinese 1 Cantonese 1