Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "tokenizing" keyword
Top 9.8% on pypi.org
15 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 1.4 thousand downloads last month - 485 stars on GitHub - 1 maintainer
tokenmonster 1.1.12
Tokenize and decode text with TokenMonster vocabularies.15 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 1.4 thousand downloads last month - 485 stars on GitHub - 1 maintainer
Top 0.2% on pypi.org
59 versions - Latest release: over 1 year ago - 1,440 dependent packages - 57,572 dependent repositories - 21.3 million downloads last month - 12,667 stars on GitHub - 4 maintainers
nltk 3.8.1
Natural Language Toolkit59 versions - Latest release: over 1 year ago - 1,440 dependent packages - 57,572 dependent repositories - 21.3 million downloads last month - 12,667 stars on GitHub - 4 maintainers
Top 1.0% on pypi.org
15 versions - Latest release: over 2 years ago - 19 dependent packages - 768 dependent repositories - 64.6 thousand downloads last month - 3 maintainers
konlpy 0.6.0
Python package for Korean natural language processing.15 versions - Latest release: over 2 years ago - 19 dependent packages - 768 dependent repositories - 64.6 thousand downloads last month - 3 maintainers
Top 4.4% on pypi.org
15 versions - Latest release: over 5 years ago - 2 dependent packages - 15 dependent repositories - 12.2 thousand downloads last month - 620 stars on GitHub - 1 maintainer
jieba-fast 0.53
Use C and Swig to Speed up jieba<Chinese Words Segementation Utilities>15 versions - Latest release: over 5 years ago - 2 dependent packages - 15 dependent repositories - 12.2 thousand downloads last month - 620 stars on GitHub - 1 maintainer
cwsharp 0.2
Chinese Words Segementations2 versions - Latest release: over 6 years ago - 1 dependent repositories - 14 downloads last month - 2 stars on GitHub - 1 maintainer
tinkle 0.0.1
Data Simplified2 versions - Latest release: about 6 years ago - 1 dependent repositories - 46 downloads last month - 1 maintainer
texterra 1.0.1
API for natural language processing.2 versions - Latest release: over 6 years ago - 2 dependent repositories - 24 downloads last month - 1 maintainer
test-konlp 0.0.1
Korean Natural Language Toolkit1 version - Latest release: about 6 years ago - 1 dependent repositories - 7 downloads last month - 1 maintainer
Top 4.1% on pypi.org
9 versions - Latest release: about 2 months ago - 1 dependent package - 103 dependent repositories - 4.87 thousand downloads last month - 1 maintainer
phrasetree 0.0.9
Phrase Tree from Natural Language Toolkit9 versions - Latest release: about 2 months ago - 1 dependent package - 103 dependent repositories - 4.87 thousand downloads last month - 1 maintainer
nltk-ma 0.0.6
NLTK Source6 versions - Latest release: over 2 years ago - 1 dependent repositories - 71 downloads last month - 0 stars on GitHub - 1 maintainer
nltk2-fixed 2.0.6
Natural Language Toolkit1 version - Latest release: over 5 years ago - 1 dependent repositories - 84 downloads last month - 1 maintainer
nlpiper 0.3.1
NLPiper, a lightweight package integrated with a universe of frameworks to pre-process documents.5 versions - Latest release: about 2 years ago - 2 dependent repositories - 36 downloads last month - 17 stars on GitHub - 3 maintainers
neu 1.5
中文自然语言处理工具包4 versions - Latest release: about 6 years ago - 2 dependent repositories - 26 downloads last month - 2 stars on GitHub - 1 maintainer
Top 6.4% on pypi.org
24 versions - Latest release: almost 6 years ago - 30 dependent repositories - 2.37 thousand downloads last month - 50 stars on GitHub - 2 maintainers
metapy 0.2.13
Python bindings for MeTA24 versions - Latest release: almost 6 years ago - 30 dependent repositories - 2.37 thousand downloads last month - 50 stars on GitHub - 2 maintainers
konlp 0.0.58
Korean Natural Language Toolkit48 versions - Latest release: about 1 year ago - 4 dependent repositories - 475 downloads last month - 1 maintainer
kkltk 1.0
kkltk is a toolkit designed for Kinyarwanda and Kirundi languages processing1 version - Latest release: over 3 years ago - 1 dependent repositories - 6 downloads last month - 1 stars on GitHub - 1 maintainer
jk-tokenizingparsing 0.2020.1.20
This python module provides basic classes for tokenizing and parsing.1 version - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 0 stars on GitHub - 1 maintainer
jk-php-tokenizer 0.2020.3.9
This python module is a tokenizer for configuration files written in PHP.1 version - Latest release: about 4 years ago - 1 dependent repositories - 11 downloads last month - 1 stars on GitHub - 1 maintainer
jieba-hant 0.39.1
Traditional Chinese Words Segementation Utilities1 version - Latest release: about 5 years ago - 1 dependent repositories - 92 downloads last month - 0 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
36 versions - Latest release: 9 months ago - 1 dependent package - 7 dependent repositories - 772 downloads last month - 2,301 stars on GitHub - 1 maintainer
harvesttext 0.8.2
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法36 versions - Latest release: 9 months ago - 1 dependent package - 7 dependent repositories - 772 downloads last month - 2,301 stars on GitHub - 1 maintainer
Top 6.9% on pypi.org
6 versions - Latest release: over 5 years ago - 2 dependent packages - 13 dependent repositories - 411 downloads last month - 56 stars on GitHub - 1 maintainer
eunjeon 0.4.0
Python interface for eunjeon project & mecab based morphological analyzer.6 versions - Latest release: over 5 years ago - 2 dependent packages - 13 dependent repositories - 411 downloads last month - 56 stars on GitHub - 1 maintainer
ckip-segmenter 1.0.2
Ckip Segmenter3 versions - Latest release: almost 7 years ago - 1 dependent package - 1 dependent repositories - 21 downloads last month - 8 stars on GitHub - 1 maintainer
banglanltk 0.0.4
Bangla Natural Language Processing Toolkit2 versions - Latest release: almost 4 years ago - 337 downloads last month - 1 maintainer
Top 3.0% on pypi.org
6 versions - Latest release: almost 4 years ago - 11 dependent packages - 146 dependent repositories - 4.48 thousand downloads last month - 301 stars on GitHub - 1 maintainer
tweet-preprocessor 0.6.0
Elegant tweet preprocessing6 versions - Latest release: almost 4 years ago - 11 dependent packages - 146 dependent repositories - 4.48 thousand downloads last month - 301 stars on GitHub - 1 maintainer
tolkien 0.0.1
Token class for lexers and parsers.1 version - Latest release: over 4 years ago - 1 dependent repositories - 9 downloads last month - 5 stars on GitHub - 1 maintainer
koshort 0.4.1.7
koshort is a Python package for Korean internet spoken language crawling and processing... or may...12 versions - Latest release: almost 6 years ago - 1 dependent repositories - 74 downloads last month - 1 maintainer
wbjieba 0.42.1
Chinese Words Segmentation Utilities1 version - Latest release: over 2 years ago - 1 dependent repositories - 68 downloads last month - 32,268 stars on GitHub - 1 maintainer
zuel-test 0.1
Chinese Words Segmentation Utilities1 version - Latest release: over 2 years ago - 1 dependent repositories - 11 downloads last month - 32,268 stars on GitHub - 1 maintainer
clean-plot 0.0.13
clean_plot simplifies cleaning text files for creation of embeddings and making plots from it12 versions - Latest release: over 1 year ago - 1 dependent repositories - 64 downloads last month - 3 stars on GitHub - 1 maintainer
pyscws 0.0.1.1
SCWS,但是python31 version - Latest release: 7 months ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
Related Keywords
NLP
21
natural language processing
15
tagging
13
parsing
11
CL
10
computational linguistics
10
text analytics
9
natural language
9
linguistics
9
Chinese word segementation
8
language
7
syntax
7
nlp
5
python
3
Korean
3
CJK
3
tokenizer
3
konltk
2
koNLP
2
NLPK
2
KoNLTK
2
text-processing
2
sentiment analysis
2
cleaning
2
nlpk
2
konlp
2
korean natural langugae processing
2
morpheme anaylisis
2
chunk
2
text anayltics
2
tokenize
2
tokenization
2
preprocessing
2
unsupervised
1
text-summarization
1
korean
1
text-segmentation
1
text-cleaning
1
sentiment-analysis
1
pyhanlp
1
new-word-discovery
1
named-entity-recognition
1
keyword-extraction
1
harvesttext
1
gitee
1
dependency-parser
1
text cleaning
1
entity linking
1
php
1
text-preprocessing
1
scientific-machine-learning
1
plotting
1
embeddings
1
unicode
1
token
1
regular expression
1
regex
1
lexing
1
lexer
1
lex
1
grammar
1
generator
1
dimensionality reduction
1
tweet
1
processing
1
machine learning
1
synonym
1
stemming
1
bangla
1
part-of-speech tagging
1
morphology
1
key concepts detection
1
disambiguation
1
spelling correction
1
named entity recognition
1
lemmatization
1
text processing
1
mmseg
1
viterbi-hmm
1
swig
1
jieba
1
dag
1
nltk
1
natural-language-processing
1
machine-learning
1
vocabulary-generator
1
vocabulary-builder
1
vocabulary
1
tokenisation
1
text-tokenization
1
low-resource-languages-toolkit
1
low-resource-languages
1
stopwords
1
text preprocessing
1
Kirundi
1
Kinyarwanda
1
Low-resource languages
1
text analysis
1
text mining
1
lingustics
1