Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "tokenizing" keyword

Top 9.8% on pypi.org
tokenmonster 1.1.12
Tokenize and decode text with TokenMonster vocabularies.
15 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 1.4 thousand downloads last month - 485 stars on GitHub - 1 maintainer
Top 0.2% on pypi.org
nltk 3.8.1
Natural Language Toolkit
59 versions - Latest release: over 1 year ago - 1,440 dependent packages - 57,572 dependent repositories - 21.3 million downloads last month - 12,667 stars on GitHub - 4 maintainers
Top 1.0% on pypi.org
konlpy 0.6.0
Python package for Korean natural language processing.
15 versions - Latest release: over 2 years ago - 19 dependent packages - 768 dependent repositories - 64.6 thousand downloads last month - 3 maintainers
Top 4.4% on pypi.org
jieba-fast 0.53
Use C and Swig to Speed up jieba<Chinese Words Segementation Utilities>
15 versions - Latest release: over 5 years ago - 2 dependent packages - 15 dependent repositories - 12.2 thousand downloads last month - 620 stars on GitHub - 1 maintainer
cwsharp 0.2
Chinese Words Segementations
2 versions - Latest release: over 6 years ago - 1 dependent repositories - 14 downloads last month - 2 stars on GitHub - 1 maintainer
tinkle 0.0.1
Data Simplified
2 versions - Latest release: about 6 years ago - 1 dependent repositories - 46 downloads last month - 1 maintainer
texterra 1.0.1
API for natural language processing.
2 versions - Latest release: over 6 years ago - 2 dependent repositories - 24 downloads last month - 1 maintainer
test-konlp 0.0.1
Korean Natural Language Toolkit
1 version - Latest release: about 6 years ago - 1 dependent repositories - 7 downloads last month - 1 maintainer
Top 4.1% on pypi.org
phrasetree 0.0.9
Phrase Tree from Natural Language Toolkit
9 versions - Latest release: about 2 months ago - 1 dependent package - 103 dependent repositories - 4.87 thousand downloads last month - 1 maintainer
nltk-ma 0.0.6
NLTK Source
6 versions - Latest release: over 2 years ago - 1 dependent repositories - 71 downloads last month - 0 stars on GitHub - 1 maintainer
nltk2-fixed 2.0.6
Natural Language Toolkit
1 version - Latest release: over 5 years ago - 1 dependent repositories - 84 downloads last month - 1 maintainer
nlpiper 0.3.1
NLPiper, a lightweight package integrated with a universe of frameworks to pre-process documents.
5 versions - Latest release: about 2 years ago - 2 dependent repositories - 36 downloads last month - 17 stars on GitHub - 3 maintainers
neu 1.5
中文自然语言处理工具包
4 versions - Latest release: about 6 years ago - 2 dependent repositories - 26 downloads last month - 2 stars on GitHub - 1 maintainer
Top 6.4% on pypi.org
metapy 0.2.13
Python bindings for MeTA
24 versions - Latest release: almost 6 years ago - 30 dependent repositories - 2.37 thousand downloads last month - 50 stars on GitHub - 2 maintainers
konlp 0.0.58
Korean Natural Language Toolkit
48 versions - Latest release: about 1 year ago - 4 dependent repositories - 475 downloads last month - 1 maintainer
kkltk 1.0
kkltk is a toolkit designed for Kinyarwanda and Kirundi languages processing
1 version - Latest release: over 3 years ago - 1 dependent repositories - 6 downloads last month - 1 stars on GitHub - 1 maintainer
jk-tokenizingparsing 0.2020.1.20
This python module provides basic classes for tokenizing and parsing.
1 version - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 0 stars on GitHub - 1 maintainer
jk-php-tokenizer 0.2020.3.9
This python module is a tokenizer for configuration files written in PHP.
1 version - Latest release: about 4 years ago - 1 dependent repositories - 11 downloads last month - 1 stars on GitHub - 1 maintainer
jieba-hant 0.39.1
Traditional Chinese Words Segementation Utilities
1 version - Latest release: about 5 years ago - 1 dependent repositories - 92 downloads last month - 0 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
harvesttext 0.8.2
文本挖掘和预处理工具(文本清洗、新词发现、情感分析、实体识别链接、关键词抽取、知识抽取、句法分析等),无监督或弱监督方法
36 versions - Latest release: 9 months ago - 1 dependent package - 7 dependent repositories - 772 downloads last month - 2,301 stars on GitHub - 1 maintainer
Top 6.9% on pypi.org
eunjeon 0.4.0
Python interface for eunjeon project & mecab based morphological analyzer.
6 versions - Latest release: over 5 years ago - 2 dependent packages - 13 dependent repositories - 411 downloads last month - 56 stars on GitHub - 1 maintainer
ckip-segmenter 1.0.2
Ckip Segmenter
3 versions - Latest release: almost 7 years ago - 1 dependent package - 1 dependent repositories - 21 downloads last month - 8 stars on GitHub - 1 maintainer
banglanltk 0.0.4
Bangla Natural Language Processing Toolkit
2 versions - Latest release: almost 4 years ago - 337 downloads last month - 1 maintainer
Top 3.0% on pypi.org
tweet-preprocessor 0.6.0
Elegant tweet preprocessing
6 versions - Latest release: almost 4 years ago - 11 dependent packages - 146 dependent repositories - 4.48 thousand downloads last month - 301 stars on GitHub - 1 maintainer
tolkien 0.0.1
Token class for lexers and parsers.
1 version - Latest release: over 4 years ago - 1 dependent repositories - 9 downloads last month - 5 stars on GitHub - 1 maintainer
koshort 0.4.1.7
koshort is a Python package for Korean internet spoken language crawling and processing... or may...
12 versions - Latest release: almost 6 years ago - 1 dependent repositories - 74 downloads last month - 1 maintainer
wbjieba 0.42.1
Chinese Words Segmentation Utilities
1 version - Latest release: over 2 years ago - 1 dependent repositories - 68 downloads last month - 32,268 stars on GitHub - 1 maintainer
zuel-test 0.1
Chinese Words Segmentation Utilities
1 version - Latest release: over 2 years ago - 1 dependent repositories - 11 downloads last month - 32,268 stars on GitHub - 1 maintainer
clean-plot 0.0.13
clean_plot simplifies cleaning text files for creation of embeddings and making plots from it
12 versions - Latest release: over 1 year ago - 1 dependent repositories - 64 downloads last month - 3 stars on GitHub - 1 maintainer
pyscws 0.0.1.1
SCWS,但是python3
1 version - Latest release: 7 months ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
Related Keywords
NLP 21 natural language processing 15 tagging 13 parsing 11 CL 10 computational linguistics 10 text analytics 9 natural language 9 linguistics 9 Chinese word segementation 8 language 7 syntax 7 nlp 5 python 3 Korean 3 CJK 3 tokenizer 3 konltk 2 koNLP 2 NLPK 2 KoNLTK 2 text-processing 2 sentiment analysis 2 cleaning 2 nlpk 2 konlp 2 korean natural langugae processing 2 morpheme anaylisis 2 chunk 2 text anayltics 2 tokenize 2 tokenization 2 preprocessing 2 unsupervised 1 text-summarization 1 korean 1 text-segmentation 1 text-cleaning 1 sentiment-analysis 1 pyhanlp 1 new-word-discovery 1 named-entity-recognition 1 keyword-extraction 1 harvesttext 1 gitee 1 dependency-parser 1 text cleaning 1 entity linking 1 php 1 text-preprocessing 1 scientific-machine-learning 1 plotting 1 embeddings 1 unicode 1 token 1 regular expression 1 regex 1 lexing 1 lexer 1 lex 1 grammar 1 generator 1 dimensionality reduction 1 tweet 1 processing 1 machine learning 1 synonym 1 stemming 1 bangla 1 part-of-speech tagging 1 morphology 1 key concepts detection 1 disambiguation 1 spelling correction 1 named entity recognition 1 lemmatization 1 text processing 1 mmseg 1 viterbi-hmm 1 swig 1 jieba 1 dag 1 nltk 1 natural-language-processing 1 machine-learning 1 vocabulary-generator 1 vocabulary-builder 1 vocabulary 1 tokenisation 1 text-tokenization 1 low-resource-languages-toolkit 1 low-resource-languages 1 stopwords 1 text preprocessing 1 Kirundi 1 Kinyarwanda 1 Low-resource languages 1 text analysis 1 text mining 1 lingustics 1