Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "text processing" keyword

Top 9.4% on pypi.org
wordcloud-fa 0.1.10 💰
A wrapper for wordcloud module for creating persian (and other rtl languages) word cloud.
10 versions - Latest release: over 1 year ago - 6 dependent repositories - 464 downloads last month - 139 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
thai-nner 0.3
Thai Nested Named Entity Recognition
3 versions - Latest release: about 2 years ago - 1 dependent package - 3 dependent repositories - 313 downloads last month - 36 stars on GitHub - 2 maintainers
Top 7.9% on pypi.org
nemo-text-processing 1.0.2
NeMo text processing for ASR and TTS
13 versions - Latest release: about 1 month ago - 2 dependent packages - 1 dependent repositories - 50.2 thousand downloads last month - 230 stars on GitHub - 1 maintainer
thai2transformers 0.1.2
Pretraining transformer based Thai language models
8 versions - Latest release: about 3 years ago - 1 dependent repositories - 261 downloads last month - 114 stars on GitHub - 1 maintainer
freqframe 1.0.0
1 version - Latest release: about 3 years ago - 1 dependent repositories - 5 downloads last month - 1 maintainer
Top 6.1% on pypi.org
wordaxe 1.0.1
Provide hyphenation for python programs and ReportLab paragraphs.
5 versions - Latest release: 10 months ago - 1 dependent package - 4 dependent repositories - 1 maintainer
hanpud 0.1.dev0
Han Pud (ā¸Ģāšˆā¸˛ā¸™ ā¸žā¸šā¸”): Thai super large generative model
1 version - Latest release: about 1 year ago - 6 downloads last month - 1 stars on GitHub - 1 maintainer
cleantextkit 0.1.1
A preprocessor which performs operations of lowering text, removing special characters and removi...
2 versions - Latest release: 10 months ago - 25 downloads last month - 1 maintainer
rebnf 0.9
ReBNF: Regexes for Extended Backus-Naur Form (EBNF)
7 versions - Latest release: 12 months ago - 62 downloads last month - 1 maintainer
texterra 1.0.1
API for natural language processing.
2 versions - Latest release: over 6 years ago - 2 dependent repositories - 24 downloads last month - 1 maintainer
breame 0.1.2
Breame is a lightweight Python package with a number of tools to aid in the detection of words th...
3 versions - Latest release: over 2 years ago - 1 dependent repositories - 153 downloads last month - 11 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
addheader 0.3.2
A command to manage a header section for a source code tree
9 versions - Latest release: over 1 year ago - 3 dependent packages - 19 dependent repositories - 17.8 thousand downloads last month - 0 stars on GitHub - 1 maintainer
printb 1.0.2 💰
printb is a wrapper for print/input built-ins, that swaps string directions for BIDI languages.
1 version - Latest release: over 2 years ago - 1 dependent repositories - 19 downloads last month - 0 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
textacy 0.13.0
NLP, before and after spaCy
32 versions - Latest release: about 1 year ago - 18 dependent packages - 436 dependent repositories - 49.7 thousand downloads last month - 2,180 stars on GitHub - 1 maintainer
pewanalytics 1.1.1
Utilities for text processing and statistical analysis from Pew Research Center
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 22 downloads last month - 78 stars on GitHub - 1 maintainer
thaibraille 0.1.dev2
Thai Braille for Natural Language Processing.
3 versions - Latest release: about 1 year ago - 19 downloads last month - 3 stars on GitHub - 1 maintainer
tainlp 0.0.1.dev0
Tai Natural Language Processing library
1 version - Latest release: about 1 year ago - 5 downloads last month - 1 maintainer
spacy-pythainlp 0.1
PyThaiNLP For spaCy
9 versions - Latest release: over 1 year ago - 1 dependent repositories - 199 downloads last month - 12 stars on GitHub - 1 maintainer
hassans-frame 0.0.0
1 version - Latest release: about 3 years ago - 1 dependent repositories - 4 downloads last month - 1 maintainer
ttg 0.1.dev3
Thai Text Generator library
3 versions - Latest release: about 4 years ago - 1 dependent repositories - 199 downloads last month - 4 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
quantulum3 0.9.1 💰
Extract quantities from unstructured text.
40 versions - Latest release: about 2 months ago - 8 dependent packages - 44 dependent repositories - 122 thousand downloads last month - 130 stars on GitHub - 1 maintainer
gatenlp-ml-tner 0.1.0a1
Train and use transformer token classification models using tner
1 version - Latest release: almost 2 years ago - 9 downloads last month - 0 stars on GitHub - 1 maintainer
freq-frame 1.0.0
1 version - Latest release: about 3 years ago - 1 dependent repositories - 4 downloads last month - 1 maintainer
Top 1.9% on pypi.org
pythainlp 5.0.3
Thai Natural Language Processing library
108 versions - Latest release: 29 days ago - 37 dependent packages - 183 dependent repositories - 224 thousand downloads last month - 926 stars on GitHub - 2 maintainers
khamyo 0.2.0 💰
Thai abbreviation to full text library
3 versions - Latest release: 12 months ago - 1 dependent package - 1 dependent repositories - 318 downloads last month - 4 stars on GitHub - 1 maintainer
gatenlp 1.0.8
GATE NLP implementation in Python.
29 versions - Latest release: over 1 year ago - 2 dependent repositories - 337 downloads last month - 57 stars on GitHub - 3 maintainers
intertext 0.0.1
tools for relational discourse analysis
1 version - Latest release: over 6 years ago - 1 dependent repositories - 11 downloads last month - 1 maintainer
textdatasetcleaner 0.0.6
Pipeline for cleaning (preprocessing/normalizing) text datasets
4 versions - Latest release: over 3 years ago - 1 dependent repositories - 31 downloads last month - 38 stars on GitHub - 1 maintainer
pug 0.1.22
Meta package to install the PDX Python User Group utilities.
11 versions - Latest release: about 9 years ago - 8 dependent repositories - 114 downloads last month - 12 stars on GitHub - 1 maintainer
ingredient-slicer 1.0.1
Parses unstructured recipe ingredient text into standardized quantities, units, and foods
19 versions - Latest release: about 1 month ago - 420 downloads last month - 1 maintainer
rdatools 0.1.7
tools for relational discourse analysis
2 versions - Latest release: over 6 years ago - 1 dependent repositories - 9 downloads last month - 1 maintainer
deepcoreml 0.3.4
A collection of Machine Learning techniques for data management, engineering and augmentation.
9 versions - Latest release: about 1 month ago - 104 downloads last month - 1 stars on GitHub - 1 maintainer
easynertag 0.2 💰
Easy tagging for annotate NER corpus
2 versions - Latest release: over 1 year ago - 16 downloads last month - 2 stars on GitHub - 1 maintainer
nlup 0.8
('Core libraries for natural language processing',)
4 versions - Latest release: over 5 years ago - 11 dependent repositories - 1.16 thousand downloads last month - 10 stars on GitHub - 3 maintainers
quantulum 0.1.16
Extract quantities from unstructured text.
17 versions - Latest release: 10 months ago - 1 dependent package - 4 dependent repositories - 73 downloads last month - 119 stars on GitHub - 1 maintainer
pythaitts 0.3.0
Open Source Thai Text-to-speech library in Python
5 versions - Latest release: 5 months ago - 1 dependent repositories - 181 downloads last month - 27 stars on GitHub - 1 maintainer
lekcut 0.1
LEKCut (āš€ā¸Ĩāš‡ā¸ ā¸„ā¸ąā¸”) is a Thai tokenization library that ports the deep learning model to the onnx m...
1 version - Latest release: over 1 year ago - 34 downloads last month - 7 stars on GitHub - 1 maintainer
pythaisa 0.2.1
Python Thai Sentiment Analysis
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 13 downloads last month - 13 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
laonlp 1.1.3 💰
Lao Natural Language Processing library
18 versions - Latest release: 8 months ago - 1 dependent package - 4 dependent repositories - 574 downloads last month - 26 stars on GitHub - 1 maintainer
thaitextaug 0.0.4 💰
Thai Text Augmentation
15 versions - Latest release: about 3 years ago - 1 dependent repositories - 101 downloads last month - 5 stars on GitHub - 1 maintainer
ai-data-preprocessing-queue 1.4.1
Can be used to pre process data before ai processing
3 versions - Latest release: 7 months ago - 177 downloads last month - 1 maintainer
pylda2vec 1.0.0
Mixing Dirichlet Topic Models and Word Embeddings to Make lda2vec
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 31 downloads last month - 29 stars on GitHub - 1 maintainer
multiel 0.5
Multilingual Entity Linking model by BELA model
5 versions - Latest release: 12 months ago - 1 dependent package - 119 downloads last month - 8 stars on GitHub - 1 maintainer
fkscore 2.0.1
Flesch Kincaid readability scoring algorithm
7 versions - Latest release: 5 months ago - 1 dependent repositories - 270 downloads last month - 2 stars on GitHub - 1 maintainer
fixthaipdf 0.2.1 💰
Fix Thai PDF Text
3 versions - Latest release: 7 months ago - 380 downloads last month - 20 stars on GitHub - 1 maintainer
huspacy-nightly 0.11.0.dev261 💰
HuSpaCy: industrial strength Hungarian natural language processing
126 versions - Latest release: 5 months ago - 1 dependent repositories - 303 downloads last month - 142 stars on GitHub - 1 maintainer
auto-mapper 0.1.2
An auto mapper that accepts a list of string and a list of objects of the format {'code', 'name'}...
2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 119 downloads last month - 1 maintainer
Top 6.6% on pypi.org
huspacy 0.11.0 💰
HuSpaCy: industrial strength Hungarian natural language processing
21 versions - Latest release: 8 months ago - 1 dependent package - 6 dependent repositories - 933 downloads last month - 142 stars on GitHub - 1 maintainer
hadal 0.0.3
Tool for mining/alignment parallel texts
3 versions - Latest release: 7 months ago - 25 downloads last month - 2 stars on GitHub - 1 maintainer
sotastream 1.0.1
Sotastream is a command line tool that augments a batch of text and produces infinite stream of r...
2 versions - Latest release: 10 months ago - 110 downloads last month - 20 stars on GitHub - 2 maintainers
wakong 1.1.1 💰
Wakong: An appropriate and robust masking algorithm for generating the training objective of text...
3 versions - Latest release: over 1 year ago - 1 dependent repositories - 38 downloads last month - 3 stars on GitHub - 1 maintainer
jange 0.1.7
Easy NLP library for Python
8 versions - Latest release: over 2 years ago - 1 dependent repositories - 76 downloads last month - 17 stars on GitHub - 1 maintainer
linesieve 1.0
An unholy blend of grep, sed, awk, and Python.
13 versions - Latest release: about 1 year ago - 1 dependent repositories - 102 downloads last month - 7 stars on GitHub - 1 maintainer
delb 0.3.0
A library that provides an ergonomic model for XML encoded text documents (e.g. with TEI-XML).
25 versions - Latest release: over 2 years ago - 2 dependent packages - 4 dependent repositories - 586 downloads last month - 15 stars on GitHub - 1 maintainer
zalgolib 0.2.2
A Python library for a _FULL_ Zalgo experience
4 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 3.51 thousand downloads last month - 3 stars on GitHub - 1 maintainer
docdump 1.0.4
A package to extract text from common document types.
5 versions - Latest release: over 3 years ago - 1 dependent repositories - 24 downloads last month - 0 stars on GitHub - 1 maintainer
lttl 2.0.12
LangTech Text Library (LTTL) for text processing and analysis
23 versions - Latest release: over 3 years ago - 1 dependent repositories - 1.45 thousand downloads last month - 3 stars on GitHub - 1 maintainer
qante 0.0.5
qante - Query ANnotated TExt
5 versions - Latest release: 9 months ago - 28 downloads last month - 5 stars on GitHub - 1 maintainer
punjabi-stemmer 1.0.1
A Python library for stemming Punjabi language words, including preprocessing for noise removal.
2 versions - Latest release: 3 months ago - 22 downloads last month - 1 stars on GitHub - 1 maintainer
thaixtransformers 0.1.0
ThaiXtransformers: Use Pretraining RoBERTa based Thai language models from VISTEC-depa AI Researc...
1 version - Latest release: 11 months ago - 73 downloads last month - 7 stars on GitHub - 1 maintainer
processtext 0.1.7
An open-source python package to process text data
10 versions - Latest release: 4 months ago - 50 downloads last month - 4 stars on GitHub - 1 maintainer
tex-untag 1.3.0
A script for removing all of a given markup tag from a set of TeX files.
6 versions - Latest release: over 2 years ago - 1 dependent repositories - 57 downloads last month - 1 stars on GitHub - 1 maintainer
pypage 2.0.9
Light-weight Python Templating Engine
7 versions - Latest release: 9 months ago - 1 dependent package - 1 dependent repositories - 58 downloads last month - 31 stars on GitHub - 1 maintainer
arabicscript 0.1.4
Tools for Arabic script
4 versions - Latest release: almost 8 years ago - 1 dependent repositories - 25 downloads last month - 8 stars on GitHub - 1 maintainer
ck-textprocessor 0.0.1 removed
A preprocessor which performs operations of lowering text, removing special characters and removi...
1 version - Latest release: 10 months ago - 1 maintainer
Related Keywords
natural language processing 29 nlp 24 text analytics 20 localization 18 NLP 18 computational linguistics 17 python 14 Thai language 11 natural-language-processing 9 ThaiNLP 8 Thai NLP 7 nlp-library 7 text 7 machine-learning 7 text mining 6 hacktoberfest 5 text-processing 5 thai-language 5 thai-nlp 5 thai 5 text-mining 5 data science 5 python3 4 thai-nlp-library 4 spacy 4 parsing 3 tagging 3 data analysis 3 lemmatization 3 named entity recognition 3 deep learning 3 machine learning 3 regex 3 information-extraction 3 linguistics 3 language 3 statistics 3 Thai 3 information extraction 2 quantities 2 universal-dependencies 2 pythainlp 2 discourse analysis 2 units-of-measure 2 network analysis 2 math 2 science 2 citation analysis 2 textmining 2 zotero 2 bibliometrics 2 neural net 2 search 2 scientometrics 2 text analysis 2 machine translation 2 measurements 2 units 2 spacy-pipeline 2 word embeddings 2 ner 2 thainlp 2 pos tagging 2 sentence splitting 2 sbd 2 sentence boundary detection 2 tts 2 tokenization 2 language processing 2 Hungarian 2 huspacy 2 deep-learning 2 gatenlp 2 python-gatenlp 2 topic-modeling 2 spacy-models 2 pos-tagger 2 parser 2 named-entity-recognition 2 morphological-analysis 2 syntax 2 hunlp 2 hungarian 2 dependency-parsing 2 text cleaner 2 text preprocessing 2 ai 2 artificial intelligence 2 spacy model 2 word vectors 2 parallel-corpora 1 parallel-corpus 1 parallel-sentence-mining 1 sentence-alignment 1 data augmentation 1 machine-translation 1 alignment 1 text alignment 1 parallel corpora 1 text similarity 1