Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "text mining" keyword

Top 6.6% on pypi.org
huspacy 0.11.0 💰
HuSpaCy: industrial strength Hungarian natural language processing
21 versions - Latest release: 7 months ago - 1 dependent package - 6 dependent repositories - 933 downloads last month - 142 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
texthero 1.1.0
Text preprocessing, representation and visualization from zero to hero.
10 versions - Latest release: almost 3 years ago - 1 dependent package - 29 dependent repositories - 8.49 thousand downloads last month - 2,869 stars on GitHub - 1 maintainer
textherox 1.2.0
Text preprocessing, representation and visualization from zero to hero.
1 version - Latest release: over 1 year ago - 30 downloads last month - 2,865 stars on GitHub - 1 maintainer
20220429-pdfminer-jameslp310 0.0.2
PDF parser and analyzer
1 version - Latest release: about 2 years ago - 1 dependent repositories - 82 downloads last month - 5,496 stars on GitHub - 1 maintainer
Top 0.5% on pypi.org
pdfminer.six 20231228
PDF parser and analyzer
26 versions - Latest release: 5 months ago - 162 dependent packages - 2,496 dependent repositories - 3.66 million downloads last month - 5,496 stars on GitHub - 3 maintainers
e.pdfminer.six 0.0.1
PDF parser and analyzer
1 version - Latest release: over 4 years ago - 64 downloads last month - 5,496 stars on GitHub - 1 maintainer
grub 0.1.3
A ridiculously simple search engine factory
17 versions - Latest release: almost 2 years ago - 4 dependent repositories - 578 downloads last month - 2 stars on GitHub - 1 maintainer
easylda 0.2.7
easily bult LDA Topic Models with just a list of docs (e.g. a list of twitter posts in CSV/TXT
48 versions - Latest release: over 6 years ago - 1 dependent repositories - 408 downloads last month - 3 stars on GitHub - 1 maintainer
dsutils 0.2.0
data science utils for data preprocessing for feeding various models, pipelining, time data forma...
20 versions - Latest release: about 6 years ago - 1 dependent repositories - 199 downloads last month - 0 stars on GitHub - 1 maintainer
vadersentiment-swedish 1.0.3
VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon an...
3 versions - Latest release: over 4 years ago - 1 dependent repositories - 68 downloads last month - 6 stars on GitHub - 1 maintainer
awessome 0.0.14
awessome
6 versions - Latest release: over 3 years ago - 1 dependent repositories - 51 downloads last month - 2 stars on GitHub - 1 maintainer
orange3-textable-prototypes 0.26
Additional widgets for the Textable add-on to Orange 3.
21 versions - Latest release: over 2 years ago - 1 dependent repositories - 195 downloads last month - 5 stars on GitHub - 1 maintainer
bent 0.0.62
BENT: Biomedical Entity Annotator
52 versions - Latest release: 3 days ago - 1 dependent repositories - 1.92 thousand downloads last month - 9 stars on GitHub - 1 maintainer
fummytransformers 0.0.18
Fast and dummy way of using transformers to establish quick baselines
17 versions - Latest release: over 1 year ago - 109 downloads last month - 0 stars on GitHub - 1 maintainer
pdfminer.rtl 1.0.1
PDF parser and analyzer
8 versions - Latest release: about 1 month ago - 289 downloads last month - 1 maintainer
material-parser 1.2
Grobid superconductors tools material parser
2 versions - Latest release: about 1 year ago - 19 downloads last month - 6 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
pyxpdf 0.2.3
Powerful and Pythonic PDF processing library based on xpdf-4.02
6 versions - Latest release: over 3 years ago - 3 dependent repositories - 885 downloads last month - 38 stars on GitHub - 1 maintainer
frenchnlp 0.2.3
State of the art toolchain for natural language processing in French
13 versions - Latest release: almost 3 years ago - 1 dependent repositories - 88 downloads last month - 1 stars on GitHub - 1 maintainer
spacy-wrap 1.4.5
Wrappers for including pre-trained transformers in spaCy pipelines
21 versions - Latest release: 7 months ago - 3 dependent packages - 1 dependent repositories - 604 downloads last month - 46 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
pdfminer 20131022
PDF parser and analyzer
41 versions - Latest release: 9 months ago - 46 dependent packages - 1,423 dependent repositories - 225 thousand downloads last month - 5,143 stars on GitHub - 1 maintainer
nlpbaselines 0.0.49
Quickly establish strong baselines for NLP tasks
27 versions - Latest release: 6 months ago - 1 dependent repositories - 175 downloads last month - 0 stars on GitHub - 1 maintainer
textdescriptives 2.8.1
A library for calculating a variety of features from text using spaCy
35 versions - Latest release: 12 days ago - 4 dependent packages - 1 dependent repositories - 2.26 thousand downloads last month - 291 stars on GitHub - 2 maintainers
pdfdocx 1.7
读取pdf、docx文件,返回文件内的文本数据。
8 versions - Latest release: 8 months ago - 1 dependent repositories - 437 downloads last month - 1 stars on GitHub - 1 maintainer
orange3-textable 3.1.11
Textable add-on for Orange 3 data mining software package.
29 versions - Latest release: almost 3 years ago - 1 dependent repositories - 1.95 thousand downloads last month - 23 stars on GitHub - 1 maintainer
cntext 1.9.0
Chinese text analysis library, which can perform word frequency statistics, dictionary expansion,...
35 versions - Latest release: 5 months ago - 782 downloads last month - 213 stars on GitHub - 1 maintainer
lttl 2.0.12
LangTech Text Library (LTTL) for text processing and analysis
23 versions - Latest release: over 3 years ago - 1 dependent repositories - 1.45 thousand downloads last month - 3 stars on GitHub - 1 maintainer
grobid-quantities-client 0.4.0
A minimal client for grobid-quantities service.
5 versions - Latest release: over 1 year ago - 2 dependent packages - 2 dependent repositories - 53 downloads last month - 1 maintainer
Top 9.5% on pypi.org
augmenty 1.4.4
An augmentation library based on SpaCy for joint augmentation of text and labels.
33 versions - Latest release: 2 months ago - 4 dependent packages - 1 dependent repositories - 432 downloads last month - 147 stars on GitHub - 1 maintainer
textmining-module 0.1.4
A Python Module for Comprehensive Text Mining, including Keyword Extraction and Text Analysis.
5 versions - Latest release: 3 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
storynavigator 0.0.19
Narrative analysis add-on for the Orange 3 data mining software package.
18 versions - Latest release: about 1 month ago - 167 downloads last month - 0 stars on GitHub - 1 maintainer
seesus 1.2.1
a social, environmental, and economic sustainability classifier based on the UN Sustainable Devel...
4 versions - Latest release: about 1 month ago - 56 downloads last month - 3 stars on GitHub - 1 maintainer
seedot 3.0
SEEDOT: Tool for Enhancing Sentiment Lexicon with Machine Learning
2 versions - Latest release: 11 months ago - 17 downloads last month - 2 stars on GitHub - 1 maintainer
grogu1 1.1
Pacchetto contentente 4 dizionari che fungono da miglioramento di Vader per argomanti specifici, ...
1 version - Latest release: 11 months ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
textprepro 0.0.1
Everything Everyway All At Once Text Preprocessing.
2 versions - Latest release: about 1 year ago - 24 downloads last month - 1 stars on GitHub - 1 maintainer
bibx 0.0.1a9
Python bibliometric tools.
10 versions - Latest release: 2 months ago - 1 dependent repositories - 44 downloads last month - 1 maintainer
textana4sc 0.4
文本分析库,可对文本进行词频统计、词典扩充、情绪分析等
4 versions - Latest release: about 1 year ago - 11 downloads last month - 1 maintainer
hades-nlp 0.1.2
Homologous Automated Document Exploration and Summarization - A powerful tool for comparing simil...
3 versions - Latest release: 10 months ago - 36 downloads last month - 7 stars on GitHub - 2 maintainers
textanalyze4sc 2.0
文本分析库,可对文本进行词频统计、词典扩充、情绪分析等
7 versions - Latest release: about 1 year ago - 10 downloads last month - 1 maintainer
leia-br 0.0.1
LeIA (Léxico para Inferência Adaptada) é um fork do léxico e ferramenta para análise de sentiment...
1 version - Latest release: over 1 year ago - 1 dependent repositories - 187 downloads last month - 2 stars on GitHub - 1 maintainer
vadernew 2.0
Pacchetto contentente 4 dizionari che fungono da miglioramento di Vader per argomanti specifici, ...
9 versions - Latest release: almost 2 years ago - 1 dependent repositories - 8 downloads last month - 0 stars on GitHub - 1 maintainer
yapdfminer 1.2.2
PDF parser and analyzer
8 versions - Latest release: almost 5 years ago - 1 dependent repositories - 82 downloads last month - 2 stars on GitHub - 1 maintainer
vader-sentiment 3.2.1
VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon an...
2 versions - Latest release: over 5 years ago - 5 dependent repositories - 342 downloads last month - 1 stars on GitHub - 1 maintainer
vader-multi 3.2.2 💰
VADER Sentiment Analysis. VADER (Valence Aware Dictionary and sEntiment Reasoner) is a lexicon an...
2 versions - Latest release: over 4 years ago - 2 dependent repositories - 795 downloads last month - 16 stars on GitHub - 1 maintainer
utilss 0.1.7
Useful tools to work with text mining in Python
6 versions - Latest release: about 4 years ago - 1 dependent repositories - 100 downloads last month - 0 stars on GitHub - 1 maintainer
textdatasetcleaner 0.0.6
Pipeline for cleaning (preprocessing/normalizing) text datasets
4 versions - Latest release: over 3 years ago - 1 dependent repositories - 31 downloads last month - 38 stars on GitHub - 1 maintainer
swinger 2.1
A sentiment classifier for Chinese
12 versions - Latest release: almost 7 years ago - 1 dependent repositories - 52 downloads last month - 36 stars on GitHub - 1 maintainer
slang 0.1.12
A structural approach to signal ML
20 versions - Latest release: 7 months ago - 2 dependent repositories - 146 downloads last month - 5 stars on GitHub - 1 maintainer
simtext 1.3
文本、文档相似性计算
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 127 downloads last month - 11 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
quantulum3 0.9.1 💰
Extract quantities from unstructured text.
40 versions - Latest release: about 1 month ago - 8 dependent packages - 44 dependent repositories - 167 thousand downloads last month - 130 stars on GitHub - 1 maintainer
quantulum 0.1.16
Extract quantities from unstructured text.
17 versions - Latest release: 9 months ago - 1 dependent package - 4 dependent repositories - 52 downloads last month - 119 stars on GitHub - 1 maintainer
pybursts 0.1.1
A Python port of the 'burst detection' algorithm by Kleinberg, originally implemented in R
2 versions - Latest release: over 9 years ago - 4 dependent repositories - 17 downloads last month - 1 maintainer
pdf-wrangler 0.0.31
PDFMiner Wrapper for extractions
9 versions - Latest release: over 2 years ago - 1 dependent repositories - 58 downloads last month - 1 stars on GitHub - 1 maintainer
pdfminer.six-i 20190823
PDF parser and analyzer
5 versions - Latest release: over 4 years ago - 1 dependent repositories - 869 downloads last month - 2 maintainers
pdfminer.aemc 20231229
PDF parser and analyzer
9 versions - Latest release: 12 days ago - 1 dependent package - 200 downloads last month - 1 maintainer
pdfminer.hitalent 20221118
PDF parser and analyzer
20 versions - Latest release: over 1 year ago - 159 downloads last month - 1 maintainer
pdfmajor 1.3.13
PDF parser
36 versions - Latest release: over 4 years ago - 1 dependent repositories - 285 downloads last month - 21 stars on GitHub - 1 maintainer
orange-textable-prototypes 0.1.6
Extra widgets for the Textable text analysis package.
6 versions - Latest release: over 7 years ago - 1 dependent repositories - 86 downloads last month - 1 stars on GitHub - 1 maintainer
orange-text 1.2a1
Orange Text Mining add-on for Orange data mining software package.
1 version - Latest release: almost 12 years ago - 1 dependent repositories - 33 downloads last month - 2 maintainers
orange-textable 2.0.1
Textable add-on for Orange 2.7 data mining software package.
18 versions - Latest release: over 7 years ago - 1 dependent repositories - 279 downloads last month - 1 maintainer
mledu 0.0.2
build machine learning models for education purpose
4 versions - Latest release: about 6 years ago - 1 dependent repositories - 58 downloads last month - 1 maintainer
Top 6.4% on pypi.org
metapy 0.2.13
Python bindings for MeTA
24 versions - Latest release: almost 6 years ago - 30 dependent repositories - 2.37 thousand downloads last month - 50 stars on GitHub - 2 maintainers
lingualytics 0.1.3
A multilingual text analytics package.
4 versions - Latest release: over 3 years ago - 3 dependent repositories - 51 downloads last month - 36 stars on GitHub - 1 maintainer
jgtextrank 0.1.6
Yet another Python implementation of TextRank: package for the creation, manipulation, and study ...
5 versions - Latest release: over 4 years ago - 1 dependent repositories - 52 downloads last month - 13 stars on GitHub - 1 maintainer
gorpy 2.0.4
Grep tool with extensions for reading files in many different ways
9 versions - Latest release: 9 months ago - 1 dependent repositories - 67 downloads last month - 0 stars on GitHub - 1 maintainer
decat 1.0.3
De-concatenate strings that do not have white-spaces.
4 versions - Latest release: about 1 year ago - 1 dependent repositories - 33 downloads last month - 0 stars on GitHub - 1 maintainer
chemdataextractor-api 0.0.1
Chemdataextractor REST API wrapper
2 versions - Latest release: about 3 years ago - 1 dependent repositories - 17 downloads last month - 0 stars on GitHub - 1 maintainer
bagofconcepts 0.1.0
This is python implementation of Bag-of-Concepts, as proposed by the paper "Bag-of-Concepts: Comp...
2 versions - Latest release: almost 2 years ago - 31 downloads last month - 20 stars on GitHub - 1 maintainer
statcamp 0.0.2
stat funcs from datacamp stat thinking
2 versions - Latest release: over 6 years ago - 1 dependent repositories - 25 downloads last month - 0 stars on GitHub - 1 maintainer
lexis 0.1.2
Wordnet wrapper - Easy access to words and their relationships
5 versions - Latest release: about 1 year ago - 2 dependent packages - 1 dependent repositories - 98 downloads last month - 1 stars on GitHub - 1 maintainer
multistop 1.3
文本分析停用词表,支持中英德法等15种语言。
1 version - Latest release: almost 2 years ago - 1 dependent repositories - 47 downloads last month - 1 maintainer
Top 5.9% on pypi.org
pdfminer2 20151206 💰
PDF parser and analyzer
1 version - Latest release: over 8 years ago - 3 dependent packages - 37 dependent repositories - 65.5 thousand downloads last month - 24 stars on GitHub - 1 maintainer
pdfminer-cython 20200304
PDF parser and analyzer
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 6 downloads last month - 5,143 stars on GitHub - 1 maintainer
pdfminer-with-logger 1.0.0
PDF parser and analyzer
1 version - Latest release: over 3 years ago - 1 dependent repositories - 22 downloads last month - 5,142 stars on GitHub - 1 maintainer
keypartx 0.1.20
A Graph-based Perception(Text) Representation
40 versions - Latest release: about 1 year ago - 173 downloads last month - 33 stars on GitHub - 1 maintainer
huspacy-nightly 0.11.0.dev261 💰
HuSpaCy: industrial strength Hungarian natural language processing
126 versions - Latest release: 5 months ago - 1 dependent repositories - 275 downloads last month - 142 stars on GitHub - 1 maintainer
Related Keywords
nlp 25 text analysis 20 natural language processing 17 pdf parser 15 pdf converter 14 text-mining 12 layout analysis 12 sentiment analysis 12 python 11 sentiment 10 natural-language-processing 9 machine-learning 9 text 8 mining 8 opinion 8 analysis 8 data 8 opinion analysis 8 twitter sentiment 8 opinion mining 8 social media 8 twitter 8 social 8 media 8 NLP 8 pdf 7 machine learning 7 vader 7 text processing 6 spacy 5 parsing 5 data mining 5 text analytics 4 textable 4 data science 4 corpus 4 text representation 4 text preprocessing 4 information-extraction 4 text similarity 4 french 3 npl 3 named entity recognition 3 transformers 3 natual language processing 3 spaCy 3 tagging 3 bert 3 named-entity-recognition 3 spacy-models 3 spacy-pipeline 3 measurements 3 orange3 add-on 3 text visualization 3 orange add-on 3 orange 3 text-preprocessing 3 text-representation 3 orange3 3 spacy-extension 3 information extraction 3 parser 3 normalization 2 text-analytics 2 regular-expressions 2 hacktoberfest 2 Natural Language Processing 2 physics 2 quantities 2 units 2 units-of-measure 2 huspacy 2 keywords-extraction 2 pytorch 2 python3 2 spacy-nlp 2 clustering 2 text-classification 2 statistics 2 chinese 2 word-embeddings 2 word vectors 2 word embeddings 2 texthero 2 text-visualization 2 text-clustering 2 nlp-pipeline 2 hungarian 2 dependency-parsing 2 topic modeling 2 spacy model 2 ner 2 lemmatization 2 pos tagging 2 sentence splitting 2 sbd 2 sentence boundary detection 2 topic-modeling 2 tokenization 2 language processing 2