An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "text-preprocessing" keyword

Top 1.8% on pypi.org
clean-text 0.7.1
Functions to preprocess and normalize text.
9 versions - Latest release: about 1 month ago - 15 dependent packages - 97 dependent repositories - 126 thousand downloads last month - 1,002 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
trafilatura 2.0.0 💰
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction...
50 versions - Latest release: over 1 year ago - 71 dependent packages - 63 dependent repositories - 3.92 million downloads last month - 5,440 stars on GitHub - 1 maintainer
nlp-preprocessing 0.2.0
A Package for text preprocessing
14 versions - Latest release: over 5 years ago - 1 dependent repositories - 60 downloads last month - 16 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
text-curation 1.6.0
Deterministic, profile-based text curation pipelines for Hugging Face Datasets
14 versions - Latest release: 11 days ago - 532 downloads last month - 2 stars on GitHub - 1 maintainer
textfeature 0.0.10
transform unstructured text to feature vector using word2vec, lexicon and ...
4 versions - Latest release: almost 5 years ago - 1 dependent repositories - 31 downloads last month - 0 stars on GitHub - 1 maintainer
kkltk 1.0
kkltk is a toolkit designed for Kinyarwanda and Kirundi languages processing
1 version - Latest release: over 5 years ago - 1 dependent repositories - 8 downloads last month - 1 stars on GitHub - 1 maintainer
bangla-postagger 0.13.0
A Bangla Parts of Speech Tagger using Bangla-English Alignment
12 versions - Latest release: over 3 years ago - 1 dependent repositories - 193 downloads last month - 0 stars on GitHub - 1 maintainer
Top 9.5% on pypi.org
text-preprocessing 0.1.1 💰
A python package for text preprocessing task in natural language processing
8 versions - Latest release: over 3 years ago - 2 dependent repositories - 1.12 thousand downloads last month - 63 stars on GitHub - 1 maintainer
python-mecab 1.0.1
A repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
4 versions - Latest release: about 6 years ago - 1 dependent repositories - 59 downloads last month - 28 stars on GitHub - 1 maintainer
mnlp 0.1.2
Mongolian Natural Language Processing Module.
3 versions - Latest release: almost 7 years ago - 1 dependent repositories - 38 downloads last month - 6 stars on GitHub - 1 maintainer
textprepro 0.0.1
Everything Everyway All At Once Text Preprocessing.
2 versions - Latest release: almost 3 years ago - 13 downloads last month - 2 stars on GitHub - 1 maintainer
data-preprocessors 0.58.0
An easy to use tool for Data Preprocessing specially for Text Preprocessing
48 versions - Latest release: over 1 year ago - 1 dependent repositories - 533 downloads last month - 2 stars on GitHub - 1 maintainer
gotext 0.9.5
GoText is a universal text extraction and preprocessing tool for python which supportss wide vari...
2 versions - Latest release: about 4 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 1 maintainer
jange 0.1.7
Easy NLP library for Python
8 versions - Latest release: over 4 years ago - 1 dependent repositories - 30 downloads last month - 18 stars on GitHub - 1 maintainer
templatext 0.0.2
Text preprocessing template for NLP.
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 9 downloads last month - 0 stars on GitHub - 1 maintainer
mim-nlp 0.2.1
A Python package with ready-to-use models for various NLP tasks and text preprocessing utilities....
2 versions - Latest release: over 1 year ago - 14 downloads last month - 2 stars on GitHub - 2 maintainers
Top 3.0% on pypi.org
texthero 1.1.0
Text preprocessing, representation and visualization from zero to hero.
10 versions - Latest release: over 4 years ago - 1 dependent package - 29 dependent repositories - 2.07 thousand downloads last month - 2,908 stars on GitHub - 1 maintainer
prenlp 0.0.13
Preprocessing Library for Natural Language Processing
12 versions - Latest release: over 5 years ago - 1 dependent repositories - 79 downloads last month - 159 stars on GitHub - 1 maintainer
textherox 1.2.0
Text preprocessing, representation and visualization from zero to hero.
1 version - Latest release: over 3 years ago - 13 downloads last month - 2,908 stars on GitHub - 1 maintainer
nlp-text-cleaner 1.0.11
Clean the text for NLP project
12 versions - Latest release: over 3 years ago - 1 dependent repositories - 102 downloads last month - 2 stars on GitHub - 1 maintainer
tpreprocessing 0.0.1
simple text preprocessing package that preprocess text by simple functions
1 version - Latest release: over 2 years ago - 6 downloads last month - 1 maintainer
vocabulous 0.1.3
A bootstrapping language detection system that builds dictionaries from noisy and ambiguous train...
4 versions - Latest release: 3 months ago - 42 downloads last month - 2 stars on GitHub - 1 maintainer
textgo 1.4
Let's go and play with text!
13 versions - Latest release: over 5 years ago - 1 dependent repositories - 69 downloads last month - 45 stars on GitHub - 1 maintainer
proces 0.1.7
text preprocess.
8 versions - Latest release: over 2 years ago - 2 dependent packages - 50 dependent repositories - 185 thousand downloads last month - 5 stars on GitHub - 1 maintainer
Related Keywords
nlp 17 natural-language-processing 8 text-cleaning 7 python 7 machine-learning 6 text-processing 5 text preprocessing 5 text 4 text-classification 4 text-mining 3 data-preprocessing 3 text mining 3 python3 3 text-representation 3 text representation 2 deep-learning 2 text visualization 2 nlp-pipeline 2 NLP 2 musfiqdehan 2 data-science 2 text-clustering 2 data-preprocessors 2 text-visualization 2 texthero 2 nlp-library 2 word-embeddings 2 llm 2 scraping 2 python-package 2 lemmatization 1 summarization 1 seq2seq 1 text-regression 1 transfer-learning 1 neural-network 1 language 1 preprocessing 1 visualization 1 topic-modeling 1 tei-xml 1 text-extraction 1 clustering 1 text analytics 1 text processing 1 similarity-score 1 datacleaning 1 text utils 1 document extraction 1 text-similarity 1 text-search 1 bert 1 multilingual-nlp 1 dataset 1 data-cleaning 1 language-detection 1 dictionary-building 1 bootstrapping 1 text cleaning 1 preprocessing-library 1 text-normalization 1 user-generated-content 1 corpus 1 html2text 1 news-crawler 1 scraper 1 pytorch 1 transformers 1 deduplication 1 text extraction 1 stopwords 1 news-aggregator 1 Kirundi 1 Kinyarwanda 1 computational linguistics 1 natural language processing 1 Low-resource languages 1 word2vec 1 text2vec 1 bag-of-words 1 curation 1 data-curation 1 datasets 1 huggingface 1 tokenization 1 rag 1 readability 1 spacy-nlp 1 rss-feed 1 tei 1 webscraping 1 textfile 1 Natural Language Processing 1 web-scraping 1 mongolian-text-classification 1 hacktoberfest 1 mongolian 1 tokenizer 1 python-c-extension 1 mecab 1