Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "cleaning" keyword
folderclean 1.2
Clean your folders in a single line of code! FolderCleans lets you for example clean your folders...
1 version - Latest release: almost 4 years ago - 1 dependent repository - 13 downloads last month - 1 star on GitHub - 1 maintainer
openclean-geo 0.1.0
Geo-Spatial extension for the open data cleaning library
1 version - Latest release: about 3 years ago - 1 dependent repository - 25 downloads last month - 2 stars on GitHub - 2 maintainers
neatbook 0.20
One line of code that makes a notebook that writes code that writes code.
19 versions - Latest release: about 6 years ago - 1 dependent repository - 62 downloads last month - 1 star on GitHub - 2 maintainers
pcleaner 2.6.2 💰
An AI-powered tool to clean manga panels.
36 versions - Latest release: 2 days ago - 361 downloads last month - 146 stars on GitHub - 1 maintainer
tnkeeh 0.0.9
Arabic cleaning, normalization and segmentation library.
9 versions - Latest release: about 3 years ago - 2 dependent repositories - 282 downloads last month - 45 stars on GitHub - 2 maintainers
datagovernance 0.1
A very basic cleaning data
1 version - Latest release: almost 3 years ago - 1 dependent repository - 8 downloads last month - 2 maintainers
cleanmydata 1.0.6
Library for data cleaning operations
6 versions - Latest release: over 1 year ago - 36 downloads last month - 0 stars on GitHub - 2 maintainers
dpyp 1.0.0
A pandas convenience wrapper for small-scale data pipelines
1 version - Latest release: 11 days ago - 192 downloads last month - 2 stars on GitHub - 2 maintainers
cocorepr 0.1.0
COCO Dataset cleaning tool
29 versions - Latest release: almost 3 years ago - 62 downloads last month - 2 stars on GitHub - 2 maintainers
neatdata 0.18
Cleaning code
17 versions - Latest release: about 6 years ago - 1 dependent repository - 50 downloads last month - 0 stars on GitHub - 2 maintainers
vivek2dropoffnan 0.0.1
A simple package to perform basic data cleaning in EDA
1 version - Latest release: about 3 years ago - 1 dependent repository - 9 downloads last month - 2 maintainers
arrangepy 1.1.2 💰
Arrange your files in distinct folders and help clean your PC
3 versions - Latest release: about 3 years ago - 1 dependent repository - 41 downloads last month - 24 stars on GitHub - 1 maintainer
datagovernancenew 0.0.2
A very basic cleaning data
1 version - Latest release: almost 3 years ago - 1 dependent repository - 16 downloads last month - 2 maintainers
claydates 1.0.6
Package used for cleaning, restructuring, logging, and plotting of financial data retrieved from ...
7 versions - Latest release: over 1 year ago - 67 downloads last month - 0 stars on GitHub - 2 maintainers
dataprep 0.4.5
Top 2.8% on pypi.org
Dataprep: Data Preparation in Python
33 versions - Latest release: over 1 year ago - 1 dependent package - 55 dependent repositories - 43.6 thousand downloads last month - 1,902 stars on GitHub - 8 maintainers
predf 0.1
Library with common utility functions for preprocessing the data
1 version - Latest release: about 4 years ago - 1 dependent repository - 11 downloads last month - 1 star on GitHub - 1 maintainer
openclean-metanome 0.2.0
openclean Metanome Python Package
2 versions - Latest release: almost 3 years ago - 1 dependent repository - 25 downloads last month - 2 stars on GitHub - 2 maintainers
py-autoclean 1.1.3
Top 9.5% on pypi.org
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets
21 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 496 downloads last month - 213 stars on GitHub - 1 maintainer
py-autocleanre 1.1.4
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets
1 version - Latest release: about 1 year ago - 21 downloads last month - 213 stars on GitHub - 2 maintainers
substitutionstring 0.2.0
Manipulate substitution of string, as for instance deletion and insertion, without loss of inform...
1 version - Latest release: over 2 years ago - 1 dependent repository - 14 downloads last month - 0 stars on framagit.org - 2 maintainers
vacuum-cleaner 0.1.3 💰
Deep Vacuum Cleaner
5 versions - Latest release: almost 6 years ago - 1 dependent repository - 63 downloads last month - 2 stars on GitHub - 1 maintainer
exchangecleaning 1.0.0
Clean exchange data
1 version - Latest release: almost 2 years ago - 6 downloads last month - 2 maintainers
company-name-matching 0.4.3
Returns a score of 2 companies to be the same
40 versions - Latest release: over 2 years ago - 391 downloads last month - 2 maintainers
clean-df 0.3.0
Python module to report, clean, and optimize Pandas Dataframes effectively
7 versions - Latest release: 8 months ago - 70 downloads last month - 3 stars on GitHub - 2 maintainers
company-name-matching2 0.0.6
Returns a score of 2 companies to be the same
4 versions - Latest release: over 2 years ago - 49 downloads last month - 2 maintainers
githubdata 16.1.2
A simple Python package to easily download from and manage a GitHub "Data repository"
53 versions - Latest release: 7 months ago - 3 dependent packages - 8 dependent repositories - 460 downloads last month - 0 stars on GitHub - 2 maintainers
dwm 1.1.0
Best practices for marketing data quality management
11 versions - Latest release: almost 6 years ago - 2 dependent repositories - 37 downloads last month - 12 stars on GitHub - 4 maintainers
prep-buddy 0.5.11
A library for cleaning, transforming and executing all other preparation tasks for large datasets...
3 versions - Latest release: over 7 years ago - 1 dependent repository - 8 downloads last month - 8 stars on GitHub - 1 maintainer
mordineznlp 0.1.0
Powerful Python tool for modern NLP processing
34 versions - Latest release: about 2 years ago - 1 dependent repository - 118 downloads last month - 2 stars on GitHub - 2 maintainers
chunkyp 0.0.2
Ray-based preprocessing pipeline.
2 versions - Latest release: over 3 years ago - 1 dependent repository - 22 downloads last month - 0 stars on GitHub - 1 maintainer
clean-plot 0.0.13
clean_plot simplifies cleaning text files for creating embeddings and making plots from them
12 versions - Latest release: over 1 year ago - 1 dependent repository - 64 downloads last month - 3 stars on GitHub - 1 maintainer
listwise 1.0.4
ListWise.com email validation wrapper.
1 version - Latest release: over 7 years ago - 1 dependent repository - 11 downloads last month - 1 star on GitHub - 2 maintainers
shawarma 0.0.1
A simple package to perform basic data cleaning in EDA
1 version - Latest release: about 3 years ago - 1 dependent repository - 15 downloads last month - 2 maintainers
tddc 0.1.1
Scaffold out methods and tests for collaborative data cleaning.
2 versions - Latest release: over 7 years ago - 1 dependent repository - 5 downloads last month - 5 stars on GitHub - 2 maintainers
openclean-core 0.4.1
Top 8.7% on pypi.org
Library for data cleaning and data profiling
8 versions - Latest release: almost 3 years ago - 5 dependent repositories - 544 downloads last month - 61 stars on GitHub - 2 maintainers
mirutil 25.1.0
Tools for getting and cleaning TSE data
185 versions - Latest release: 7 months ago - 3 dependent packages - 6 dependent repositories - 149 downloads last month - 1 star on GitHub - 1 maintainer
dataglitch 0.0.2
DataGlitch is a Python package designed to address common data challenges, including handling mix...
2 versions - Latest release: 12 months ago - 12 downloads last month - 0 stars on GitHub - 2 maintainers
clean-docstrings 0.1
Utility functions to clean docstrings in various programming languages
1 version - Latest release: almost 3 years ago - 1 dependent repository - 8 downloads last month - 0 stars on GitHub - 2 maintainers
banglanltk 0.0.4
Bangla Natural Language Processing Toolkit
2 versions - Latest release: over 3 years ago - 282 downloads last month - 2 maintainers
autoclean 1.0.2
A library for cleaning text data
1 version - Latest release: about 2 years ago - 1 dependent repository - 165 downloads last month - 1 star on GitHub - 1 maintainer
tubelearns 2.1.0
Python script for extracting, cleaning, and tokenizing YouTube video transcripts for Pre-Processi...
8 versions - Latest release: about 2 months ago - 55 downloads last month - 4 stars on GitHub - 4 maintainers
datacleaner 0.1.5
Top 5.2% on pypi.org
A Python tool that automatically cleans data sets and readies them for analysis.
6 versions - Latest release: over 7 years ago - 1 dependent package - 19 dependent repositories - 313 downloads last month - 1,039 stars on GitHub - 2 maintainers
takin 1.1.4
A Python Toolkit for File Processing, Text Cleaning and Data Splitting
4 versions - Latest release: over 1 year ago - 1 dependent repository - 14 downloads last month - 23 stars on GitHub - 2 maintainers
clean-html-for-llm 1.0.0
A library for cleaning HTML content by removing specified tags and attributes.
1 version - Latest release: 23 days ago - 1 maintainer
neverbounce-sdk 4.3.0
Official Python SDK for the NeverBounce API
18 versions - Latest release: over 3 years ago - 1 dependent repository - 45.4 thousand downloads last month - 14 stars on GitHub - 4 maintainers
datadoctor 1.0.15
A Python package for data cleaning and preprocessing.
14 versions - Latest release: 11 months ago - 42 downloads last month - 2 stars on GitHub - 2 maintainers
dripper 0.3.1
Cleaning your messy data.
7 versions - Latest release: about 9 years ago - 3 dependent repositories - 276 downloads last month - 8 stars on GitHub - 2 maintainers
visuallayer 0.0.15
Open, Clean Datasets for Computer Vision.
5 versions - Latest release: 11 months ago - 21 downloads last month - 64 stars on GitHub - 2 maintainers
kmor 1.0.7
K-means clustering with outlier removal, NumPy implementation
8 versions - Latest release: almost 4 years ago - 1 dependent repository - 23 downloads last month - 6 stars on GitHub - 2 maintainers
activedetect 0.1.3
A Library For Error Detection For Predictive Analytics
6 versions - Latest release: over 7 years ago - 1 dependent repository - 14 downloads last month - 10 stars on GitHub - 1 maintainer
pdcheckers 0.1
Investigate consistency and dirtiness of a pandas DataFrame
1 version - Latest release: over 1 year ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
etl-toolbox 0.0.3
Useful ETL functions for Python
3 versions - Latest release: over 3 years ago - 1 dependent repository - 28 downloads last month - 2 stars on GitHub - 2 maintainers
augmenting 0.0.0
An image dataset augmentation package.
1 version - Latest release: about 1 year ago - 1 dependent repository - 8 downloads last month - 2 maintainers
cleanliness 0.1.1
Basic cleaning of text
2 versions - Latest release: over 5 years ago - 1 dependent repository - 30 downloads last month - 0 stars on GitHub - 2 maintainers
prefea 0.1
Library with common utility functions for preprocessing the data
1 version - Latest release: about 4 years ago - 1 dependent repository - 6 downloads last month - 1 star on GitHub - 2 maintainers
vl-datasets 0.0.11
Open, Clean Datasets for Computer Vision.
11 versions - Latest release: 12 months ago - 56 downloads last month - 63 stars on GitHub - 2 maintainers
preio 0.2
Library with common utility functions for preprocessing the data
2 versions - Latest release: about 4 years ago - 1 dependent repository - 6 downloads last month - 1 star on GitHub - 1 maintainer
pyrmt 0.1.0
Python for Random Matrix Theory: cleaning schemes for noisy correlation matrices
1 version - Latest release: almost 7 years ago - 1 dependent package - 1 dependent repository - 23 downloads last month - 38 stars on GitHub - 2 maintainers
omni-ai 0.0.1
Python library for Automated Data Cleaning, Automated Preprocessing, Automated Visualization and ...
1 version - Latest release: about 3 years ago - 1 dependent repository - 7 downloads last month - 2 maintainers
data-cleaning-pb 1.0 removed
Top 9.0% on pypi.org
Library for data cleaning operations
1 version - Latest release: over 1 year ago - 45 downloads last month
tubelearn 1.0.0 removed
Python script for extracting and cleaning YouTube video transcripts for Pre-Processing in machine...
1 version - Latest release: 7 months ago - 1 maintainer
tidytweets 0.15 removed
Clean tweets to perform various NLP tasks such as topic analysis, word embeddings, sentiment anal...
2 versions - Latest release: 10 months ago - 115 downloads last month - 0 stars on GitHub - 4 maintainers
dcleaning 0.0.1 removed
This library to facilitate the process of cleaning data where you can delete duplicates, fill nul...
1 version - Latest release: about 1 year ago
mir-tse 20220809.2 removed
Tools for getting and cleaning TSE data
7 versions - Latest release: over 1 year ago
exchangebankcleaning 1.0.0 removed
Clean exchangebank data
1 version - Latest release: almost 2 years ago - 1 maintainer
Related Keywords
data (29)
python (14)
data-science (8)
pandas (7)
preprocessing (6)
machine learning (5)
nlp (5)
machine-learning (4)
NLP (4)
dataset (4)
text (4)
data_cleaning (3)
Tehran (3)
TSE (3)
Stocks (3)
Finance (3)
processing (3)
automated (3)
opensource (3)
feature (3)
data science (3)
vision (2)
plotting (2)
generative (2)
datasets-preparation (2)
computer-vision (2)
computer (2)
data-centric (2)
autoclean (2)
natural language processing (2)
computer vision (2)
normalization (2)
pre-processing (2)
learning (2)
companies (2)
matching (2)
duplicates (2)
names (2)
wrangling (2)
raw data (2)
transcript (2)
video (2)
validation (2)
email (2)
tokenizing (2)
automation (2)
data cleaning (2)
tool (2)
classification (2)
clean (2)
cleansing (1)
data-splitting (1)
file-processing (1)
text-cleaning (1)
html (1)
neverbounce (1)
api (1)
verification (1)
machine (1)
ML (1)
numpy (1)
scikit-learn (1)
fuzzywuzzy (1)
Artificial (1)
Intelligence (1)
AI (1)
collaborative (1)
package (1)
engineering (1)
source code (1)
comment (1)
comment cleaning (1)
bangla (1)
tagging (1)
stemming (1)
synonym (1)
unsupervised (1)
segmentation (1)
code generation (1)
automated machine learning (1)
lifeeasy (1)
machine-learning-datasets (1)
file (1)
splitting (1)
training (1)
deep learning (1)
applied-mathematics (1)
correlation-matrices (1)
noise-reduction (1)
random-matrix-theory (1)
portfolio-optimization (1)
visualization (1)
modelling (1)
Text (1)
Twitter (1)
API (1)
natural-language-processing (1)
tweet-analysis (1)
tweets (1)
twitter (1)