Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "cleaning" keyword

mirutil 25.1.0
Tools for getting and cleaning TSE data
185 versions - Latest release: 7 months ago - 3 dependent packages - 6 dependent repositories - 1.22 thousand downloads last month - 1 stars on GitHub - 1 maintainer
githubdata 16.1.2
A simple Python package to easily download from and manage a GitHub "Data repository"
53 versions - Latest release: 7 months ago - 3 dependent packages - 8 dependent repositories - 460 downloads last month - 0 stars on GitHub - 1 maintainer
company-name-matching 0.4.3
Returns a score of 2 companies to be the same
40 versions - Latest release: over 2 years ago - 391 downloads last month - 1 maintainer
pcleaner 2.6.3 💰
An AI-powered tool to clean manga panels.
37 versions - Latest release: 16 days ago - 585 downloads last month - 151 stars on GitHub - 1 maintainer
mordineznlp 0.1.0
Powerfull python tool for modern NLP processing
34 versions - Latest release: about 2 years ago - 1 dependent repositories - 118 downloads last month - 2 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
dataprep 0.4.5
Dataprep: Data Preparation in Python
33 versions - Latest release: almost 2 years ago - 1 dependent package - 55 dependent repositories - 43.6 thousand downloads last month - 1,902 stars on GitHub - 4 maintainers
cocorepr 0.1.0
COCO Dataset cleaning tool
29 versions - Latest release: about 3 years ago - 62 downloads last month - 2 stars on GitHub - 1 maintainer
Top 9.5% on pypi.org
py-autoclean 1.1.3
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets
21 versions - Latest release: almost 2 years ago - 1 dependent package - 1 dependent repositories - 429 downloads last month - 218 stars on GitHub - 1 maintainer
neatbook 0.20
One line of code that makes a notebook that writes code that writes code.
19 versions - Latest release: over 6 years ago - 1 dependent repositories - 62 downloads last month - 1 stars on GitHub - 1 maintainer
neverbounce-sdk 4.3.0
Official Python SDK for the NeverBounce API
18 versions - Latest release: almost 4 years ago - 1 dependent repositories - 58.1 thousand downloads last month - 15 stars on GitHub - 2 maintainers
neatdata 0.18
Cleaning code
17 versions - Latest release: over 6 years ago - 1 dependent repositories - 50 downloads last month - 0 stars on GitHub - 1 maintainer
datadoctor 1.0.15
A Python package for data cleaning and preprocessing.
14 versions - Latest release: 12 months ago - 50 downloads last month - 2 stars on GitHub - 1 maintainer
clean-plot 0.0.13
clean_plot simplifies cleaning text files for creation of embeddings and making plots from it
12 versions - Latest release: over 1 year ago - 1 dependent repositories - 64 downloads last month - 3 stars on GitHub - 1 maintainer
vl-datasets 0.0.11
Open, Clean Datasets for Computer Vision.
11 versions - Latest release: 12 months ago - 98 downloads last month - 64 stars on GitHub - 1 maintainer
dwm 1.1.0
Best practices for marketing data quality management
11 versions - Latest release: about 6 years ago - 2 dependent repositories - 37 downloads last month - 12 stars on GitHub - 2 maintainers
tnkeeh 0.0.9
Arabic cleaning, normalization and segmentation library.
9 versions - Latest release: about 3 years ago - 2 dependent repositories - 282 downloads last month - 45 stars on GitHub - 1 maintainer
Top 8.7% on pypi.org
openclean-core 0.4.1
Library for data cleaning and data profiling
8 versions - Latest release: almost 3 years ago - 5 dependent repositories - 578 downloads last month - 63 stars on GitHub - 1 maintainer
tubelearns 2.1.0
Python script for extracting, cleaning, and tokenizing YouTube video transcripts for Pre-Processi...
8 versions - Latest release: 2 months ago - 68 downloads last month - 5 stars on GitHub - 2 maintainers
kmor 1.0.7
K-means clustering with outlier removal numpy implementation
8 versions - Latest release: about 4 years ago - 1 dependent repositories - 31 downloads last month - 6 stars on GitHub - 1 maintainer
claydates 1.0.6
Package used for cleaning, restructuring, logging, and plotting of financial data retrieved from ...
7 versions - Latest release: over 1 year ago - 67 downloads last month - 0 stars on GitHub - 1 maintainer
mir-tse 20220809.2 removed
Tools for getting and cleaning TSE data
7 versions - Latest release: almost 2 years ago
dripper 0.3.1
Cleaning your messy data.
7 versions - Latest release: over 9 years ago - 3 dependent repositories - 388 downloads last month - 8 stars on GitHub - 1 maintainer
clean-df 0.3.0
Python module to report, clean, and optimize Pandas Dataframes effectively
7 versions - Latest release: 9 months ago - 70 downloads last month - 3 stars on GitHub - 1 maintainer
activedetect 0.1.3
A Library For Error Detection For Predictive Analytics
6 versions - Latest release: over 7 years ago - 1 dependent repositories - 42 downloads last month - 10 stars on GitHub - 1 maintainer
cleanmydata 1.0.6
Library for data cleaning operations
6 versions - Latest release: over 1 year ago - 36 downloads last month - 0 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
datacleaner 0.1.5
A Python tool that automatically cleans data sets and readies them for analysis.
6 versions - Latest release: over 7 years ago - 1 dependent package - 19 dependent repositories - 457 downloads last month - 1,039 stars on GitHub - 1 maintainer
visuallayer 0.0.15
Open, Clean Datasets for Computer Vision.
5 versions - Latest release: 11 months ago - 35 downloads last month - 65 stars on GitHub - 1 maintainer
vacuum-cleaner 0.1.3 💰
Deep Vacuum Cleaner
5 versions - Latest release: almost 6 years ago - 1 dependent repositories - 63 downloads last month - 2 stars on GitHub - 1 maintainer
takin 1.1.4
A Python Toolkit for File Processing, Text Cleaning and Data Splitting
4 versions - Latest release: over 1 year ago - 1 dependent repositories - 28 downloads last month - 25 stars on GitHub - 1 maintainer
company-name-matching2 0.0.6
Returns a score of 2 companies to be the same
4 versions - Latest release: over 2 years ago - 49 downloads last month - 1 maintainer
arrangepy 1.1.2 💰
Arrange your files in distinct folder, help you clean your PC
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 41 downloads last month - 24 stars on GitHub - 1 maintainer
etl-toolbox 0.0.3
Useful ETL functions for Python
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 42 downloads last month - 2 stars on GitHub - 1 maintainer
prep-buddy 0.5.11
A library for cleaning, transforming and executing all other preparation tasks for large datasets...
3 versions - Latest release: over 7 years ago - 1 dependent repositories - 8 downloads last month - 8 stars on GitHub - 1 maintainer
tidytweets 0.15 removed
Clean tweets to perform various NLP tasks such as topic analysis, word embeddings, sentiment anal...
2 versions - Latest release: 10 months ago - 115 downloads last month - 0 stars on GitHub - 2 maintainers
banglanltk 0.0.4
Bangla Natural Language Processing Toolkit
2 versions - Latest release: almost 4 years ago - 337 downloads last month - 1 maintainer
chunkyp 0.0.2
Ray-based preprocesisng pipeline.
2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 22 downloads last month - 0 stars on GitHub - 1 maintainer
dataglitch 0.0.2
DataGlitch is a Python package designed to address common data challenges, including handling mix...
2 versions - Latest release: almost 1 year ago - 12 downloads last month - 0 stars on GitHub - 1 maintainer
cleanliness 0.1.1
Basic cleaning of text
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 30 downloads last month - 0 stars on GitHub - 1 maintainer
openclean-metanome 0.2.0
openclean Metanome Python Package
2 versions - Latest release: almost 3 years ago - 1 dependent repositories - 25 downloads last month - 2 stars on GitHub - 1 maintainer
preio 0.2
Library with common utility functions for pre processing the data
2 versions - Latest release: about 4 years ago - 1 dependent repositories - 6 downloads last month - 1 stars on GitHub - 1 maintainer
tddc 0.1.1
Scaffold out methods and tests for collaborative data cleaning.
2 versions - Latest release: over 7 years ago - 1 dependent repositories - 19 downloads last month - 5 stars on GitHub - 1 maintainer
datagovernancenew 0.0.2
A very basic cleaning data
1 version - Latest release: about 3 years ago - 1 dependent repositories - 16 downloads last month - 1 maintainer
exchangecleaning 1.0.0
clean exchange data
1 version - Latest release: almost 2 years ago - 6 downloads last month - 1 maintainer
folderclean 1.2
Clean your folders in a single line of code! FolderCleans lets you for example clean your folders...
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 13 downloads last month - 1 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
data-cleaning-pb 1.0 removed
Library for data cleaning operations
1 version - Latest release: over 1 year ago - 45 downloads last month
autoclean 1.0.2
A library for cleaning text data
1 version - Latest release: over 2 years ago - 1 dependent repositories - 214 downloads last month - 1 stars on GitHub - 1 maintainer
dcleaning 0.0.1 removed
This library to facilitate the process of cleaning data where you can delete duplicates, fill nul...
1 version - Latest release: over 1 year ago
dpyp 1.0.0
A pandas convenience wrapper for small-scale data pipelines
1 version - Latest release: 29 days ago - 202 downloads last month - 2 stars on GitHub - 1 maintainer
pdcheckers 0.1
Investigate consistency and dirtiness of a pandas DataFrame
1 version - Latest release: over 1 year ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
py-autocleanre 1.1.4
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets
1 version - Latest release: about 1 year ago - 22 downloads last month - 219 stars on GitHub - 1 maintainer
clean-docstrings 0.1
Utility functions to clean docstrings in various programming languages
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 16 downloads last month - 0 stars on GitHub - 1 maintainer
tubelearn 1.0.0 removed
Python script for extracting and cleaning YouTube video transcripts for Pre-Processing in machine...
1 version - Latest release: 8 months ago - 1 maintainer
prefea 0.1
Library with common utility functions for pre processing the data
1 version - Latest release: about 4 years ago - 1 dependent repositories - 9 downloads last month - 1 stars on GitHub - 1 maintainer
substitutionstring 0.2.0
Manipulate substitution of string, as for instance deletion and insertion, without loss of inform...
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 14 downloads last month - 0 stars on framagit.org - 1 maintainer
datagovernance 0.1
A very basic cleaning data
1 version - Latest release: about 3 years ago - 1 dependent repositories - 8 downloads last month - 1 maintainer
listwise 1.0.4
ListWise.com email validation wrapper.
1 version - Latest release: over 7 years ago - 1 dependent repositories - 11 downloads last month - 1 stars on GitHub - 1 maintainer
omni-ai 0.0.1
Python library for Automated Data Cleaning, Automated Preprocessing, Automated Visualization and ...
1 version - Latest release: about 3 years ago - 1 dependent repositories - 7 downloads last month - 1 maintainer
openclean-geo 0.1.0
Geo-Spatial extension for the open data cleaning library
1 version - Latest release: about 3 years ago - 1 dependent repositories - 25 downloads last month - 2 stars on GitHub - 1 maintainer
pyrmt 0.1.0
Python for Random Matrix Theory: cleaning schemes for noisy correlation matrices
1 version - Latest release: almost 7 years ago - 1 dependent package - 1 dependent repositories - 24 downloads last month - 39 stars on GitHub - 1 maintainer
predf 0.1
Library with common utility functions for pre processing the data
1 version - Latest release: about 4 years ago - 1 dependent repositories - 11 downloads last month - 1 stars on GitHub - 1 maintainer
augmenting 0.0.0
An image dataset augmentation package.
1 version - Latest release: about 1 year ago - 1 dependent repositories - 27 downloads last month - 1 maintainer
shawarma 0.0.1
A simple package to perform basic data cleaning in EDA
1 version - Latest release: about 3 years ago - 1 dependent repositories - 15 downloads last month - 1 maintainer
vivek2dropoffnan 0.0.1
A simple package to perform basic data cleaning in EDA
1 version - Latest release: about 3 years ago - 1 dependent repositories - 9 downloads last month - 1 maintainer
exchangebankcleaning 1.0.0 removed
clean exchangebank data
1 version - Latest release: almost 2 years ago - 1 maintainer