Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "cleaning" keyword

folderclean 1.2
Clean your folders in a single line of code! FolderCleans lets you for example clean your folders...
1 version - Latest release: almost 4 years ago - 1 dependent repository - 13 downloads last month - 1 star on GitHub - 1 maintainer
openclean-geo 0.1.0
Geo-Spatial extension for the open data cleaning library
1 version - Latest release: about 3 years ago - 1 dependent repository - 25 downloads last month - 2 stars on GitHub - 2 maintainers
neatbook 0.20
One line of code that makes a notebook that writes code that writes code.
19 versions - Latest release: about 6 years ago - 1 dependent repository - 62 downloads last month - 1 star on GitHub - 2 maintainers
pcleaner 2.6.2 💰
An AI-powered tool to clean manga panels.
36 versions - Latest release: 2 days ago - 361 downloads last month - 146 stars on GitHub - 1 maintainer
tnkeeh 0.0.9
Arabic cleaning, normalization and segmentation library.
9 versions - Latest release: about 3 years ago - 2 dependent repositories - 282 downloads last month - 45 stars on GitHub - 2 maintainers
datagovernance 0.1
A very basic cleaning data
1 version - Latest release: almost 3 years ago - 1 dependent repository - 8 downloads last month - 2 maintainers
cleanmydata 1.0.6
Library for data cleaning operations
6 versions - Latest release: over 1 year ago - 36 downloads last month - 0 stars on GitHub - 2 maintainers
dpyp 1.0.0
A pandas convenience wrapper for small-scale data pipelines
1 version - Latest release: 11 days ago - 192 downloads last month - 2 stars on GitHub - 2 maintainers
cocorepr 0.1.0
COCO Dataset cleaning tool
29 versions - Latest release: almost 3 years ago - 62 downloads last month - 2 stars on GitHub - 2 maintainers
neatdata 0.18
Cleaning code
17 versions - Latest release: about 6 years ago - 1 dependent repository - 50 downloads last month - 0 stars on GitHub - 2 maintainers
vivek2dropoffnan 0.0.1
A simple package to perform basic data cleaning in EDA
1 version - Latest release: about 3 years ago - 1 dependent repository - 9 downloads last month - 2 maintainers
arrangepy 1.1.2 💰
Arrange your files in distinct folder, help you clean your PC
3 versions - Latest release: about 3 years ago - 1 dependent repository - 41 downloads last month - 24 stars on GitHub - 1 maintainer
datagovernancenew 0.0.2
A very basic cleaning data
1 version - Latest release: almost 3 years ago - 1 dependent repository - 16 downloads last month - 2 maintainers
claydates 1.0.6
Package used for cleaning, restructuring, logging, and plotting of financial data retrieved from ...
7 versions - Latest release: over 1 year ago - 67 downloads last month - 0 stars on GitHub - 2 maintainers
Top 2.8% on pypi.org
dataprep 0.4.5
Dataprep: Data Preparation in Python
33 versions - Latest release: over 1 year ago - 1 dependent package - 55 dependent repositories - 43.6 thousand downloads last month - 1,902 stars on GitHub - 8 maintainers
predf 0.1
Library with common utility functions for pre processing the data
1 version - Latest release: about 4 years ago - 1 dependent repository - 11 downloads last month - 1 star on GitHub - 1 maintainer
openclean-metanome 0.2.0
openclean Metanome Python Package
2 versions - Latest release: almost 3 years ago - 1 dependent repository - 25 downloads last month - 2 stars on GitHub - 2 maintainers
Top 9.5% on pypi.org
py-autoclean 1.1.3
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets
21 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 496 downloads last month - 213 stars on GitHub - 1 maintainer
py-autocleanre 1.1.4
AutoClean - Python Package for Automated Preprocessing & Cleaning of Datasets
1 version - Latest release: about 1 year ago - 21 downloads last month - 213 stars on GitHub - 2 maintainers
substitutionstring 0.2.0
Manipulate substitution of string, as for instance deletion and insertion, without loss of inform...
1 version - Latest release: over 2 years ago - 1 dependent repository - 14 downloads last month - 0 stars on framagit.org - 2 maintainers
vacuum-cleaner 0.1.3 💰
Deep Vacuum Cleaner
5 versions - Latest release: almost 6 years ago - 1 dependent repository - 63 downloads last month - 2 stars on GitHub - 1 maintainer
exchangecleaning 1.0.0
clean exchange data
1 version - Latest release: almost 2 years ago - 6 downloads last month - 2 maintainers
company-name-matching 0.4.3
Returns a score of 2 companies to be the same
40 versions - Latest release: over 2 years ago - 391 downloads last month - 2 maintainers
clean-df 0.3.0
Python module to report, clean, and optimize Pandas Dataframes effectively
7 versions - Latest release: 8 months ago - 70 downloads last month - 3 stars on GitHub - 2 maintainers
company-name-matching2 0.0.6
Returns a score of 2 companies to be the same
4 versions - Latest release: over 2 years ago - 49 downloads last month - 2 maintainers
githubdata 16.1.2
A simple Python package to easily download from and manage a GitHub "Data repository"
53 versions - Latest release: 7 months ago - 3 dependent packages - 8 dependent repositories - 460 downloads last month - 0 stars on GitHub - 2 maintainers
dwm 1.1.0
Best practices for marketing data quality management
11 versions - Latest release: almost 6 years ago - 2 dependent repositories - 37 downloads last month - 12 stars on GitHub - 4 maintainers
prep-buddy 0.5.11
A library for cleaning, transforming and executing all other preparation tasks for large datasets...
3 versions - Latest release: over 7 years ago - 1 dependent repository - 8 downloads last month - 8 stars on GitHub - 1 maintainer
mordineznlp 0.1.0
Powerful Python tool for modern NLP processing
34 versions - Latest release: about 2 years ago - 1 dependent repository - 118 downloads last month - 2 stars on GitHub - 2 maintainers
chunkyp 0.0.2
Ray-based preprocessing pipeline.
2 versions - Latest release: over 3 years ago - 1 dependent repository - 22 downloads last month - 0 stars on GitHub - 1 maintainer
clean-plot 0.0.13
clean_plot simplifies cleaning text files for creation of embeddings and making plots from it
12 versions - Latest release: over 1 year ago - 1 dependent repository - 64 downloads last month - 3 stars on GitHub - 1 maintainer
listwise 1.0.4
ListWise.com email validation wrapper.
1 version - Latest release: over 7 years ago - 1 dependent repository - 11 downloads last month - 1 star on GitHub - 2 maintainers
shawarma 0.0.1
A simple package to perform basic data cleaning in EDA
1 version - Latest release: about 3 years ago - 1 dependent repository - 15 downloads last month - 2 maintainers
tddc 0.1.1
Scaffold out methods and tests for collaborative data cleaning.
2 versions - Latest release: over 7 years ago - 1 dependent repository - 5 downloads last month - 5 stars on GitHub - 2 maintainers
Top 8.7% on pypi.org
openclean-core 0.4.1
Library for data cleaning and data profiling
8 versions - Latest release: almost 3 years ago - 5 dependent repositories - 544 downloads last month - 61 stars on GitHub - 2 maintainers
mirutil 25.1.0
Tools for getting and cleaning TSE data
185 versions - Latest release: 7 months ago - 3 dependent packages - 6 dependent repositories - 149 downloads last month - 1 star on GitHub - 1 maintainer
dataglitch 0.0.2
DataGlitch is a Python package designed to address common data challenges, including handling mix...
2 versions - Latest release: 12 months ago - 12 downloads last month - 0 stars on GitHub - 2 maintainers
clean-docstrings 0.1
Utility functions to clean docstrings in various programming languages
1 version - Latest release: almost 3 years ago - 1 dependent repository - 8 downloads last month - 0 stars on GitHub - 2 maintainers
banglanltk 0.0.4
Bangla Natural Language Processing Toolkit
2 versions - Latest release: over 3 years ago - 282 downloads last month - 2 maintainers
autoclean 1.0.2
A library for cleaning text data
1 version - Latest release: about 2 years ago - 1 dependent repository - 165 downloads last month - 1 star on GitHub - 1 maintainer
tubelearns 2.1.0
Python script for extracting, cleaning, and tokenizing YouTube video transcripts for Pre-Processi...
8 versions - Latest release: about 2 months ago - 55 downloads last month - 4 stars on GitHub - 4 maintainers
Top 5.2% on pypi.org
datacleaner 0.1.5
A Python tool that automatically cleans data sets and readies them for analysis.
6 versions - Latest release: over 7 years ago - 1 dependent package - 19 dependent repositories - 313 downloads last month - 1,039 stars on GitHub - 2 maintainers
takin 1.1.4
A Python Toolkit for File Processing, Text Cleaning and Data Splitting
4 versions - Latest release: over 1 year ago - 1 dependent repository - 14 downloads last month - 23 stars on GitHub - 2 maintainers
clean-html-for-llm 1.0.0
A library for cleaning HTML content by removing specified tags and attributes.
1 version - Latest release: 23 days ago - 1 maintainer
neverbounce-sdk 4.3.0
Official Python SDK for the NeverBounce API
18 versions - Latest release: over 3 years ago - 1 dependent repository - 45.4 thousand downloads last month - 14 stars on GitHub - 4 maintainers
datadoctor 1.0.15
A Python package for data cleaning and preprocessing.
14 versions - Latest release: 11 months ago - 42 downloads last month - 2 stars on GitHub - 2 maintainers
dripper 0.3.1
Cleaning your messy data.
7 versions - Latest release: about 9 years ago - 3 dependent repositories - 276 downloads last month - 8 stars on GitHub - 2 maintainers
visuallayer 0.0.15
Open, Clean Datasets for Computer Vision.
5 versions - Latest release: 11 months ago - 21 downloads last month - 64 stars on GitHub - 2 maintainers
kmor 1.0.7
K-means clustering with outlier removal numpy implementation
8 versions - Latest release: almost 4 years ago - 1 dependent repository - 23 downloads last month - 6 stars on GitHub - 2 maintainers
activedetect 0.1.3
A Library For Error Detection For Predictive Analytics
6 versions - Latest release: over 7 years ago - 1 dependent repository - 14 downloads last month - 10 stars on GitHub - 1 maintainer
pdcheckers 0.1
Investigate consistency and dirtiness of a pandas DataFrame
1 version - Latest release: over 1 year ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
etl-toolbox 0.0.3
Useful ETL functions for Python
3 versions - Latest release: over 3 years ago - 1 dependent repository - 28 downloads last month - 2 stars on GitHub - 2 maintainers
augmenting 0.0.0
An image dataset augmentation package.
1 version - Latest release: about 1 year ago - 1 dependent repository - 8 downloads last month - 2 maintainers
cleanliness 0.1.1
Basic cleaning of text
2 versions - Latest release: over 5 years ago - 1 dependent repository - 30 downloads last month - 0 stars on GitHub - 2 maintainers
prefea 0.1
Library with common utility functions for pre processing the data
1 version - Latest release: about 4 years ago - 1 dependent repository - 6 downloads last month - 1 star on GitHub - 2 maintainers
vl-datasets 0.0.11
Open, Clean Datasets for Computer Vision.
11 versions - Latest release: 12 months ago - 56 downloads last month - 63 stars on GitHub - 2 maintainers
preio 0.2
Library with common utility functions for pre processing the data
2 versions - Latest release: about 4 years ago - 1 dependent repository - 6 downloads last month - 1 star on GitHub - 1 maintainer
pyrmt 0.1.0
Python for Random Matrix Theory: cleaning schemes for noisy correlation matrices
1 version - Latest release: almost 7 years ago - 1 dependent package - 1 dependent repository - 23 downloads last month - 38 stars on GitHub - 2 maintainers
omni-ai 0.0.1
Python library for Automated Data Cleaning, Automated Preprocessing, Automated Visualization and ...
1 version - Latest release: about 3 years ago - 1 dependent repository - 7 downloads last month - 2 maintainers
Top 9.0% on pypi.org
data-cleaning-pb 1.0 removed
Library for data cleaning operations
1 version - Latest release: over 1 year ago - 45 downloads last month
tubelearn 1.0.0 removed
Python script for extracting and cleaning YouTube video transcripts for Pre-Processing in machine...
1 version - Latest release: 7 months ago - 1 maintainer
tidytweets 0.15 removed
Clean tweets to perform various NLP tasks such as topic analysis, word embeddings, sentiment anal...
2 versions - Latest release: 10 months ago - 115 downloads last month - 0 stars on GitHub - 4 maintainers
dcleaning 0.0.1 removed
This library to facilitate the process of cleaning data where you can delete duplicates, fill nul...
1 version - Latest release: about 1 year ago
mir-tse 20220809.2 removed
Tools for getting and cleaning TSE data
7 versions - Latest release: over 1 year ago
exchangebankcleaning 1.0.0 removed
clean exchangebank data
1 version - Latest release: almost 2 years ago - 1 maintainer