Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata for many open source software ecosystems and registries.

pypi.org "datacleaning" keyword

medaprep 0.1.1
medaprep is a data preparation and feature engineering toolkit for geospatial applications.
2 versions - Latest release: almost 2 years ago - 28 downloads last month - 1 star on GitHub - 1 maintainer
toolstack 0.1.5
A collection of useful tools to speed up data processing, cleaning and pipelining.
3 versions - Latest release: over 3 years ago - 1 dependent repository - 34 downloads last month - 0 stars on GitHub - 1 maintainer
spark-lean 0.3.3
An interactive PySpark-based Data Cleaning Library
4 versions - Latest release: about 6 years ago - 1 dependent repository - 20 downloads last month - 7 stars on GitHub - 2 maintainers
romcrap 0.1
A library that will transform the life of a Data Scientist
1 version - Latest release: almost 8 years ago - 2 dependent repositories - 7 downloads last month - 0 stars on GitHub - 1 maintainer
manoelgadifa 0.2
A library that will transform the life of a Data Scientist
2 versions - Latest release: almost 8 years ago - 1 dependent repository - 12 downloads last month - 0 stars on GitHub - 1 maintainer
itallic 0.0.8
Detects potentially corrupt entries in a dataframe with lat/lng and country-tagged data.
8 versions - Latest release: about 3 years ago - 1 dependent repository - 37 downloads last month - 0 stars on GitHub - 1 maintainer
Top 8.6% on pypi.org
hypergbm 0.3.2
A full-pipeline AutoML tool integrating various GBM models
19 versions - Latest release: 3 months ago - 1 dependent repository - 2.43 thousand downloads last month - 319 stars on GitHub - 1 maintainer
gotext 0.9.5
GoText is a universal text extraction and preprocessing tool for Python which supports wide vari...
2 versions - Latest release: over 2 years ago - 1 dependent repository - 9 downloads last month - 0 stars on GitHub - 1 maintainer
dirtyclean 0.1
Get rid of Unicode punctuation and other garbage from strings
1 version - Latest release: almost 7 years ago - 1 dependent repository - 10 downloads last month - 3 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
dataprep 0.4.5
Dataprep: Data Preparation in Python
33 versions - Latest release: almost 2 years ago - 1 dependent package - 55 dependent repositories - 43.6 thousand downloads last month - 1,902 stars on GitHub - 4 maintainers
covid-alberta 0.0.4
This is a small package to look at some of the Alberta-specific COVID data.
3 versions - Latest release: about 4 years ago - 1 dependent repository - 27 downloads last month - 1 star on GitHub - 1 maintainer
cooka 0.1.5
A lightweight AutoML system.
4 versions - Latest release: over 2 years ago - 1 dependent repository - 84 downloads last month - 319 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
great-expectations-experimental 0.1.20240502061
Always know what to expect from your data.
501 versions - Latest release: 11 days ago - 1 dependent repository - 247 thousand downloads last month - 9,129 stars on GitHub - 4 maintainers
Top 5.3% on pypi.org
cleantext 1.1.4
An open-source Python package to clean raw text data
8 versions - Latest release: over 2 years ago - 3 dependent packages - 21 dependent repositories - 27.4 thousand downloads last month - 65 stars on GitHub - 1 maintainer
ricco 1.4.1
A handy ETL&GEOM kit
54 versions - Latest release: 2 months ago - 1 dependent repository - 371 downloads last month - 1 star on GitHub - 1 maintainer
Top 0.7% on pypi.org
great-expectations 0.18.13
Always know what to expect from your data.
264 versions - Latest release: 14 days ago - 42 dependent packages - 284 dependent repositories - 19.3 million downloads last month - 9,420 stars on GitHub - 8 maintainers
limpieza 0.1
A library that will clean a dataframe
1 version - Latest release: almost 8 years ago - 2 dependent repositories - 8 downloads last month - 0 stars on GitHub - 1 maintainer
great-expectations-cta 0.15.43
Always know what to expect from your data.
2 versions - Latest release: over 1 year ago - 1 dependent package - 35 downloads last month - 9,124 stars on GitHub - 1 maintainer
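
The summary lines in this listing follow a regular "count label - count label" pattern, so they can be turned into structured data for comparison. The sketch below is one possible way to do that in Python. The function name `parse_summary` and the dictionary keys are illustrative assumptions, not part of the Ecosyste.ms API; the keys are simply the labels as they appear in the text, with spaces replaced by underscores.

```python
import re

# Multipliers for the spelled-out magnitudes used in the listing
# (e.g. "2.43 thousand", "19.3 million").
_SCALE = {"thousand": 1_000, "million": 1_000_000}

# One field: a number, an optional magnitude word, then a free-text label.
_FIELD = re.compile(
    r"^(?P<num>[\d.,]+)(?:\s+(?P<scale>thousand|million))?\s+(?P<label>.+)$"
)


def parse_summary(line):
    """Parse one summary line of the listing into a dict (hypothetical helper)."""
    result = {}
    for part in line.split(" - "):
        part = part.strip()
        if part.startswith("Latest release:"):
            # Relative dates are kept as text; the listing gives no absolute date.
            result["latest_release"] = part.split(":", 1)[1].strip()
            continue
        match = _FIELD.match(part)
        if not match:
            continue  # skip anything that does not look like "<count> <label>"
        number = float(match.group("num").replace(",", ""))
        scale = _SCALE.get(match.group("scale"), 1)
        key = match.group("label").strip().replace(" ", "_")
        result[key] = round(number * scale)
    return result


if __name__ == "__main__":
    example = (
        "33 versions - Latest release: almost 2 years ago - 1 dependent package"
        " - 55 dependent repositories - 43.6 thousand downloads last month"
        " - 1,902 stars on GitHub - 4 maintainers"
    )
    print(parse_summary(example))
```

Because the keys follow the text literally, singular and plural labels ("maintainer" versus "maintainers") produce different keys; a caller comparing packages would need to normalize them.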