Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
conda-forge.org "datacleaning" keyword
openrefine 3.5.0 💰
OpenRefine is a free, open source power tool for working with messy data and improving it3 versions - Latest release: over 2 years ago - 9,377 stars on GitHub
dataprep 0.4.5
Open-source low code data preparation library in python. Collect, clean and visualization your da...8 versions - Latest release: almost 2 years ago - 4 dependent repositories - 1,571 stars on GitHub
hypergbm 0.2.5
HyperGBM is a full pipeline automated machine learning (AutoML) toolkit designed for tabular data...8 versions - Latest release: about 2 years ago - 257 stars on GitHub
great-expectations 0.15.32
Great Expectations helps teams save time and promote analytic integrity by offering a unique appr...144 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 8,121 stars on GitHub
cleantext 1.1.4
An open-source package for python to clean raw text data1 version - Latest release: almost 2 years ago - 46 stars on GitHub
medaprep 0.1.1
medaprep is a data preparation and feature engineering toolkit for geospatial applications.1 version - Latest release: over 1 year ago - 1 stars on GitHub
Related Keywords
data-science
4
eda
3
exploratory-data-analysis
3
data-unit-tests
1
data-quality
1
data-profiling
1
data-profilers
1
data-engineering
1
cleandata
1
xgboost
1
tabular-data
1
sklearn
1
semi-supervised-learning
1
rapidsai
1
pseudo-labeling
1
preprocessing
1
data-analysis
1
datacleaner
1
dataquality
1
dataunittest
1
exploratory-analysis
1
exploratorydataanalysis
1
mlops
1
pipeline
1
pipeline-debt
1
pipeline-testing
1
pipeline-tests
1
cleaning-data
1
cleantext
1
nlp
1
python
1
data
1
xarray
1
data-wrangling
1
datacleansing
1
datajournalism
1
datamining
1
java
1
journalism
1
opendata
1
reconciliation
1
wikidata
1
apis
1
apiwrapper
1
cleaning
1
connector
1
data-exploration
1
dataconnector
1
dataprep
1
datapreparation
1
webconnector
1
adversarial-validation
1
automl
1
catboost
1
dask
1
dask-distributed
1
distributed-training
1
ensemble-learning
1
fullpipeline
1
gbm
1
gpu-acceleration
1
lightgbm
1