Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

conda-forge.org "data-quality" keyword

cleanlab 2.1.0
cleanlab is the data-centric ML ops package for machine learning with noisy labels. cleanlab clea...
5 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repositories - 5,564 stars on GitHub
data-diff 0.2.8
Efficiently diff data in or across relational databases
6 versions - Latest release: over 1 year ago - 1,766 stars on GitHub
tangled-up-in-unicode 0.2.0
This module provides access to character properties for all Unicode characters, from the Unicode ...
5 versions - Latest release: over 2 years ago - 4 dependent packages - 13 dependent repositories - 3 stars on GitHub
feast 0.26.0
Feature Store for Machine Learning
1 version - Latest release: over 1 year ago - 4,088 stars on GitHub
great-expectations 0.15.32
Great Expectations helps teams save time and promote analytic integrity by offering a unique appr...
144 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 8,121 stars on GitHub
traceml 1.0.0
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for P...
1 version - Latest release: almost 2 years ago - 463 stars on GitHub
Top 5.0% on conda-forge.org
pandas-profiling 3.4.0
Create HTML profiling reports from pandas DataFrame objects
23 versions - Latest release: over 1 year ago - 6 dependent packages - 69 dependent repositories - 10,373 stars on GitHub
r-pointblank 0.11.2
Data quality assessment and metadata reporting for data frames and database tables
9 versions - Latest release: over 1 year ago - 712 stars on GitHub
Related Keywords
data-science 6 data-engineering 3 python 3 mlops 3 machine-learning 3 exploratory-data-analysis 3 data-profiling 3 data-exploration 2 data-validation 2 exploration 2 data-analysis 2 dataquality 2 eda 2 pandas 2 statistics 2 plotly 1 pytorch 1 pandas-summary 1 spark 1 matplotlib 1 explainable-ai 1 dataops 1 dataframes 1 data-visualization 1 data-quality-checks 1 dask 1 pipeline-tests 1 pipeline-testing 1 pipeline-debt 1 pipeline 1 yaml-configuration 1 testing-tools 1 schema-validation 1 reporting-tool 1 easy-to-understand 1 database-tables 1 data-verification 1 data-profiler 1 data-management 1 data-inference 1 data-frames 1 data-dictionaries 1 data-checker 1 data-assertions 1 pandas-profiling 1 pandas-dataframe 1 jupyter-notebook 1 jupyter 1 html-report 1 hacktoberfest 1 deep-learning 1 big-data-analytics 1 tracking 1 tensorflow 1 oracle-database 1 mysql 1 dataengineering 1 databricks-sql 1 database 1 data-quality-monitoring 1 weak-supervision 1 robust-machine-learning 1 outlier-detection 1 out-of-distribution-detection 1 noisy-labels 1 label-errors 1 image-tagging 1 entity-recognition 1 data-labeling 1 data-cleaning 1 data-centric-ai 1 crowdsourcing 1 classification 1 annotations 1 active-learning 1 exploratorydataanalysis 1 exploratory-analysis 1 dataunittest 1 datacleaning 1 datacleaner 1 data-unit-tests 1 data-profilers 1 cleandata 1 ml 1 features 1 feature-store 1 big-data 1 unicode 1 linguistics 1 linguistic-analysis 1 trino 1 sql 1 snowflake 1 rdbms 1 postgresql 1 postgres 1