Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "dataquality" keyword
zingg 0.4.0
Zingg Entity Resolution, Data Mastering and Deduplication
Top 8.8% on pypi.org
3 versions - Latest release: 4 months ago - 1 dependent repository - 1.2 thousand downloads last month - 888 stars on GitHub - 1 maintainer
pydeequ 1.3.0
PyDeequ - Unit Tests for Data
Top 1.9% on pypi.org
12 versions - Latest release: 14 days ago - 6 dependent packages - 53 dependent repositories - 11.8 million downloads last month - 649 stars on GitHub - 4 maintainers
pydeequalb 0.0.4
PyDeequ - Unit Tests for Data
3 versions - Latest release: over 1 year ago - 24 downloads last month - 649 stars on GitHub - 2 maintainers
pydeequ-alb 0.0.1 removed
PyDeequ - Unit Tests for Data
Top 4.7% on pypi.org
1 version - Latest release: over 1 year ago - 421 stars on GitHub
cleanlab 2.6.4
The standard package for data-centric AI, machine learning with label errors, and automatically f...
Top 2.1% on pypi.org
29 versions - Latest release: 3 days ago - 8 dependent packages - 19 dependent repositories - 21.7 thousand downloads last month - 8,710 stars on GitHub - 5 maintainers
example-package-elisno 2.6.24
The standard package for data-centric AI, machine learning with label errors, and automatically f...
7 versions - Latest release: 2 months ago - 50 downloads last month - 8,694 stars on GitHub - 1 maintainer
cuallee 0.10.1
Python library for data validation on DataFrame APIs including Snowflake/Snowpark, Apache/PySpark...
74 versions - Latest release: 10 days ago - 1 dependent package - 1 dependent repository - 11.2 thousand downloads last month - 109 stars on GitHub - 2 maintainers
openmetadata-managed-apis 1.3.4.0
Airflow REST APIs to create and manage DAGS
Top 9.4% on pypi.org
122 versions - Latest release: 3 days ago - 4.9 thousand downloads last month - 4,168 stars on GitHub - 1 maintainer
openmetadata-ingestion 0.10.1
Ingestion Framework for OpenMetadata
Top 3.9% on pypi.org
273 versions - Latest release: almost 2 years ago - 2 dependent packages - 2 dependent repositories - 31.7 thousand downloads last month - 4,168 stars on GitHub - 1 maintainer
seedspark 0.4.3
SeedSpark is an Extensible PySpark utility package to create production spark pipelines and dev-t...
14 versions - Latest release: 9 months ago - 1 dependent repository - 101 downloads last month - 2 stars on GitHub - 2 maintainers
datasae 0.5.1
Data Quality Framework provided by Jabar Digital Service
60 versions - Latest release: 2 months ago - 338 downloads last month - 3 stars on GitHub - 2 maintainers
chaosgenius 0.0.1
Chaos Genius: The Open-Source Business Observability Platform
1 version - Latest release: over 2 years ago - 1 dependent repository - 22 downloads last month - 702 stars on GitHub - 1 maintainer
analytics-command-center 3.0.14
Command Center for Data Ingestion, Advanced Analytics and Artificial Intelligence process
1 version - Latest release: over 2 years ago - 26 downloads last month - 11 stars on GitHub - 1 maintainer
great-expectations 0.18.13
Always know what to expect from your data.
Top 0.7% on pypi.org
264 versions - Latest release: 11 days ago - 42 dependent packages - 284 dependent repositories - 19.3 million downloads last month - 9,420 stars on GitHub - 8 maintainers
dq-client 0.5.0
Python library that makes it easy to use http://dataquality.pl.
5 versions - Latest release: almost 6 years ago - 1 dependent repository - 71 downloads last month - 0 stars on GitHub - 2 maintainers
datachecks 0.2.5
Open Source Data Quality Monitoring
12 versions - Latest release: 4 months ago - 118 downloads last month - 128 stars on GitHub - 1 maintainer
altimate-datapilot 0.0.8
Assistant for Data Teams
8 versions - Latest release: about 1 month ago - 215 downloads last month - 18 stars on GitHub - 2 maintainers
great-expectations-experimental 0.1.20240502061
Always know what to expect from your data.
Top 6.3% on pypi.org
501 versions - Latest release: 8 days ago - 1 dependent repository - 247 thousand downloads last month - 9,129 stars on GitHub - 7 maintainers
openmetadata-sqlalchemy-bigquery 1.2.0
SQLAlchemy dialect for BigQuery by OpenMetadata
4 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repository - 40 downloads last month - 4,168 stars on GitHub - 1 maintainer
idg-metadata-client 1.0.2.0
Ingestion Framework for OpenMetadata
1 version - Latest release: 10 months ago - 33 downloads last month - 3,365 stars on GitHub - 1 maintainer
openmetadata-airflow-managed-apis 0.10.1
Airflow REST APIs to create and manage DAGS
Top 8.8% on pypi.org
31 versions - Latest release: almost 2 years ago - 1 dependent repository - 730 downloads last month - 3,365 stars on GitHub - 1 maintainer
openmetadata-ingestion-core 0.10.0
These are the generated Python classes from JSON Schema
Top 9.7% on pypi.org
12 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repository - 204 downloads last month - 3,365 stars on GitHub - 1 maintainer
cz-data-diff 0.0.4
Command-line tool and Python library to efficiently diff rows across two different databases.
3 versions - Latest release: 5 months ago - 40 downloads last month - 2,846 stars on GitHub - 4 maintainers
data-diff 0.11.1
Command-line tool and Python library to efficiently diff rows across two different databases.
Top 4.5% on pypi.org
74 versions - Latest release: 3 months ago - 1 dependent package - 2 dependent repositories - 50.8 thousand downloads last month - 2,846 stars on GitHub - 10 maintainers
data-diff-customize 1.0.3
Command-line tool and Python library to efficiently diff rows across two different databases.
5 versions - Latest release: 6 months ago - 66 downloads last month - 2,846 stars on GitHub - 2 maintainers
dataqualitytransformation 0.0.1
Cleaning and standardizing feature values of the products
1 version - Latest release: almost 2 years ago - 1 dependent repository - 18 downloads last month - 2 maintainers
re-data 0.11.0
re_data - data quality framework
Top 5.8% on pypi.org
49 versions - Latest release: 5 months ago - 2 dependent repositories - 63.4 thousand downloads last month - 1,495 stars on GitHub - 1 maintainer
lale 0.8.0
Library for Semi-Automated Data Science
Top 4.1% on pypi.org
84 versions - Latest release: 3 months ago - 2 dependent packages - 9 dependent repositories - 5.27 thousand downloads last month - 320 stars on GitHub - 3 maintainers
qafs 0.1.1
Quality Aware Feature Store.
3 versions - Latest release: over 2 years ago - 1 dependent repository - 27 downloads last month - 8 stars on GitHub - 2 maintainers
altimate-datapilot-cli 0.0.10
Assistant for Data Teams
3 versions - Latest release: 19 days ago - 398 downloads last month - 18 stars on GitHub - 2 maintainers
risk-command-center 1.0.37
Risk Command Center, manage your risk easily.
2 versions - Latest release: almost 2 years ago - 1 dependent repository - 10 downloads last month - 11 stars on GitHub - 1 maintainer
dqlauncher 1.0
DataQuality functions over pyspark.sql SparkSession and DataFrame
1 version - Latest release: 6 months ago - 6 downloads last month - 1 star on GitHub - 2 maintainers
altimate-cli 0.0.8
Assistant for Data Teams
1 version - Latest release: about 1 month ago - 168 downloads last month - 18 stars on GitHub - 2 maintainers
datapilot-cli 0.0.8
Assistant for Data Teams
1 version - Latest release: about 1 month ago - 159 downloads last month - 18 stars on GitHub - 2 maintainers
dqvalidator 0.1
Quality functions over PySpark DataFrames
1 version - Latest release: 7 months ago - 5 downloads last month - 1 star on GitHub - 2 maintainers
data_check 0.19.0
simple data validation
22 versions - Latest release: about 2 months ago - 322 downloads last month - 4 stars on GitHub - 1 maintainer
pandas-dq 1.29
Clean your data using a scikit-learn transformer in a single line of code
Top 5.7% on pypi.org
19 versions - Latest release: 5 months ago - 1 dependent package - 13 dependent repositories - 33.5 thousand downloads last month - 116 stars on GitHub - 2 maintainers
great-expectations-cta 0.15.43
Always know what to expect from your data.
2 versions - Latest release: over 1 year ago - 1 dependent package - 35 downloads last month - 9,124 stars on GitHub - 1 maintainer
dataculpa-client 1.4.1
Python client for Data Culpa APIs
23 versions - Latest release: about 2 years ago - 1 dependent repository - 62 downloads last month - 9 stars on GitHub - 2 maintainers
dataqualitytransforming 0.0.2 removed
Cleaning and standardizing feature values of the products
2 versions - Latest release: almost 2 years ago
dataqualityprocessing 0.0.1 removed
Cleaning and standardizing feature values of the products
1 version - Latest release: almost 2 years ago
dataqualitypipeline 0.0.2 removed
Cleaning and standardizing feature values of the products
2 versions - Latest release: almost 2 years ago
dataqualitytransformer 0.0.2 removed
Cleaning and standardizing feature values of the products
2 versions - Latest release: almost 2 years ago
Related Keywords
data-quality (21)
data-science (17)
data-profiling (15)
dbt (14)
data-engineering (14)
python (13)
data-validation (10)
dataengineering (10)
data-governance (9)
snowflake (9)
data-quality-checks (8)
data-observability (8)
data-unit-tests (8)
data (8)
data-profilers (7)
dataunittest (7)
metadata-management (6)
metadata (6)
datadiscovery (6)
datacatalog (6)
data-lineage (6)
data-discovery (6)
data-contracts (6)
data-catalog (6)
data-quality-monitoring (5)
database (5)
validation (5)
sql (5)
dataops (4)
bigquery (4)
dbt-core (4)
postgresql (4)
postgres (4)
quality (4)
pipeline (4)
testing (4)
mlops (4)
pipeline-testing (4)
pydeequ (4)
pipeline-tests (4)
mysql (4)
exploratory-data-analysis (3)
exploratorydataanalysis (3)
data-collaboration (3)
pipeline-debt (3)
exploratory-analysis (3)
eda (3)
machine-learning (3)
datacleaning (3)
datacleaner (3)
science (3)
cleandata (3)
trino (3)
data-engineer (3)
rdbms (3)
oracle-database (3)
databricks-sql (3)
datavalidation (3)
dataqualitycheck (3)
deequ (3)
bigdata (3)
spark (3)
outlier-detection (3)
data-analysis (3)
classification (2)
confident_learning (2)
data_cleaning (2)
machine_learning (2)
analytics (2)
ml (2)
scikit-learn (2)
trabaja-sobre-spark (2)
monitoring (2)
spark-sql (2)
parquet (2)
huemul-bigdatagovernance (2)
huemul (2)
etl (2)
chile (2)
cloudera (2)
hortonworks (2)
data-warehouse (2)
datamart (2)
gdpr (2)
hadoop (2)
hive (2)
pandas (2)
data-testing (2)
validation-library (2)
weak-supervision (2)
out-of-distribution-detection (2)
noisy-labels (2)
llms (2)
labeling (2)
datasets (2)
data-labeling (2)
data-curation (2)
data-cleaning (2)
data-centric-ai (2)
annotation (2)