Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "data-profiling" keyword
Top 0.7% on pypi.org
great-expectations 0.18.13
Always know what to expect from your data.
263 versions - Latest release: about 2 hours ago - 42 dependent packages - 284 dependent repositories - 18.6 million downloads last month - 9,420 stars on GitHub - 8 maintainers
seedspark 0.4.3
SeedSpark is an Extensible PySpark utility package to create production spark pipelines and dev-t...
14 versions - Latest release: 9 months ago - 1 dependent repository - 99 downloads last month - 2 stars on GitHub - 2 maintainers
piperider-nightly 0.42.0.20240429
PiperRider CLI
528 versions - Latest release: about 3 hours ago - 4.22 thousand downloads last month - 450 stars on GitHub - 1 maintainer
cleanlab-studio 2.0.2
Client interface for all things Cleanlab Studio
77 versions - Latest release: 10 days ago - 1 dependent repository - 3.09 thousand downloads last month - 20 stars on GitHub - 5 maintainers
Top 6.3% on pypi.org
great-expectations-experimental 0.1.20240429010
Always know what to expect from your data.
498 versions - Latest release: about 6 hours ago - 1 dependent repository - 242 thousand downloads last month - 9,129 stars on GitHub - 7 maintainers
compars 0.0.0
DataFrame comparison done right (AKA the Bear-agnostic DataFrame comparison library)
1 version - Latest release: 9 days ago - 150 downloads last month - 0 stars on GitHub - 2 maintainers
Top 9.4% on pypi.org
pyoptimus 0.1.0
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion.
32 versions - Latest release: over 1 year ago - 1 dependent repository - 321 downloads last month - 1,441 stars on GitHub - 2 maintainers
odd-collector 0.1.18
ODD Collector
1 version - Latest release: about 1 year ago - 13 downloads last month - 40 stars on GitHub - 2 maintainers
openmetadata-sqlalchemy-bigquery 1.2.0
SQLAlchemy dialect for BigQuery by OpenMetadata
4 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repository - 37 downloads last month - 4,149 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
openmetadata-managed-apis 1.3.3.0
Airflow REST APIs to create and manage DAGS
120 versions - Latest release: 11 days ago - 3.94 thousand downloads last month - 4,149 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
openmetadata-ingestion 0.10.1
Ingestion Framework for OpenMetadata
271 versions - Latest release: almost 2 years ago - 2 dependent packages - 2 dependent repositories - 27.5 thousand downloads last month - 4,149 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
openmetadata-airflow-managed-apis 0.10.1
Airflow REST APIs to create and manage DAGS
31 versions - Latest release: almost 2 years ago - 1 dependent repository - 582 downloads last month - 3,365 stars on GitHub - 1 maintainer
idg-metadata-client 1.0.2.0
Ingestion Framework for OpenMetadata
1 version - Latest release: 10 months ago - 29 downloads last month - 3,365 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
openmetadata-ingestion-core 0.10.0
These are the generated Python classes from JSON Schema
12 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repository - 184 downloads last month - 3,365 stars on GitHub - 1 maintainer
dqops 1.2.0
DQOps Data Quality Operations Center
16 versions - Latest release: 4 days ago - 189 downloads last month - 52 stars on GitHub - 2 maintainers
Top 2.1% on pypi.org
cleanlab 2.6.3
The standard package for data-centric AI, machine learning with label errors, and automatically f...
28 versions - Latest release: about 1 month ago - 8 dependent packages - 19 dependent repositories - 16.1 thousand downloads last month - 8,645 stars on GitHub - 5 maintainers
example-package-elisno 2.6.24
The standard package for data-centric AI, machine learning with label errors, and automatically f...
7 versions - Latest release: about 2 months ago - 52 downloads last month - 8,645 stars on GitHub - 1 maintainer
Top 2.1% on pypi.org
sweetviz 2.3.1
A pandas-based library to visualize and compare datasets.
35 versions - Latest release: 5 months ago - 8 dependent packages - 167 dependent repositories - 64.5 thousand downloads last month - 2,824 stars on GitHub - 1 maintainer
easiersdk 0.1.16
This library contains code for interacting with EASIER.AI platform.
102 versions - Latest release: almost 3 years ago - 1 dependent repository - 563 downloads last month - 2,824 stars on GitHub - 1 maintainer
haisweetviz 1.0.2
A pandas-based library to visualize and compare datasets.
2 versions - Latest release: almost 4 years ago - 1 dependent repository - 15 downloads last month - 2,824 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
popmon 1.4.6
Monitor the stability of a pandas or spark dataset
36 versions - Latest release: 10 months ago - 1 dependent package - 6 dependent repositories - 11.6 thousand downloads last month - 483 stars on GitHub - 3 maintainers
Top 1.0% on pypi.org
ydata-profiling 4.7.0
Generate profile report for pandas DataFrame
18 versions - Latest release: about 1 month ago - 20 dependent packages - 79 dependent repositories - 1.21 million downloads last month - 11,645 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
pydeequ 1.3.0
PyDeequ - Unit Tests for Data
12 versions - Latest release: 3 days ago - 6 dependent packages - 53 dependent repositories - 11.6 million downloads last month - 645 stars on GitHub - 4 maintainers
Top 3.5% on pypi.org
datatile 1.0.3
A library for managing, summarizing, and visualizing data.
20 versions - Latest release: over 1 year ago - 1 dependent package - 14 dependent repositories - 111 thousand downloads last month - 490 stars on GitHub - 2 maintainers
Top 3.3% on pypi.org
traceml 1.14.2
Engine for ML/Data tracking, visualization, dashboards, and model UI for Polyaxon.
86 versions - Latest release: over 2 years ago - 3 dependent packages - 8 dependent repositories - 121 thousand downloads last month - 492 stars on GitHub - 2 maintainers
pydeequalb 0.0.4
PyDeequ - Unit Tests for Data
3 versions - Latest release: over 1 year ago - 26 downloads last month - 643 stars on GitHub - 2 maintainers
Top 4.7% on pypi.org
pydeequ-alb 0.0.1 removed
PyDeequ - Unit Tests for Data
1 version - Latest release: over 1 year ago - 421 stars on GitHub
Top 2.5% on pypi.org
pandas-summary 0.2.0
An extension to pandas describe function.
7 versions - Latest release: over 2 years ago - 4 dependent packages - 137 dependent repositories - 93.4 thousand downloads last month - 490 stars on GitHub - 2 maintainers
Top 0.5% on pypi.org
pandas-profiling 3.6.6
Deprecated 'pandas-profiling' package, use 'ydata-profiling' instead
40 versions - Latest release: about 1 year ago - 46 dependent packages - 1,970 dependent repositories - 653 thousand downloads last month - 12,039 stars on GitHub - 4 maintainers
cleanlab-cli 0.1.14
Command line interface for all things Cleanlab Studio
16 versions - Latest release: over 1 year ago - 129 downloads last month - 20 stars on GitHub - 6 maintainers
Top 6.0% on pypi.org
piperider 1.0.2
PiperRider CLI
170 versions - Latest release: almost 3 years ago - 5 dependent repositories - 4.43 thousand downloads last month - 466 stars on GitHub - 1 maintainer
piperider-cli 0.1.3.12
PiperRider CLI
9 versions - Latest release: almost 2 years ago - 1 dependent repository - 41 downloads last month - 467 stars on GitHub - 2 maintainers
zarque-profiling 0.5.10
Data profiling tools for Big Data
6 versions - Latest release: 10 months ago - 419 downloads last month - 2 stars on GitHub - 2 maintainers
Top 9.8% on pypi.org
haupt 2.1.8
Lineage metadata API, artifacts streams, sandbox, ML-API, and spaces for Polyaxon.
125 versions - Latest release: 9 days ago - 1 dependent package - 947 downloads last month - 452 stars on GitHub - 2 maintainers
desbordante 2.0.0
Science-intensive high-performance data profiler
3 versions - Latest release: 12 days ago - 138 downloads last month - 61 stars on GitHub - 1 maintainer
lineagemd 0.0.0
Lineage metadata for ML/AI/Data.
1 version - Latest release: over 1 year ago - 12 downloads last month - 452 stars on GitHub - 2 maintainers
hauptai 0.0.0
Haupt ai.
1 version - Latest release: over 1 year ago - 5 downloads last month - 452 stars on GitHub - 1 maintainer
great-expectations-cta 0.15.43
Always know what to expect from your data.
2 versions - Latest release: over 1 year ago - 1 dependent package - 35 downloads last month - 9,124 stars on GitHub - 1 maintainer
metacrafter 0.0.2
Metacrafter metadata classification tool
2 versions - Latest release: almost 2 years ago - 1 dependent repository - 9 downloads last month - 35 stars on GitHub - 2 maintainers
Top 4.8% on pypi.org
optimuspyspark 2.2.32
Optimus is the missing framework for cleaning and pre-processing data in a distributed fashion wi...
83 versions - Latest release: almost 4 years ago - 8 dependent repositories - 8.75 thousand downloads last month - 1,439 stars on GitHub - 4 maintainers
Top 6.0% on pypi.org
cleanvision 0.3.6
Find issues in image datasets
12 versions - Latest release: 3 months ago - 2 dependent packages - 2 dependent repositories - 10.8 thousand downloads last month - 917 stars on GitHub - 10 maintainers
panda-helper 0.0.2
Data profiler for Pandas
2 versions - Latest release: almost 2 years ago - 1 dependent repository - 25 downloads last month - 2 stars on GitHub - 2 maintainers
haiqv-profiling 0.0.1
Generate profile report for pandas DataFrame
1 version - Latest release: over 3 years ago - 1 dependent repository - 3 downloads last month - 11,966 stars on GitHub - 2 maintainers
gate-drift 0.1.5
Data drift detection tool for machine learning pipelines.
5 versions - Latest release: about 1 year ago - 2 dependent repositories - 13 downloads last month - 19 stars on GitHub - 2 maintainers
raymon 0.0.39
Python package for data logging and monitoring.
14 versions - Latest release: over 2 years ago - 1 dependent repository - 9 downloads last month - 18 stars on GitHub - 2 maintainers
Related Keywords
data-science (35)
data-quality (29)
python (17)
machine-learning (17)
data-exploration (16)
dataquality (15)
data-analysis (14)
exploratory-data-analysis (14)
pandas (13)
eda (13)
data-visualization (12)
data-validation (12)
mlops (11)
data-observability (11)
data-quality-checks (10)
statistics (10)
data-engineering (9)
deep-learning (9)
dbt (9)
jupyter (8)
datadiscovery (7)
metadata (7)
datacatalog (7)
data-cleaning (7)
data-governance (7)
data-discovery (7)
data-profilers (7)
data-unit-tests (7)
spark (7)
dataunittest (7)
data-catalog (7)
pandas-dataframe (6)
tracking (6)
tensorflow (6)
plotly (6)
matplotlib (6)
pytorch (6)
snowflake (6)
exploration (6)
metadata-management (6)
dataengineering (6)
data-lineage (6)
data-contracts (6)
polyaxon (5)
ipython (5)
hacktoberfest (5)
datacleaner (5)
kubernetes (5)
data-centric-ai (5)
visualization (5)
dask (5)
dataops (5)
dataframes (4)
outlier-detection (4)
noisy-labels (4)
artificial-intelligence (4)
data-labeling (4)
data-curation (4)
lineage (4)
bigquery (4)
data-profiler (4)
neural-networks (3)
gcs (3)
ai (3)
data-reliability (3)
google cloud storage (3)
big-data-analytics (3)
azure (3)
microsoft (3)
html-report (3)
data_cleaning (3)
machine_learning (3)
jupyter-notebook (3)
pandas-profiling (3)
deequ (3)
pydeequ (3)
s3 (3)
aws (3)
data-collaboration (3)
data-pipeline (3)
continuous-integration (3)
code-review (3)
ui (3)
pipeline-tests (3)
pipeline-testing (3)
pipeline-debt (3)
exploratorydataanalysis (3)
exploratory-analysis (3)
datacleaning (3)
cleandata (3)
datavalidation (3)
validation (3)
quality (3)
pipeline (3)
testing (3)
science (3)
data (3)
reinforcement-learning (3)
google cloud (3)
tensorFlow (3)