An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data-validation" keyword

pydantic-storage 0.1.1
A lightweight, type-safe storage system for Pydantic models with support for multiple backends a...
2 versions - Latest release: 9 months ago - 12 downloads last month - 1 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
cerberus 1.3.8 πŸ’°
Lightweight, extensible schema and data validation tool for Pythondictionaries.
30 versions - Latest release: 6 months ago - 140 dependent packages - 2,028 dependent repositories - 8.41 million downloads last month - 3,114 stars on GitHub - 2 maintainers
eurybia 1.4.0
Eurybia monitor model drift over time and securize model deployment with data validation
17 versions - Latest release: about 2 months ago - 1 dependent repositories - 96 downloads last month - 214 stars on GitHub - 1 maintainer
vowl 0.0.2
A SQL-powered data quality validation library for pandas and spark DataFrames.
2 versions - Latest release: 15 days ago - 1 maintainer
vinzy-datadiff 0.1.0
A comprehensive DataFrame comparison library for identifying differences between pandas DataFrames
1 version - Latest release: 4 months ago - 37 downloads last month - 1 maintainer
Top 1.9% on pypi.org
deepchecks 0.19.1
Package for validating your machine learning model and data
59 versions - Latest release: over 1 year ago - 7 dependent packages - 92 dependent repositories - 53.9 thousand downloads last month - 4,011 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
evidently 0.7.21
Open-source tools to analyze, monitor, and debug machine learning model in production.
152 versions - Latest release: 2 months ago - 8 dependent packages - 340 dependent repositories - 1.15 million downloads last month - 7,435 stars on GitHub - 2 maintainers
Top 1.4% on pypi.org
pandera 0.31.1 πŸ’°
A light-weight and flexible data validation and testing tool for statistical data objects.
121 versions - Latest release: 28 days ago - 97 dependent packages - 229 dependent repositories - 8.52 million downloads last month - 3,009 stars on GitHub - 3 maintainers
finlab-sentinel 0.1.7
Defensive monitoring layer for finlab data.get API - detect unexpected data changes
8 versions - Latest release: 4 months ago - 63 downloads last month - 0 stars on GitHub - 1 maintainer
Top 8.5% on pypi.org
duckguard 3.2.0
A Python-native data quality tool with AI superpowers, built on DuckDB for speed
7 versions - Latest release: 3 months ago - 205 downloads last month - 1 maintainer
iflow-mcp_csv-editor 1.0.1
MCP server for comprehensive CSV file operations with pandas-based tools
1 version - Latest release: 6 months ago - 14 downloads last month - 18 stars on GitHub - 1 maintainer
janus-validation 0.1.1
A Python library for robust data validation, serialization, and schema versioning.
2 versions - Latest release: over 1 year ago - 28 downloads last month - 0 stars on GitHub - 1 maintainer
flycatcher 0.1.0
Define your data schema once. Validate at scale. Stay columnar.
1 version - Latest release: 6 months ago - 45 downloads last month - 3 stars on GitHub - 1 maintainer
datascreeniq 1.0.12
Real-time data quality screening API β€” PASS / WARN / BLOCK in milliseconds
13 versions - Latest release: about 1 month ago - 307 downloads last month - 1 maintainer
mlschema 0.1.6 πŸ’°
Lightweight orchestration layer that turns pandas DataFrames into front-end-ready JSON schemas, e...
7 versions - Latest release: 22 days ago - 433 downloads last month - 1 maintainer
serializable-excel 0.1.6
A user-friendly Python library for seamless bidirectional conversion between Excel spreadsheets a...
5 versions - Latest release: 5 months ago - 133 downloads last month - 2 stars on GitHub - 1 maintainer
Top 2.1% on pypi.org
cleanlab 2.9.0
The standard package for data-centric AI, machine learning with label errors, and automatically f...
35 versions - Latest release: 4 months ago - 11 dependent packages - 19 dependent repositories - 56.6 thousand downloads last month - 11,361 stars on GitHub - 6 maintainers
datafun-streaming 0.7.0
Utilities for streaming data analytics with Kafka and DuckDB.
7 versions - Latest release: 3 days ago - 1 maintainer
databricks-labs-lakebridge 0.12.2
Fast and predictable migrations to Databricks Lakehouse Platform. This tool is designed to help y...
21 versions - Latest release: 3 months ago - 10.1 thousand downloads last month - 114 stars on GitHub - 1 maintainer
fauxdata-cli 0.1.4
CLI for generating and validating fake datasets
5 versions - Latest release: about 1 month ago - 399 downloads last month - 1 stars on GitHub - 1 maintainer
grib-check 0.0.8
Tools for checking GRIB files
9 versions - Latest release: about 1 month ago - 41 downloads last month - 0 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
phone-and-mail-verifier 0.1.2
Production-ready Python library for email and phone number validation. Validate emails and phone ...
2 versions - Latest release: 4 months ago - 58 downloads last month - 1 maintainer
samesame 0.3.1
Statistical tests for model monitoring, data validation, and drift detection.
7 versions - Latest release: 9 days ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
goldencheck 1.2.0 πŸ’°
Data validation that discovers rules from your data so you don't have to write them
13 versions - Latest release: 6 days ago - 850 downloads last month - 1 stars on GitHub - 1 maintainer
hermes-orm 0.1.7
A high-performance ORM for Python with support for migrations, relations, and caching.
8 versions - Latest release: over 1 year ago - 40 downloads last month - 1 stars on GitHub - 1 maintainer
mcp-server-hyperplexity 1.0.15
MCP server for Hyperplexity β€” generate, validate, and fact-check research tables with AI
10 versions - Latest release: about 2 months ago - 311 downloads last month - 1 maintainer
sentri 1.0.0
Sentri - a production-ready, configurable data quality validation framework
1 version - Latest release: 5 months ago - 11 downloads last month - 2 stars on GitHub - 1 maintainer
garlic-validator 1.0.1
Cerberus and validator-collection based custom validator package (garlic_validator) for python pr...
2 versions - Latest release: almost 4 years ago - 1 dependent repositories - 10 downloads last month - 2 stars on GitHub - 1 maintainer
validoopsie 1.8.0
Validoopsie is a simple and light data validation library.
31 versions - Latest release: about 2 months ago - 556 downloads last month - 79 stars on GitHub - 1 maintainer
goldenpipe 1.1.0
Pluggable pipeline framework for data quality workflows
7 versions - Latest release: 6 days ago - 1 maintainer
sliq 0.2.0
Sliq automatically fixes dataset schema issues, missing values, duplicate rows, and formatting er...
3 versions - Latest release: 3 months ago - 57 downloads last month - 1 maintainer
msgspec-ext 0.5.1
High-performance settings management and validation library extending msgspec
5 versions - Latest release: about 2 months ago - 1.05 thousand downloads last month - 7 stars on GitHub - 1 maintainer
dataset-inspector 0.0.2
Lightweight CLI tool to analyze CSV datasets and detect common ML data issues.
2 versions - Latest release: about 2 months ago - 162 downloads last month - 1 maintainer
qualink 0.0.3
Blazing fast data quality framework for Python, built on Apache DataFusion
3 versions - Latest release: about 2 months ago - 175 downloads last month - 4 stars on GitHub - 1 maintainer
validada 0.0.1
Another python package for defensive data analysis.
1 version - Latest release: almost 11 years ago - 3 dependent repositories - 34 downloads last month - 29 stars on GitHub - 1 maintainer
pylib-validator 0.1.0
Validate Python objects with schema definitions. Perfect for API validation.
1 version - Latest release: 6 months ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
xdata 0.0.3
Simple data validation library
3 versions - Latest release: about 9 years ago - 1 dependent repositories - 29 downloads last month - 23 stars on GitHub - 1 maintainer
data_check 0.20.0
simple data validation
23 versions - Latest release: about 1 year ago - 118 downloads last month - 5 stars on GitHub - 1 maintainer
dict-patterns 0.4.0
A template engine for dictionary data – useful for tests!
6 versions - Latest release: 22 days ago - 122 downloads last month - 0 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
openmetadata-airflow-managed-apis 0.10.1
Airflow REST APIs to create and manage DAGS
31 versions - Latest release: almost 4 years ago - 1 dependent repositories - 175 downloads last month - 3,365 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
openmetadata-managed-apis 1.12.6.4
Airflow REST APIs to create and manage DAGS
399 versions - Latest release: 11 days ago - 18.3 thousand downloads last month - 5,446 stars on GitHub - 1 maintainer
intc 0.1.1
intc: intelligent python config toolkit
2 versions - Latest release: over 1 year ago - 2 dependent packages - 18 downloads last month - 28 stars on GitHub - 1 maintainer
input-checker 0.3.9
Package to perform comparison between data frames
2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 17 downloads last month - 5 stars on GitHub - 1 maintainer
alactic-agi 1.0.0
Enterprise AI Dataset Processing Platform - Scalable data acquisition, validation, and structurin...
1 version - Latest release: 7 months ago - 12 downloads last month - 1 maintainer
deepcheckscv 0.0.1
Package for validating your machine learning model and data
1 version - Latest release: over 4 years ago - 1 dependent repositories - 16 downloads last month - 3,889 stars on GitHub - 1 maintainer
skewsentry 0.1.1
Catch training ↔ serving feature skew before you ship to production
2 versions - Latest release: 9 months ago - 6 downloads last month - 1 maintainer
suclepy 1.0.1
SUCLEPY β€” Smart Universal Cleaner Library for Python
2 versions - Latest release: 6 months ago - 3 downloads last month - 0 stars on GitHub - 1 maintainer
databricks-switch-plugin 1.0.2
LLM-powered tool to convert SQL, code, and workflow files into Databricks notebooks.
18 versions - Latest release: 9 months ago - 8.46 thousand downloads last month - 124 stars on GitHub - 1 maintainer
hetman-pipeline 2.3.1
Hetman Pipeline is a flexible, developer-centric validation engine. It is built for those who pri...
17 versions - Latest release: about 2 months ago - 30 downloads last month - 3 stars on GitHub - 1 maintainer
mlchecks 0.0.1
Package for validating your machine learning model and data
1 version - Latest release: over 4 years ago - 1 dependent repositories - 13 downloads last month - 3,924 stars on GitHub - 1 maintainer
dcs-core 0.9.9
Open Source Data Quality Monitoring
31 versions - Latest release: 6 months ago - 136 downloads last month - 141 stars on GitHub - 1 maintainer
fordev 1.0.5 πŸ’°
Gere e valide dados randΓ΄micos com fordev
11 versions - Latest release: over 3 years ago - 1 dependent repositories - 2.29 thousand downloads last month - 36 stars on GitHub - 1 maintainer
pointblank 0.24.0
Find out if your data is what you think it is.
55 versions - Latest release: 20 days ago - 46.9 thousand downloads last month - 402 stars on GitHub - 1 maintainer
sanatio 1.7.0 πŸ’°
Simple and easy to validate data in Python
9 versions - Latest release: 11 months ago - 66 downloads last month - 6 stars on GitHub - 1 maintainer
pydata-constraints 1.0.1
The easiest way to validate your data streams in Python. Whether you have small JSON files or mas...
2 versions - Latest release: 2 months ago - 1 maintainer
pipedog 0.5.0
Data quality and schema drift detection for analysts β€” CLI and desktop GUI for CSV, Excel, Parque...
7 versions - Latest release: 22 days ago - 226 downloads last month - 1 maintainer
apiverve-emailvalidator 1.1.14
Email Validator is a simple tool for validating if an email address is valid or not. It checks th...
6 versions - Latest release: 3 months ago - 63 downloads last month - 0 stars on GitHub - 1 maintainer
mcp-server-subindex 1.0.22
MCP server for Subindex β€” generate, validate, and fact-check research tables with AI
1 version - Latest release: 12 days ago - 1 maintainer
Top 3.9% on pypi.org
openmetadata-ingestion 0.10.1
Ingestion Framework for OpenMetadata
549 versions - Latest release: almost 4 years ago - 3 dependent packages - 2 dependent repositories - 491 thousand downloads last month - 5,446 stars on GitHub - 1 maintainer
adri 7.4.0
Stop Your AI Agents Breaking on Bad Data - Data Quality Assessment Framework
20 versions - Latest release: about 1 month ago - 150 downloads last month - 1 stars on GitHub - 1 maintainer
recce-nightly 1.47.0.20260430
Environment diff tool for dbt
675 versions - Latest release: 12 days ago - 13.3 thousand downloads last month - 248 stars on GitHub - 1 maintainer
openmetadata-sqlalchemy-bigquery 1.2.0
SQLAlchemy dialect for BigQuery by OpenMetadata
4 versions - Latest release: over 4 years ago - 1 dependent package - 1 dependent repositories - 44 downloads last month - 4,168 stars on GitHub - 1 maintainer
deepchecks-cv 0.0.1
Package for validating your machine learning model and data
1 version - Latest release: over 4 years ago - 1 dependent repositories - 15 downloads last month - 3,935 stars on GitHub - 1 maintainer
vlidt 0.2.2
dataclass & validation simple as well as hell.
4 versions - Latest release: over 1 year ago - 20 downloads last month - 7 stars on GitHub - 1 maintainer
Top 5.6% on pypi.org
typical 2.9.0
Typical: Python's Typing Toolkit.
114 versions - Latest release: over 1 year ago - 2 dependent packages - 18 dependent repositories - 821 downloads last month - 179 stars on GitHub - 1 maintainer
dataqe-framework 0.3.5
Reusable Data Validation Framework for data migration, ETL validation, and cross-database reconci...
19 versions - Latest release: about 2 months ago - 379 downloads last month - 0 stars on GitHub - 1 maintainer
isonantic 1.0.1
ISONantic - A Pydantic-like data validation library for ISON format
2 versions - Latest release: 4 months ago - 47 downloads last month - 1 maintainer
databricks-labs-remorph 0.9.1
SQL code converter and data reconcilation tool for accelerating data onboarding to Databricks fro...
19 versions - Latest release: 11 months ago - 1.44 million downloads last month - 105 stars on GitHub - 1 maintainer
devgear 1.0.0
πŸ› οΈ Essential gear for Python developers: decorators, validators, file utils, logging β€” all in one...
1 version - Latest release: 3 months ago - 1 maintainer
recce 1.39.0
Environment diff tool for dbt
168 versions - Latest release: 2 months ago - 31.5 thousand downloads last month - 432 stars on GitHub - 1 maintainer
zodic 0.2.0
A TypeScript Zod-inspired validation library for Python with excellent type safety and developer ...
2 versions - Latest release: 11 months ago - 23 downloads last month - 0 stars on GitHub - 1 maintainer
checkengine 0.2.0
Data-quality checks for PySpark
1 version - Latest release: almost 5 years ago - 16 downloads last month - 30 stars on GitHub
pyvaru 0.3.0
Rule based data validation library for python.
4 versions - Latest release: about 9 years ago - 1 dependent repositories - 210 downloads last month - 20 stars on GitHub - 1 maintainer
Top 10.0% on pypi.org
encord-active 0.1.83
Enable users to improve machine learning models in an active learning fashion via data, label, an...
76 versions - Latest release: over 2 years ago - 1 dependent repositories - 429 downloads last month - 427 stars on GitHub - 1 maintainer
dhi 1.2.0
Ultra-fast data validation for Python - 24M validations/sec, powered by Zig
16 versions - Latest release: 18 days ago - 4.24 thousand downloads last month - 0 stars on GitHub - 1 maintainer
pydantic2zod 0.1.1
Pydantic to zod declaration compiler.
2 versions - Latest release: over 1 year ago - 2.05 thousand downloads last month - 79 stars on GitHub - 1 maintainer
symconstraints 0.0.1
Validate and Impute your data with math expressions
1 version - Latest release: over 1 year ago - 8 downloads last month - 0 stars on GitHub - 1 maintainer
qualipilot 2.0.1
Production-grade data quality checks with pluggable LLM reporting (AWS Bedrock, Ollama, OpenAI-co...
2 versions - Latest release: 15 days ago - 1 maintainer
pyvalidx 0.1.4
Custom field validation
4 versions - Latest release: 5 months ago - 9 downloads last month - 0 stars on GitHub - 1 maintainer
intc-lsp 0.1.1
intc-lsp: intc language server
4 versions - Latest release: over 1 year ago - 16 downloads last month - 28 stars on GitHub - 1 maintainer
truthound 3.1.1
Zero-Configuration Data Quality Framework Powered by Polars
40 versions - Latest release: about 1 month ago - 2.36 thousand downloads last month - 17 stars on GitHub - 1 maintainer
tracebloc-ingestor 0.2.10
A flexible data ingestion library for various file formats
12 versions - Latest release: 3 months ago - 30 downloads last month - 5 stars on GitHub - 4 maintainers
polyantic 0.0.1
Pydantic-native data contracts for DataFrame validation
1 version - Latest release: 17 days ago - 124 downloads last month - 1 maintainer
engineer-your-data 0.1.3
MCP server for data engineering and business intelligence operations
4 versions - Latest release: 7 months ago - 48 downloads last month - 0 stars on GitHub - 1 maintainer
koality 0.13.0
Library for data checks and data quality monitoring based on duckdb.
16 versions - Latest release: about 2 months ago - 1.22 thousand downloads last month - 1 maintainer
apn-validators 0.4.1
Tools for validating user data and input in Python.
5 versions - Latest release: over 1 year ago - 59 downloads last month - 3 stars on GitHub - 1 maintainer
cheminformant 2.4.3
A robust and high-throughput Python client for the PubChem API, designed for automated data retri...
20 versions - Latest release: 8 months ago - 103 downloads last month - 4 stars on GitHub - 1 maintainer
raymon 0.0.39
Python package for data logging and monitoring.
14 versions - Latest release: over 4 years ago - 1 dependent repositories - 103 downloads last month - 18 stars on GitHub - 1 maintainer
nitro-validator 1.0.3
A powerful, standalone, dependency-free data validation library for Python with extensible rules ...
4 versions - Latest release: 25 days ago - 39 downloads last month - 1 stars on GitHub - 1 maintainer
databeak 0.1.2
DataBeak: MCP server for comprehensive CSV file operations with pandas-based tools
6 versions - Latest release: 7 months ago - 39 downloads last month - 1 stars on GitHub - 1 maintainer
daffy 2.8.0
Function decorators for DataFrame validation - columns, data types, and row-level validation with...
55 versions - Latest release: 2 months ago - 1 dependent repositories - 3.19 thousand downloads last month - 13 stars on GitHub - 1 maintainer
opendqv 2.2.6
OpenDQV Core β€” open-source, contract-driven data quality validation engine for data pipelines and...
45 versions - Latest release: 17 days ago - 3.19 thousand downloads last month - 10 stars on GitHub - 1 maintainer
livecheck-language 0.5.0
Natural language data validation β€” write rules in plain English, handle typos, validate files, ge...
1 version - Latest release: 19 days ago - 1 maintainer
laravel-validation 0.1
laravel like data validation library for python language
1 version - Latest release: over 2 years ago - 1 dependent repositories - 11 stars on GitHub - 1 maintainer
lavendertown 0.7.1
A Streamlit-first Python package for detecting and visualizing data quality issues
8 versions - Latest release: 4 months ago - 229 downloads last month - 1 maintainer
deepchecks-core 0.0.1
Package for validating your machine learning model and data
1 version - Latest release: over 4 years ago - 1 dependent repositories - 16 downloads last month - 3,577 stars on GitHub - 1 maintainer
objectiv-modelhub 0.0.28
The open model hub is a growing collection of data models that you can take, combine and run for ...
33 versions - Latest release: over 3 years ago - 1 dependent repositories - 193 downloads last month - 468 stars on GitHub - 1 maintainer
pydqkit 0.0.1
A developer-first Python toolkit for data quality profiling, validation, and interactive HTML rep...
1 version - Latest release: 4 months ago - 14 downloads last month - 1 maintainer
Top 9.7% on pypi.org
openmetadata-ingestion-core 0.10.0 πŸ’°
These are the generated Python classes from JSON Schema
12 versions - Latest release: about 4 years ago - 1 dependent package - 1 dependent repositories - 1.1 thousand downloads last month - 7,743 stars on GitHub - 1 maintainer
titanos 0.1.0
Titanos is a high-performance Python framework for validated data models, powered by msgspec.
1 version - Latest release: 19 days ago - 94 downloads last month - 1 maintainer