Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "data-engineering" keyword
panoramix 0.8.0
A interactive data visualization platform build on SqlAlchemy and druid.io
11 versions - Latest release: about 8 years ago - 2 dependent repositories - 51 downloads last month - 58,575 stars on GitHub - 1 maintainer
abel-airflow 1.7.1.3.post3
Programmatically author, schedule and monitor data pipelines
1 version - Latest release: over 7 years ago - 13 downloads last month - 34,219 stars on GitHub - 1 maintainer
creavel 0.11.0
A interactive data visualization platform build on SqlAlchemy and druid.io
1 version - Latest release: over 7 years ago - 1 dependent repositories - 9 downloads last month - 58,575 stars on GitHub - 1 maintainer
resp 0.1.2
Make the Redis Mass Insertion by using the REdis Serialization Protocol (RESP) simple.
1 version - Latest release: over 6 years ago - 4 dependent repositories - 220 downloads last month - 0 stars on GitHub - 1 maintainer
superset-erikxt 0.26.0
A interactive data visualization platform build on SqlAlchemy and druid.io
2 versions - Latest release: almost 6 years ago - 1 dependent repositories - 11 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-erik 0.26.0
A interactive data visualization platform build on SqlAlchemy and druid.io
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 15 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-dywx 0.26.3
A modern, enterprise-ready business intelligence web application
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 20 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-growth 0.26.3
A modern, enterprise-ready business intelligence web application
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 17 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-d1 0.26.3
A modern, enterprise-ready business intelligence web application
2 versions - Latest release: almost 6 years ago - 1 dependent repositories - 21 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-d2 0.26.3
A modern, enterprise-ready business intelligence web application
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 19 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-d3 0.26.3
A modern, enterprise-ready business intelligence web application
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 12 downloads last month - 58,575 stars on GitHub - 1 maintainer
heresuperset 0.27.6
A modern, enterprise-ready business intelligence web application
7 versions - Latest release: over 5 years ago - 1 dependent repositories - 48 downloads last month - 58,575 stars on GitHub - 2 maintainers
nifty-nesting 0.2.3
Python utilities for arbitrarily nested data structures.
6 versions - Latest release: over 5 years ago - 1 dependent repositories - 72 downloads last month - 1 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
quilt 2.9.15
Quilt is a data package manager
50 versions - Latest release: over 5 years ago - 6 dependent packages - 24 dependent repositories - 340 downloads last month - 1,311 stars on GitHub - 1 maintainer
pyxmlparser 0.1.2
CLI interface to convert XML into various formats
4 versions - Latest release: about 5 years ago - 1 dependent repositories - 48 downloads last month - 2 stars on GitHub - 1 maintainer
dagster-sqlalchemy 0.3.5
Utilities and examples for working with SQLAlchemy and dagster, an opinionated framework for expr...
9 versions - Latest release: about 5 years ago - 1 dependent repositories - 294 downloads last month - 10,330 stars on GitHub - 1 maintainer
stepist 0.1.5
Data process utils
9 versions - Latest release: about 5 years ago - 1 dependent repositories - 50 downloads last month - 27 stars on GitHub - 1 maintainer
pandas-ext 0.5.1
Python Pandas extensions for pandas dataframes
22 versions - Latest release: about 5 years ago - 1 dependent repositories - 57 downloads last month - 4 stars on GitHub - 2 maintainers
practicalai 0.0.1
practicalAI · A practical approach to machine learning
5 versions - Latest release: almost 5 years ago - 1 dependent repositories - 35 downloads last month - 35,580 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
dagster-pagerduty 2019.6.2
Package for pagerduty Dagster framework components.
518 versions - Latest release: almost 5 years ago - 1 dependent repositories - 64.7 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
eos-etl 1.0.0
Tools for exporting EOS blockchain data to JSON
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 7 downloads last month - 9 stars on GitHub - 1 maintainer
unofficial-superset 0.34.0
A modern, enterprise-ready business intelligence web application
4 versions - Latest release: over 4 years ago - 1 dependent repositories - 41 downloads last month - 58,575 stars on GitHub - 1 maintainer
alyeska 0.3.0a1
Alyeska /al-ee-EHS-kah/ n. A Data Pipeline Toolkit
3 versions - Latest release: over 4 years ago - 1 dependent repositories - 46 downloads last month - 3 stars on GitHub - 1 maintainer
quilt-installer 0.0.0a5
Quilt Data installation tool
4 versions - Latest release: over 4 years ago - 1 dependent repositories - 28 downloads last month - 1,311 stars on GitHub - 1 maintainer
superset-master-prasad-bhosale 0.999.0.dev0
A modern, enterprise-ready business intelligence web application
1 version - Latest release: over 4 years ago - 1 dependent repositories - 12 downloads last month - 58,575 stars on GitHub - 1 maintainer
auptimizer 1.0.1
An automatic ML model optimization tool.
7 versions - Latest release: over 4 years ago - 1 dependent repositories - 119 downloads last month - 200 stars on GitHub - 1 maintainer
apache-superset-jwi078 0.35.0
A modern, enterprise-ready business intelligence web application
1 version - Latest release: over 4 years ago - 1 dependent repositories - 17 downloads last month - 58,575 stars on GitHub - 1 maintainer
stairs-project 0.1.6
Framework for data processing using data pipelines
18 versions - Latest release: over 4 years ago - 1 dependent repositories - 126 downloads last month - 46 stars on GitHub - 1 maintainer
quilt-stack-installer 1.0.0
Quilt Data installation tool
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 1,311 stars on GitHub - 1 maintainer
plai 0.0.0
Programming language to create data manipulation pipelines.
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 32 downloads last month - 2 stars on GitHub - 1 maintainer
apache-superset-johan078 0.35.2
A modern, enterprise-ready business intelligence web application
2 versions - Latest release: about 4 years ago - 1 dependent repositories - 10 downloads last month - 58,575 stars on GitHub - 1 maintainer
apache-superset-078 0.35.2
A modern, enterprise-ready business intelligence web application
1 version - Latest release: about 4 years ago - 19 downloads last month - 58,575 stars on GitHub - 1 maintainer
apache-superset-patched 0.35.2
A modern, enterprise-ready business intelligence web application
1 version - Latest release: about 4 years ago - 15 downloads last month - 58,575 stars on GitHub - 1 maintainer
apache-superset-patched-1 0.35.2
A modern, enterprise-ready business intelligence web application
1 version - Latest release: about 4 years ago - 23 downloads last month - 58,575 stars on GitHub - 1 maintainer
apache-superset-jw078 0.999.0.dev0
A modern, enterprise-ready business intelligence web application
1 version - Latest release: about 4 years ago - 20 downloads last month - 58,575 stars on GitHub - 1 maintainer
diver 0.2.3
diver is a series of tools to speed up common feature-set investigation, conditioning and encodin...
20 versions - Latest release: about 4 years ago - 1 dependent repositories - 136 downloads last month - 1 stars on GitHub - 1 maintainer
mario-python 1.7.0
A configurable data pipeline library.
9 versions - Latest release: almost 4 years ago - 1 dependent repositories - 61 downloads last month - 0 stars on GitHub - 1 maintainer
dagster-bash 0.7.16
Package for Dagster bash solids.
57 versions - Latest release: almost 4 years ago - 1 dependent repositories - 381 downloads last month - 10,330 stars on GitHub - 1 maintainer
apache-airflow-backport-providers-email 2020.6.24
Back-ported airflow.providers.email.* package for Airflow 1.10.*
5 versions - Latest release: almost 4 years ago - 1 dependent repositories - 86 downloads last month - 33,967 stars on GitHub - 3 maintainers
apache-superset-red 0.34.1
A modern, enterprise-ready business intelligence web application
1 version - Latest release: almost 4 years ago - 11 downloads last month - 57,957 stars on GitHub - 1 maintainer
streamsql 2.0.1
Python SDK for the StreamSQL feature store
14 versions - Latest release: almost 4 years ago - 1 dependent repositories - 48 downloads last month - 4 stars on GitHub - 1 maintainer
aiscalator 0.1.18
AIscalate your Jupyter Notebook Prototypes into Airflow Data Products
22 versions - Latest release: almost 4 years ago - 277 downloads last month - 5 stars on GitHub - 1 maintainer
coralinede 1.0.1
python library for data engineering
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 15 downloads last month - 0 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
dagster-flyte 0.9.15
A Dagster integration for flyte
72 versions - Latest release: over 3 years ago - 1 dependent repositories - 583 downloads last month - 9,161 stars on GitHub - 1 maintainer
haiqv-streaming-dag-editor 1.0.0
A code editor and file manager about dag for haiqv-streaming
1 version - Latest release: over 3 years ago - 1 dependent repositories - 8 downloads last month - 33,967 stars on GitHub - 1 maintainer
alphalib 0.0.3
A library for your daily data engineering and data science routines.
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 20 downloads last month - 1 stars on GitHub - 1 maintainer
sheetwork 1.0.7
A handy CLI tool to ingest GoogleSheets into your database without writing a single line of code
27 versions - Latest release: over 3 years ago - 1 dependent repositories - 105 downloads last month - 16 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
apache-airflow-backport-providers-google 2021.3.3
Backport provider package apache-airflow-backport-providers-google for Apache Airflow
17 versions - Latest release: about 3 years ago - 1 dependent package - 55 dependent repositories - 6.21 thousand downloads last month - 34,343 stars on GitHub - 3 maintainers
apache-airflow-backport-providers-jenkins 2021.3.3
Backport provider package apache-airflow-backport-providers-jenkins for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 32 downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 7.0% on pypi.org
apache-airflow-backport-providers-apache-livy 2021.3.17
Backport provider package apache-airflow-backport-providers-apache-livy for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 6.03 thousand downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 9.9% on pypi.org
apache-airflow-backport-providers-apache-sqoop 2021.3.17
Backport provider package apache-airflow-backport-providers-apache-sqoop for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 68 downloads last month - 33,967 stars on GitHub - 3 maintainers
apache-airflow-backport-providers-cloudant 2021.3.17
Backport provider package apache-airflow-backport-providers-cloudant for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 67 downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 9.8% on pypi.org
apache-airflow-backport-providers-grpc 2021.3.17
Backport provider package apache-airflow-backport-providers-grpc for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 55 downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 7.6% on pypi.org
apache-airflow-backport-providers-mongo 2021.3.17
Backport provider package apache-airflow-backport-providers-mongo for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 144 downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 8.7% on pypi.org
apache-airflow-backport-providers-odbc 2021.3.17
Backport provider package apache-airflow-backport-providers-odbc for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 228 downloads last month - 34,343 stars on GitHub - 3 maintainers
Top 8.7% on pypi.org
apache-airflow-backport-providers-pagerduty 2021.3.17
Backport provider package apache-airflow-backport-providers-pagerduty for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 255 downloads last month - 33,967 stars on GitHub - 3 maintainers
apache-airflow-backport-providers-plexus 2021.3.17
Backport provider package apache-airflow-backport-providers-plexus for Apache Airflow
6 versions - Latest release: about 3 years ago - 1 dependent repositories - 17 downloads last month - 33,967 stars on GitHub - 2 maintainers
Top 6.9% on pypi.org
apache-airflow-backport-providers-http 2021.4.10
Backport provider package apache-airflow-backport-providers-http for Apache Airflow
18 versions - Latest release: about 3 years ago - 1 dependent repositories - 9.48 thousand downloads last month - 33,967 stars on GitHub - 3 maintainers
bytehub 0.4.0
ByteHub Timeseries Feature Store
22 versions - Latest release: about 3 years ago - 223 downloads last month - 57 stars on GitHub - 1 maintainer
journalpdfscraper 0.2.1
A project to check if articles are free or paid
3 versions - Latest release: about 3 years ago - 1 dependent repositories - 6 downloads last month - 1 stars on GitHub - 1 maintainer
pangeo-forge 0.0.0
Pipeline tools for building and publishing analysis ready datasets
1 version - Latest release: about 3 years ago - 1 dependent repositories - 18 downloads last month - 111 stars on GitHub - 1 maintainer
zapr-athena-client 0.1
It is a python library to run the presto query on the AWS Athena.
1 version - Latest release: about 3 years ago - 1 dependent repositories - 5 downloads last month - 1 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
airbyte-cdk-test 0.1.0rc3
A framework for writing Airbyte Connectors.
2 versions - Latest release: about 3 years ago - 7 dependent repositories - 2 downloads last month - 13,821 stars on GitHub - 1 maintainer
data-science-kit 0.0.1
Data Science Basic Functions
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 16 downloads last month - 1 stars on GitHub - 1 maintainer
contessa 0.2.12
Data-quality framework
14 versions - Latest release: almost 3 years ago - 1 dependent repositories - 23 downloads last month - 18 stars on GitHub - 2 maintainers
Top 3.8% on pypi.org
apache-airflow-upgrade-check 1.4.0
Check for compatibility between Airflow versions
12 versions - Latest release: almost 3 years ago - 6 dependent repositories - 12.6 thousand downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 5.2% on pypi.org
dagster-cron 0.11.16
A Dagster integration for cron
235 versions - Latest release: almost 3 years ago - 3 dependent repositories - 2.89 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
dataexpectations 0.0.6
Is your data meeting all your expecations
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 10 downloads last month - 1 stars on GitHub - 1 maintainer
viewflow 0.1.0
Viewflow is an Airflow-based framework that allows data scientists to create data models without ...
2 versions - Latest release: almost 3 years ago - 4 dependent repositories - 53 downloads last month - 120 stars on GitHub - 2 maintainers
resilient-exporters 0.1.6
A package to export data to databases resiliently.
7 versions - Latest release: over 2 years ago - 1 dependent repositories - 5 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
hive-metastore-client 1.0.9
A client for connecting and running DDLs on Hive Metastore with Thrift protocol
10 versions - Latest release: over 2 years ago - 1 dependent package - 3 dependent repositories - 122 thousand downloads last month - 46 stars on GitHub - 1 maintainer
beneath 1.4.2
Python client and CLI for Beneath (https://beneath.dev/)
39 versions - Latest release: over 2 years ago - 2 dependent repositories - 373 downloads last month - 79 stars on GitHub - 2 maintainers
cargo-crates 0.0.1
An easy way to build data extractors in Docker.
1 version - Latest release: over 2 years ago - 1 dependent repositories - 29 downloads last month - 1 stars on GitHub - 1 maintainer
funsies 0.8.1
Funsies is a library to build and execution engine for reproducible, composable and data-persiste...
4 versions - Latest release: over 2 years ago - 1 dependent repositories - 32 downloads last month - 38 stars on GitHub - 1 maintainer
py-dagger 0.4.2
Define sophisticated data pipelines with Python and run them on different distributed systems (su...
8 versions - Latest release: over 2 years ago - 1 dependent repositories - 79 downloads last month - 13 stars on GitHub - 1 maintainer
airbyte-cdk-velocity-amazon 0.1.1
A framework for writing Airbyte Connectors.
1 version - Latest release: over 2 years ago - 1 dependent repositories - 8 downloads last month - 12,521 stars on GitHub - 1 maintainer
airbyte-cdk-velocity 0.1.32
A framework for writing Airbyte Connectors.
9 versions - Latest release: over 2 years ago - 1 dependent repositories - 95 downloads last month - 12,512 stars on GitHub - 1 maintainer
Top 9.2% on pypi.org
lakehouse 0.12.15
An orchestration platform for the development, production, and observation of data assets.
167 versions - Latest release: over 2 years ago - 1 dependent repositories - 1.53 thousand downloads last month - 10,330 stars on GitHub - 1 maintainer
py-dagger-contrib 0.4.0
Extensions for the Dagger library (py-dagger in PyPI).
4 versions - Latest release: over 2 years ago - 1 dependent repositories - 12 downloads last month - 1 stars on GitHub - 1 maintainer
apache-superset-11680 1.3.1
A modern, enterprise-ready business intelligence web application
1 version - Latest release: over 2 years ago - 25 downloads last month - 58,575 stars on GitHub - 1 maintainer
apache-superset-11680-1000 1.3.1
A modern, enterprise-ready business intelligence web application
1 version - Latest release: over 2 years ago - 20 downloads last month - 58,575 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
pyspark-test 0.2.0
Check that left and right spark DataFrame are equal.
2 versions - Latest release: over 2 years ago - 5 dependent repositories - 168 thousand downloads last month - 20 stars on GitHub - 1 maintainer
prefect-saturn 0.6.0
Client library for running Prefect Cloud flows in Saturn Cloud
14 versions - Latest release: over 2 years ago - 1 dependent repositories - 110 downloads last month - 16 stars on GitHub - 1 maintainer
totype 0.1.0
Data converter
1 version - Latest release: over 2 years ago - 1 dependent repositories - 15 downloads last month - 0 stars on GitHub - 1 maintainer
cacheml 1.0.4
Cache ML -- layer on top of joblib to cache parsed datasets, dramatically reducing load time of l...
1 version - Latest release: over 2 years ago - 8 downloads last month - 1 stars on GitHub - 1 maintainer
analytics-command-center 3.0.14
Command Center for Data Ingestion, Advanced Analytics and Artificial Intelligence process
1 version - Latest release: over 2 years ago - 26 downloads last month - 11 stars on GitHub - 1 maintainer
bitcoin-etl 1.5.2
Tools for exporting Bitcoin blockchain data to JSON
10 versions - Latest release: over 2 years ago - 1 dependent repositories - 189 downloads last month - 369 stars on GitHub - 1 maintainer
dbt-sugar 0.2.0
A sweet CLI tool to help dbt users enforce documentation and testing on their dbt projects.
17 versions - Latest release: over 2 years ago - 1 dependent repositories - 150 downloads last month - 149 stars on GitHub - 1 maintainer
metastore 1.0.0.dev21
Metastore Python SDK. Feature store and data catalog for machine learning.
21 versions - Latest release: over 2 years ago - 1 dependent repositories - 119 downloads last month - 0 stars on GitHub - 1 maintainer
gcp-airflow-foundations-dev-jiny 0.2.9
Opinionated framework based on Airflow 2.0 for building pipelines to ingest data into a BigQuery ...
1 version - Latest release: over 2 years ago - 1 dependent repositories - 10 downloads last month - 11 stars on GitHub - 1 maintainer
pandasecharts 0.4.2
Visualize your pandas data with one-line code
9 versions - Latest release: over 2 years ago - 1 dependent repositories - 93 downloads last month - 4 stars on GitHub - 1 maintainer
uberjob 1.0.0
uberjob is a Python package for building and running call graphs.
2 versions - Latest release: over 2 years ago - 1 dependent repositories - 4.88 thousand downloads last month - 27 stars on GitHub - 1 maintainer
plugin-package-template 0.1.477708478
Plugin template project used to quick start development of a new Versatile Data Kit SDK plugin.
10 versions - Latest release: about 2 years ago - 1 dependent repositories - 8 downloads last month - 408 stars on GitHub - 1 maintainer
sage-superset 1.0.0
A modern, enterprise-ready business intelligence web application
26 versions - Latest release: about 2 years ago - 1 dependent repositories - 9 downloads last month - 58,575 stars on GitHub - 1 maintainer
siphon-data 0.6.1
A data engineering utility library for siphoning data around
3 versions - Latest release: about 2 years ago - 1 dependent repositories - 23 downloads last month - 0 stars on GitHub - 1 maintainer
Top 8.4% on pypi.org
cauldron-notebook 1.0.9
The Unnotebook: Data Analysis Environment
66 versions - Latest release: about 2 years ago - 4 dependent repositories - 612 downloads last month - 78 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
soda-spark 0.3.3
Soda SQL API for PySpark data frame
11 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 46.3 thousand downloads last month - 60 stars on GitHub - 1 maintainer
data-hopper 0.1.0
Package for data wrangling in python.
1 version - Latest release: about 2 years ago - 1 dependent repositories - 8 downloads last month - 1 stars on GitHub - 1 maintainer
pyduct 0.0.1
A framework for building and running simple data engineering pipelines in Python.
1 version - Latest release: about 2 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 1 maintainer
risk-command-center 1.0.37
Risk Command Center, manage your risk easly.
2 versions - Latest release: almost 2 years ago - 1 dependent repositories - 10 downloads last month - 11 stars on GitHub - 1 maintainer
Related Keywords
python (627)
etl (527)
data-integration (441)
elt (424)
data (402)
pipeline (363)
snowflake (341)
data-analysis (338)
data-science (322)
bigquery (295)
mysql (289)
data-pipeline (289)
redshift (288)
postgresql (287)
data-collection (284)
s3 (284)
java (283)
change-data-capture (283)
mssql (281)
self-hosted (278)
data-pipelines (206)
mlops (182)
workflow (175)
orchestration (167)
scheduler (149)
analytics (149)
data-orchestrator (148)
machine-learning (129)
apache (113)
airflow (101)
workflow-engine (101)
automation (97)
dag (95)
apache-airflow (89)
workflow-orchestration (84)
sql (71)
integration (67)
metadata (66)
database (65)
workflow-automation (64)
dagster (63)
airflow-provider (63)
data-warehouse (55)
trino (55)
dataops (53)
data-engineering-pipeline (51)
data-engineer (50)
data-structures (50)
warehouse (49)
data-lineage (49)
data-analytics (40)
data-visualization (35)
data-quality (30)
apache-superset (29)
asf (29)
pipelines (29)
bi (29)
business-analytics (29)
business-intelligence (29)
data-viz (29)
flask (29)
react (29)
sql-editor (29)
superset (29)
pandas (25)
spark (20)
dataframe (17)
dbt (15)
hacktoberfest (14)
aws (14)
data-ops (14)
etl-pipeline (14)
dataquality (14)
feature-engineering (13)
big-data (13)
prefect (13)
postgres (13)
observability (12)
framework (12)
kubernetes (12)
pyspark (11)
infrastructure (11)
airbyte (10)
python3 (10)
feature-store (10)
data-unit-tests (10)
data-lake (9)
data-profiling (9)
etl-framework (9)
ml-ops (8)
llmops (8)
connector-development-kit (8)
data-versioning (8)
cdk (8)
data engineering (8)
batch-processing (8)
data-version-control (7)
parquet (7)
ai (7)
ml (7)