Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data-engineering" keyword

panoramix 0.8.0
A interactive data visualization platform build on SqlAlchemy and druid.io
11 versions - Latest release: about 8 years ago - 2 dependent repositories - 51 downloads last month - 58,575 stars on GitHub - 1 maintainer
abel-airflow 1.7.1.3.post3
Programmatically author, schedule and monitor data pipelines
1 version - Latest release: over 7 years ago - 13 downloads last month - 34,219 stars on GitHub - 1 maintainer
creavel 0.11.0
A interactive data visualization platform build on SqlAlchemy and druid.io
1 version - Latest release: over 7 years ago - 1 dependent repositories - 9 downloads last month - 58,575 stars on GitHub - 1 maintainer
resp 0.1.2
Make the Redis Mass Insertion by using the REdis Serialization Protocol (RESP) simple.
1 version - Latest release: over 6 years ago - 4 dependent repositories - 220 downloads last month - 0 stars on GitHub - 1 maintainer
superset-erikxt 0.26.0
A interactive data visualization platform build on SqlAlchemy and druid.io
2 versions - Latest release: almost 6 years ago - 1 dependent repositories - 11 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-erik 0.26.0
A interactive data visualization platform build on SqlAlchemy and druid.io
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 15 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-dywx 0.26.3
A modern, enterprise-ready business intelligence web application
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 20 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-growth 0.26.3
A modern, enterprise-ready business intelligence web application
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 17 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-d1 0.26.3
A modern, enterprise-ready business intelligence web application
2 versions - Latest release: almost 6 years ago - 1 dependent repositories - 21 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-d2 0.26.3
A modern, enterprise-ready business intelligence web application
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 19 downloads last month - 58,575 stars on GitHub - 1 maintainer
superset-d3 0.26.3
A modern, enterprise-ready business intelligence web application
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 12 downloads last month - 58,575 stars on GitHub - 1 maintainer
heresuperset 0.27.6
A modern, enterprise-ready business intelligence web application
7 versions - Latest release: over 5 years ago - 1 dependent repositories - 48 downloads last month - 58,575 stars on GitHub - 2 maintainers
nifty-nesting 0.2.3
Python utilities for arbitrarily nested data structures.
6 versions - Latest release: over 5 years ago - 1 dependent repositories - 72 downloads last month - 1 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
quilt 2.9.15
Quilt is a data package manager
50 versions - Latest release: over 5 years ago - 6 dependent packages - 24 dependent repositories - 340 downloads last month - 1,311 stars on GitHub - 1 maintainer
pyxmlparser 0.1.2
CLI interface to convert XML into various formats
4 versions - Latest release: about 5 years ago - 1 dependent repositories - 48 downloads last month - 2 stars on GitHub - 1 maintainer
dagster-sqlalchemy 0.3.5
Utilities and examples for working with SQLAlchemy and dagster, an opinionated framework for expr...
9 versions - Latest release: about 5 years ago - 1 dependent repositories - 294 downloads last month - 10,330 stars on GitHub - 1 maintainer
stepist 0.1.5
Data process utils
9 versions - Latest release: about 5 years ago - 1 dependent repositories - 50 downloads last month - 27 stars on GitHub - 1 maintainer
pandas-ext 0.5.1
Python Pandas extensions for pandas dataframes
22 versions - Latest release: about 5 years ago - 1 dependent repositories - 57 downloads last month - 4 stars on GitHub - 2 maintainers
practicalai 0.0.1
practicalAI · A practical approach to machine learning
5 versions - Latest release: almost 5 years ago - 1 dependent repositories - 35 downloads last month - 35,580 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
dagster-pagerduty 2019.6.2
Package for pagerduty Dagster framework components.
518 versions - Latest release: almost 5 years ago - 1 dependent repositories - 64.7 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
eos-etl 1.0.0
Tools for exporting EOS blockchain data to JSON
1 version - Latest release: almost 5 years ago - 1 dependent repositories - 7 downloads last month - 9 stars on GitHub - 1 maintainer
unofficial-superset 0.34.0
A modern, enterprise-ready business intelligence web application
4 versions - Latest release: over 4 years ago - 1 dependent repositories - 41 downloads last month - 58,575 stars on GitHub - 1 maintainer
alyeska 0.3.0a1
Alyeska /al-ee-EHS-kah/ n. A Data Pipeline Toolkit
3 versions - Latest release: over 4 years ago - 1 dependent repositories - 46 downloads last month - 3 stars on GitHub - 1 maintainer
quilt-installer 0.0.0a5
Quilt Data installation tool
4 versions - Latest release: over 4 years ago - 1 dependent repositories - 28 downloads last month - 1,311 stars on GitHub - 1 maintainer
superset-master-prasad-bhosale 0.999.0.dev0
A modern, enterprise-ready business intelligence web application
1 version - Latest release: over 4 years ago - 1 dependent repositories - 12 downloads last month - 58,575 stars on GitHub - 1 maintainer
auptimizer 1.0.1
An automatic ML model optimization tool.
7 versions - Latest release: over 4 years ago - 1 dependent repositories - 119 downloads last month - 200 stars on GitHub - 1 maintainer
apache-superset-jwi078 0.35.0
A modern, enterprise-ready business intelligence web application
1 version - Latest release: over 4 years ago - 1 dependent repositories - 17 downloads last month - 58,575 stars on GitHub - 1 maintainer
stairs-project 0.1.6
Framework for data processing using data pipelines
18 versions - Latest release: over 4 years ago - 1 dependent repositories - 126 downloads last month - 46 stars on GitHub - 1 maintainer
quilt-stack-installer 1.0.0
Quilt Data installation tool
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 8 downloads last month - 1,311 stars on GitHub - 1 maintainer
plai 0.0.0
Programming language to create data manipulation pipelines.
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 32 downloads last month - 2 stars on GitHub - 1 maintainer
apache-superset-johan078 0.35.2
A modern, enterprise-ready business intelligence web application
2 versions - Latest release: about 4 years ago - 1 dependent repositories - 10 downloads last month - 58,575 stars on GitHub - 1 maintainer
apache-superset-078 0.35.2
A modern, enterprise-ready business intelligence web application
1 version - Latest release: about 4 years ago - 19 downloads last month - 58,575 stars on GitHub - 1 maintainer
apache-superset-patched 0.35.2
A modern, enterprise-ready business intelligence web application
1 version - Latest release: about 4 years ago - 15 downloads last month - 58,575 stars on GitHub - 1 maintainer
apache-superset-patched-1 0.35.2
A modern, enterprise-ready business intelligence web application
1 version - Latest release: about 4 years ago - 23 downloads last month - 58,575 stars on GitHub - 1 maintainer
apache-superset-jw078 0.999.0.dev0
A modern, enterprise-ready business intelligence web application
1 version - Latest release: about 4 years ago - 20 downloads last month - 58,575 stars on GitHub - 1 maintainer
diver 0.2.3
diver is a series of tools to speed up common feature-set investigation, conditioning and encodin...
20 versions - Latest release: about 4 years ago - 1 dependent repositories - 136 downloads last month - 1 stars on GitHub - 1 maintainer
mario-python 1.7.0
A configurable data pipeline library.
9 versions - Latest release: almost 4 years ago - 1 dependent repositories - 61 downloads last month - 0 stars on GitHub - 1 maintainer
dagster-bash 0.7.16
Package for Dagster bash solids.
57 versions - Latest release: almost 4 years ago - 1 dependent repositories - 381 downloads last month - 10,330 stars on GitHub - 1 maintainer
apache-airflow-backport-providers-email 2020.6.24
Back-ported airflow.providers.email.* package for Airflow 1.10.*
5 versions - Latest release: almost 4 years ago - 1 dependent repositories - 86 downloads last month - 33,967 stars on GitHub - 3 maintainers
apache-superset-red 0.34.1
A modern, enterprise-ready business intelligence web application
1 version - Latest release: almost 4 years ago - 11 downloads last month - 57,957 stars on GitHub - 1 maintainer
streamsql 2.0.1
Python SDK for the StreamSQL feature store
14 versions - Latest release: almost 4 years ago - 1 dependent repositories - 48 downloads last month - 4 stars on GitHub - 1 maintainer
aiscalator 0.1.18
AIscalate your Jupyter Notebook Prototypes into Airflow Data Products
22 versions - Latest release: almost 4 years ago - 277 downloads last month - 5 stars on GitHub - 1 maintainer
coralinede 1.0.1
python library for data engineering
2 versions - Latest release: over 3 years ago - 1 dependent repositories - 15 downloads last month - 0 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
dagster-flyte 0.9.15
A Dagster integration for flyte
72 versions - Latest release: over 3 years ago - 1 dependent repositories - 583 downloads last month - 9,161 stars on GitHub - 1 maintainer
haiqv-streaming-dag-editor 1.0.0
A code editor and file manager about dag for haiqv-streaming
1 version - Latest release: over 3 years ago - 1 dependent repositories - 8 downloads last month - 33,967 stars on GitHub - 1 maintainer
alphalib 0.0.3
A library for your daily data engineering and data science routines.
3 versions - Latest release: over 3 years ago - 1 dependent repositories - 20 downloads last month - 1 stars on GitHub - 1 maintainer
sheetwork 1.0.7 💰
A handy CLI tool to ingest GoogleSheets into your database without writing a single line of code
27 versions - Latest release: over 3 years ago - 1 dependent repositories - 105 downloads last month - 16 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
apache-airflow-backport-providers-google 2021.3.3
Backport provider package apache-airflow-backport-providers-google for Apache Airflow
17 versions - Latest release: about 3 years ago - 1 dependent package - 55 dependent repositories - 6.21 thousand downloads last month - 34,343 stars on GitHub - 3 maintainers
apache-airflow-backport-providers-jenkins 2021.3.3
Backport provider package apache-airflow-backport-providers-jenkins for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 32 downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 7.0% on pypi.org
apache-airflow-backport-providers-apache-livy 2021.3.17
Backport provider package apache-airflow-backport-providers-apache-livy for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 6.03 thousand downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 9.9% on pypi.org
apache-airflow-backport-providers-apache-sqoop 2021.3.17
Backport provider package apache-airflow-backport-providers-apache-sqoop for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 68 downloads last month - 33,967 stars on GitHub - 3 maintainers
apache-airflow-backport-providers-cloudant 2021.3.17
Backport provider package apache-airflow-backport-providers-cloudant for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 67 downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 9.8% on pypi.org
apache-airflow-backport-providers-grpc 2021.3.17
Backport provider package apache-airflow-backport-providers-grpc for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 55 downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 7.6% on pypi.org
apache-airflow-backport-providers-mongo 2021.3.17
Backport provider package apache-airflow-backport-providers-mongo for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 144 downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 8.7% on pypi.org
apache-airflow-backport-providers-odbc 2021.3.17
Backport provider package apache-airflow-backport-providers-odbc for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 228 downloads last month - 34,343 stars on GitHub - 3 maintainers
Top 8.7% on pypi.org
apache-airflow-backport-providers-pagerduty 2021.3.17
Backport provider package apache-airflow-backport-providers-pagerduty for Apache Airflow
11 versions - Latest release: about 3 years ago - 1 dependent repositories - 255 downloads last month - 33,967 stars on GitHub - 3 maintainers
apache-airflow-backport-providers-plexus 2021.3.17
Backport provider package apache-airflow-backport-providers-plexus for Apache Airflow
6 versions - Latest release: about 3 years ago - 1 dependent repositories - 17 downloads last month - 33,967 stars on GitHub - 2 maintainers
Top 6.9% on pypi.org
apache-airflow-backport-providers-http 2021.4.10
Backport provider package apache-airflow-backport-providers-http for Apache Airflow
18 versions - Latest release: about 3 years ago - 1 dependent repositories - 9.48 thousand downloads last month - 33,967 stars on GitHub - 3 maintainers
bytehub 0.4.0
ByteHub Timeseries Feature Store
22 versions - Latest release: about 3 years ago - 223 downloads last month - 57 stars on GitHub - 1 maintainer
journalpdfscraper 0.2.1
A project to check if articles are free or paid
3 versions - Latest release: about 3 years ago - 1 dependent repositories - 6 downloads last month - 1 stars on GitHub - 1 maintainer
pangeo-forge 0.0.0
Pipeline tools for building and publishing analysis ready datasets
1 version - Latest release: about 3 years ago - 1 dependent repositories - 18 downloads last month - 111 stars on GitHub - 1 maintainer
zapr-athena-client 0.1
It is a python library to run the presto query on the AWS Athena.
1 version - Latest release: about 3 years ago - 1 dependent repositories - 5 downloads last month - 1 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
airbyte-cdk-test 0.1.0rc3
A framework for writing Airbyte Connectors.
2 versions - Latest release: about 3 years ago - 7 dependent repositories - 2 downloads last month - 13,821 stars on GitHub - 1 maintainer
data-science-kit 0.0.1
Data Science Basic Functions
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 16 downloads last month - 1 stars on GitHub - 1 maintainer
contessa 0.2.12
Data-quality framework
14 versions - Latest release: almost 3 years ago - 1 dependent repositories - 23 downloads last month - 18 stars on GitHub - 2 maintainers
Top 3.8% on pypi.org
apache-airflow-upgrade-check 1.4.0
Check for compatibility between Airflow versions
12 versions - Latest release: almost 3 years ago - 6 dependent repositories - 12.6 thousand downloads last month - 33,967 stars on GitHub - 3 maintainers
Top 5.2% on pypi.org
dagster-cron 0.11.16
A Dagster integration for cron
235 versions - Latest release: almost 3 years ago - 3 dependent repositories - 2.89 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
dataexpectations 0.0.6
Is your data meeting all your expecations
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 10 downloads last month - 1 stars on GitHub - 1 maintainer
viewflow 0.1.0
Viewflow is an Airflow-based framework that allows data scientists to create data models without ...
2 versions - Latest release: almost 3 years ago - 4 dependent repositories - 53 downloads last month - 120 stars on GitHub - 2 maintainers
resilient-exporters 0.1.6
A package to export data to databases resiliently.
7 versions - Latest release: over 2 years ago - 1 dependent repositories - 5 downloads last month - 0 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
hive-metastore-client 1.0.9
A client for connecting and running DDLs on Hive Metastore with Thrift protocol
10 versions - Latest release: over 2 years ago - 1 dependent package - 3 dependent repositories - 122 thousand downloads last month - 46 stars on GitHub - 1 maintainer
beneath 1.4.2
Python client and CLI for Beneath (https://beneath.dev/)
39 versions - Latest release: over 2 years ago - 2 dependent repositories - 373 downloads last month - 79 stars on GitHub - 2 maintainers
cargo-crates 0.0.1
An easy way to build data extractors in Docker.
1 version - Latest release: over 2 years ago - 1 dependent repositories - 29 downloads last month - 1 stars on GitHub - 1 maintainer
funsies 0.8.1
Funsies is a library to build and execution engine for reproducible, composable and data-persiste...
4 versions - Latest release: over 2 years ago - 1 dependent repositories - 32 downloads last month - 38 stars on GitHub - 1 maintainer
py-dagger 0.4.2
Define sophisticated data pipelines with Python and run them on different distributed systems (su...
8 versions - Latest release: over 2 years ago - 1 dependent repositories - 79 downloads last month - 13 stars on GitHub - 1 maintainer
airbyte-cdk-velocity-amazon 0.1.1
A framework for writing Airbyte Connectors.
1 version - Latest release: over 2 years ago - 1 dependent repositories - 8 downloads last month - 12,521 stars on GitHub - 1 maintainer
airbyte-cdk-velocity 0.1.32
A framework for writing Airbyte Connectors.
9 versions - Latest release: over 2 years ago - 1 dependent repositories - 95 downloads last month - 12,512 stars on GitHub - 1 maintainer
Top 9.2% on pypi.org
lakehouse 0.12.15
An orchestration platform for the development, production, and observation of data assets.
167 versions - Latest release: over 2 years ago - 1 dependent repositories - 1.53 thousand downloads last month - 10,330 stars on GitHub - 1 maintainer
py-dagger-contrib 0.4.0
Extensions for the Dagger library (py-dagger in PyPI).
4 versions - Latest release: over 2 years ago - 1 dependent repositories - 12 downloads last month - 1 stars on GitHub - 1 maintainer
apache-superset-11680 1.3.1
A modern, enterprise-ready business intelligence web application
1 version - Latest release: over 2 years ago - 25 downloads last month - 58,575 stars on GitHub - 1 maintainer
apache-superset-11680-1000 1.3.1
A modern, enterprise-ready business intelligence web application
1 version - Latest release: over 2 years ago - 20 downloads last month - 58,575 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
pyspark-test 0.2.0
Check that left and right spark DataFrame are equal.
2 versions - Latest release: over 2 years ago - 5 dependent repositories - 168 thousand downloads last month - 20 stars on GitHub - 1 maintainer
prefect-saturn 0.6.0
Client library for running Prefect Cloud flows in Saturn Cloud
14 versions - Latest release: over 2 years ago - 1 dependent repositories - 110 downloads last month - 16 stars on GitHub - 1 maintainer
totype 0.1.0
Data converter
1 version - Latest release: over 2 years ago - 1 dependent repositories - 15 downloads last month - 0 stars on GitHub - 1 maintainer
cacheml 1.0.4
Cache ML -- layer on top of joblib to cache parsed datasets, dramatically reducing load time of l...
1 version - Latest release: over 2 years ago - 8 downloads last month - 1 stars on GitHub - 1 maintainer
analytics-command-center 3.0.14
Command Center for Data Ingestion, Advanced Analytics and Artificial Intelligence process
1 version - Latest release: over 2 years ago - 26 downloads last month - 11 stars on GitHub - 1 maintainer
bitcoin-etl 1.5.2
Tools for exporting Bitcoin blockchain data to JSON
10 versions - Latest release: over 2 years ago - 1 dependent repositories - 189 downloads last month - 369 stars on GitHub - 1 maintainer
dbt-sugar 0.2.0 💰
A sweet CLI tool to help dbt users enforce documentation and testing on their dbt projects.
17 versions - Latest release: over 2 years ago - 1 dependent repositories - 150 downloads last month - 149 stars on GitHub - 1 maintainer
metastore 1.0.0.dev21
Metastore Python SDK. Feature store and data catalog for machine learning.
21 versions - Latest release: over 2 years ago - 1 dependent repositories - 119 downloads last month - 0 stars on GitHub - 1 maintainer
gcp-airflow-foundations-dev-jiny 0.2.9
Opinionated framework based on Airflow 2.0 for building pipelines to ingest data into a BigQuery ...
1 version - Latest release: over 2 years ago - 1 dependent repositories - 10 downloads last month - 11 stars on GitHub - 1 maintainer
pandasecharts 0.4.2
Visualize your pandas data with one-line code
9 versions - Latest release: over 2 years ago - 1 dependent repositories - 93 downloads last month - 4 stars on GitHub - 1 maintainer
uberjob 1.0.0
uberjob is a Python package for building and running call graphs.
2 versions - Latest release: over 2 years ago - 1 dependent repositories - 4.88 thousand downloads last month - 27 stars on GitHub - 1 maintainer
plugin-package-template 0.1.477708478
Plugin template project used to quick start development of a new Versatile Data Kit SDK plugin.
10 versions - Latest release: about 2 years ago - 1 dependent repositories - 8 downloads last month - 408 stars on GitHub - 1 maintainer
sage-superset 1.0.0
A modern, enterprise-ready business intelligence web application
26 versions - Latest release: about 2 years ago - 1 dependent repositories - 9 downloads last month - 58,575 stars on GitHub - 1 maintainer
siphon-data 0.6.1
A data engineering utility library for siphoning data around
3 versions - Latest release: about 2 years ago - 1 dependent repositories - 23 downloads last month - 0 stars on GitHub - 1 maintainer
Top 8.4% on pypi.org
cauldron-notebook 1.0.9
The Unnotebook: Data Analysis Environment
66 versions - Latest release: about 2 years ago - 4 dependent repositories - 612 downloads last month - 78 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
soda-spark 0.3.3
Soda SQL API for PySpark data frame
11 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 46.3 thousand downloads last month - 60 stars on GitHub - 1 maintainer
data-hopper 0.1.0
Package for data wrangling in python.
1 version - Latest release: about 2 years ago - 1 dependent repositories - 8 downloads last month - 1 stars on GitHub - 1 maintainer
pyduct 0.0.1
A framework for building and running simple data engineering pipelines in Python.
1 version - Latest release: about 2 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 1 maintainer
risk-command-center 1.0.37
Risk Command Center, manage your risk easly.
2 versions - Latest release: almost 2 years ago - 1 dependent repositories - 10 downloads last month - 11 stars on GitHub - 1 maintainer
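
Each entry above ends with a statistics line of " - "-separated fields (versions, latest release, dependents, downloads, stars, maintainers). The following is a minimal sketch of how such a line could be turned into a dictionary, assuming the field wording stays exactly as shown in this listing. The function names and dictionary keys are hypothetical labels chosen for illustration and are not part of any Ecosyste.ms API.

```python
import re

# Hypothetical mapping from the wording used in this listing to dictionary keys.
# The keys are illustrative labels only.
_LABELS = {
    "version": "versions",
    "versions": "versions",
    "dependent package": "dependent_packages",
    "dependent packages": "dependent_packages",
    "dependent repositories": "dependent_repositories",
    "downloads last month": "downloads_last_month",
    "stars on GitHub": "stars",
    "maintainer": "maintainers",
    "maintainers": "maintainers",
}

# A count such as "58,575", "1" or "64.7 thousand", followed by its label.
_COUNT = re.compile(r"^([\d.,]+)(?: (thousand|million))? (.+)$")


def parse_number(text, scale=None):
    """Convert '58,575' or '64.7' (with an optional scale word) to an int."""
    value = float(text.replace(",", ""))
    if scale == "thousand":
        value *= 1_000
    elif scale == "million":
        value *= 1_000_000
    return int(round(value))


def parse_stats(line):
    """Parse one statistics line from the listing into a dict."""
    stats = {}
    for part in line.split(" - "):
        part = part.strip()
        if part.startswith("Latest release: "):
            # Kept as free text, since the listing only gives a relative age.
            stats["latest_release"] = part[len("Latest release: "):]
            continue
        match = _COUNT.match(part)
        if not match:
            continue  # Unrecognised field: skip rather than guess.
        number, scale, label = match.groups()
        key = _LABELS.get(label)
        if key:
            stats[key] = parse_number(number, scale)
    return stats


if __name__ == "__main__":
    example = (
        "518 versions - Latest release: almost 5 years ago - "
        "1 dependent repositories - 64.7 thousand downloads last month - "
        "9,191 stars on GitHub - 1 maintainer"
    )
    print(parse_stats(example))
```

Fields that an entry omits (for example, no dependent repositories) are simply absent from the result, so callers may want to use `dict.get` with a default. Rounded figures such as "64.7 thousand" are approximations in the listing itself, so the parsed value is only as precise as the text.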