Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

conda-forge.org "data-engineering" keyword

pangeo-forge-recipes 0.9.2
Pangeo Forge is an open source platform for data Extraction, Transformation, and Loading (ETL). T...
5 versions - Latest release: over 1 year ago - 1 dependent package - 4 dependent repositories - 95 stars on GitHub
dagster-celery 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 2 dependent packages - 6,905 stars on GitHub
dagster-dask 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-postgres 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 4 dependent packages - 6,905 stars on GitHub
dagster-celery-docker 1.0.17
An orchestration platform for the development, production, and observation of data assets.
98 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_slack 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_aws 0.6.6
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-duckdb-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_dbt 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_bash 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_ssh 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_pandas 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 1 dependent package - 6,905 stars on GitHub
dagster_ge 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-k8s 1.0.17
An orchestration platform for the development, production, and observation of data assets.
109 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster_datadog 0.6.5
An orchestration platform for the development, production, and observation of data assets.
1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-celery-k8s 1.0.17
Dagster lets you define pipelines in terms of the data flow between reusable, logical components,...
98 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-github 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_graphql 0.6.7
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 4 years ago - 2 dependent packages - 6,905 stars on GitHub
dagster-spark 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster-airbyte 1.0.17
An orchestration platform for the development, production, and observation of data assets.
47 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-aws 1.0.17
An orchestration platform for the development, production, and observation of data assets.
55 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_cron 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_postgres 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-managed-elements 1.0.17
An orchestration platform for the development, production, and observation of data assets.
3 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-bash 0.7.16
An orchestration platform for the development, production, and observation of data assets.
11 versions - Latest release: almost 4 years ago - 6,905 stars on GitHub
dagster-duckdb-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-census 1.0.17
An orchestration platform for the development, production, and observation of data assets.
12 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_snowflake 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-shell 1.0.17
An orchestration platform for the development, production, and observation of data assets.
103 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
Top 7.6% on conda-forge.org
dagster 1.0.17
Dagster lets you define pipelines in terms of the data flow between reusable, logical components,...
119 versions - Latest release: over 1 year ago - 60 dependent packages - 2 dependent repositories - 6,905 stars on GitHub
dagster-dbt 1.0.17
An orchestration platform for the development, production, and observation of data assets.
106 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-snowflake-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.
12 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-gcp 1.0.17
An orchestration platform for the development, production, and observation of data assets.
112 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-ssh 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_twilio 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-prometheus 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-mlflow 1.0.17
An orchestration platform for the development, production, and observation of data assets.
70 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-fivetran 1.0.17
An orchestration platform for the development, production, and observation of data assets.
56 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-pandera 1.0.17
An orchestration platform for the development, production, and observation of data assets.
34 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-duckdb 1.0.17
An orchestration platform for the development, production, and observation of data assets.
5 versions - Latest release: over 1 year ago - 2 dependent packages - 6,905 stars on GitHub
dagster-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.
112 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-mysql 1.0.17
An orchestration platform for the development, production, and observation of data assets.
82 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-snowflake 1.0.17
An orchestration platform for the development, production, and observation of data assets.
108 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster_papertrail 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-papertrail 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-cron 0.11.15
An orchestration platform for the development, production, and observation of data assets.
44 versions - Latest release: almost 3 years ago - 6,905 stars on GitHub
dagster-graphql 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6 dependent packages - 6,905 stars on GitHub
dagster-datadog 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_dask 0.6.5
An orchestration platform for the development, production, and observation of data assets.
1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-docker 1.0.17
An orchestration platform for the development, production, and observation of data assets.
86 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-slack 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-twilio 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
r-uptasticsearch 0.4.0
An Elasticsearch client tailored to data science workflows.
1 version - Latest release: about 4 years ago - 47 stars on GitHub
data-diff 0.2.8
Efficiently diff data in or across relational databases
6 versions - Latest release: over 1 year ago - 1,766 stars on GitHub
pyjanitor 0.23.1
Clean APIs for data cleaning. Python implementation of R package Janitor
51 versions - Latest release: over 1 year ago - 1 dependent package - 14 dependent repositories - 1,124 stars on GitHub
feast 0.26.0
Feature Store for Machine Learning
1 version - Latest release: over 1 year ago - 4,088 stars on GitHub
pyspark-test 0.2.0
Testing library for pyspark, inspired by the pandas testing module but for pyspark, to help users w...
2 versions - Latest release: over 2 years ago - 14 stars on GitHub
Top 4.9% on conda-forge.org
prefect 2.6.7
Prefect is a workflow management system, designed for modern infrastructure and powered by the op...
133 versions - Latest release: over 1 year ago - 8 dependent packages - 41 dependent repositories - 11,520 stars on GitHub
quilt 3.0.6
Quilt is infrastructure for data-driven teams to store, version, deploy and iterate on models and...
2 versions - Latest release: over 4 years ago - 1,226 stars on GitHub
quilt3 5.0.0
Quilt is infrastructure for data-driven teams to store, version, deploy and iterate on models and...
21 versions - Latest release: about 2 years ago - 4 dependent packages - 7 dependent repositories - 1,226 stars on GitHub
mage-ai 0.7.5
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and t...
22 versions - Latest release: over 1 year ago - 3,631 stars on GitHub
great-expectations 0.15.32
Great Expectations helps teams save time and promote analytic integrity by offering a unique appr...
144 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repository - 8,121 stars on GitHub
superset 2.0.0
Apache Superset is a Data Visualization and Data Exploration Platform
12 versions - Latest release: almost 2 years ago - 51,076 stars on GitHub
apache-airflow-providers-samba 4.0.0
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
7 versions - Latest release: almost 2 years ago - 33,057 stars on GitHub
sf-hamilton 1.11.0
A scalable general purpose micro-framework for defining dataflows. You can use it to build datafr...
12 versions - Latest release: over 1 year ago - 688 stars on GitHub
ploomber 0.21.7
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
60 versions - Latest release: over 1 year ago - 2 dependent repositories - 3,017 stars on GitHub
ploomber-scaffold 0.3.1
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️
10 versions - Latest release: about 2 years ago - 1 dependent package - 3,021 stars on GitHub
gspread-pandas 2.2.3
A package to easily open an instance of a Google spreadsheet and interact with worksheets through...
3 versions - Latest release: about 4 years ago - 357 stars on GitHub
awswrangler 2.17.0
An open-source Python package that extends the power of Pandas library to AWS connecting DataFram...
81 versions - Latest release: over 1 year ago - 1 dependent repository - 3,363 stars on GitHub
dagster-msteams 1.0.17
An orchestration platform for the development, production, and observation of data assets.
56 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagit 1.0.17
An orchestration platform for the development, production, and observation of data assets.
118 versions - Latest release: over 1 year ago - 2 dependent repositories - 6,905 stars on GitHub
dagster_pagerduty 0.6.4
An orchestration platform for the development, production, and observation of data assets.
1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-ge 1.0.17
An orchestration platform for the development, production, and observation of data assets.
103 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-pagerduty 1.0.17
An orchestration platform for the development, production, and observation of data assets.
114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_spark 0.6.5
An orchestration platform for the development, production, and observation of data assets.
2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_gcp 0.6.5
An orchestration platform for the development, production, and observation of data assets.
1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-airflow 1.0.17
An orchestration platform for the development, production, and observation of data assets.
110 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagstermill 1.0.17
An orchestration platform for the development, production, and observation of data assets.
112 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
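
Each entry in this listing ends with a summary line of the form "N versions - Latest release: ... - N stars on GitHub", with optional dependent-package and dependent-repository counts. The following is a minimal sketch of how such a line could be turned into structured data. It works only on the text shown on this page and does not call the Ecosyste.ms API; the names `PackageSummary` and `parse_summary` are hypothetical and chosen for illustration.

```python
import re
from dataclasses import dataclass
from typing import Dict, Optional

# Hypothetical container for one summary line from the listing.
@dataclass
class PackageSummary:
    versions: int
    latest_release: str
    dependent_packages: int = 0
    dependent_repositories: int = 0
    stars: Optional[int] = None

# Matches fields such as "6,905 stars on GitHub" or "1 version".
_COUNT_FIELD = re.compile(r"^([\d,]+)\s+(.+)$")


def parse_summary(line: str) -> PackageSummary:
    """Parse one ' - '-separated summary line into a PackageSummary."""
    counts: Dict[str, int] = {}
    latest_release = ""
    for part in line.split(" - "):
        part = part.strip()
        if part.startswith("Latest release:"):
            latest_release = part[len("Latest release:"):].strip()
            continue
        match = _COUNT_FIELD.match(part)
        if match:
            # Drop thousands separators before converting to int.
            counts[match.group(2)] = int(match.group(1).replace(",", ""))

    def pick(*labels: str) -> Optional[int]:
        # Accept singular and plural labels ("1 version", "114 versions").
        for label in labels:
            if label in counts:
                return counts[label]
        return None

    return PackageSummary(
        versions=pick("versions", "version") or 0,
        latest_release=latest_release,
        dependent_packages=pick("dependent packages", "dependent package") or 0,
        dependent_repositories=pick(
            "dependent repositories", "dependent repository"
        ) or 0,
        stars=pick("stars on GitHub", "star on GitHub"),
    )


if __name__ == "__main__":
    example = (
        "119 versions - Latest release: over 1 year ago - 60 dependent packages"
        " - 2 dependent repositories - 6,905 stars on GitHub"
    )
    print(parse_summary(example))
```

Fields missing from a line, such as the dependent counts on most entries, fall back to zero, and the relative release age is kept as a plain string because the listing gives no exact date.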