Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
conda-forge.org "data-engineering" keyword
pangeo-forge-recipes 0.9.2
Pangeo Forge is an open source platform for data Extraction, Transformation, and Loading (ETL). T...5 versions - Latest release: over 1 year ago - 1 dependent package - 4 dependent repositories - 95 stars on GitHub
dagster-celery 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 2 dependent packages - 6,905 stars on GitHub
dagster-dask 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-postgres 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 4 dependent packages - 6,905 stars on GitHub
dagster-celery-docker 1.0.17
An orchestration platform for the development, production, and observation of data assets.98 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_slack 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_aws 0.6.6
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-duckdb-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_dbt 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_bash 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_ssh 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_pandas 0.6.7
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 4 years ago - 1 dependent package - 6,905 stars on GitHub
dagster_ge 0.6.7
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-k8s 1.0.17
An orchestration platform for the development, production, and observation of data assets.109 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster_datadog 0.6.5
An orchestration platform for the development, production, and observation of data assets.1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-celery-k8s 1.0.17
Dagster lets you define pipelines in terms of the data flow between reusable, logical components,...98 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-github 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_graphql 0.6.7
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 4 years ago - 2 dependent packages - 6,905 stars on GitHub
dagster-spark 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster-airbyte 1.0.17
An orchestration platform for the development, production, and observation of data assets.47 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-aws 1.0.17
An orchestration platform for the development, production, and observation of data assets.55 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_cron 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_postgres 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-managed-elements 1.0.17
An orchestration platform for the development, production, and observation of data assets.3 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-bash 0.7.16
An orchestration platform for the development, production, and observation of data assets.11 versions - Latest release: almost 4 years ago - 6,905 stars on GitHub
dagster-duckdb-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-census 1.0.17
An orchestration platform for the development, production, and observation of data assets.12 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_snowflake 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-shell 1.0.17
An orchestration platform for the development, production, and observation of data assets.103 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
Top 7.6% on conda-forge.org
119 versions - Latest release: over 1 year ago - 60 dependent packages - 2 dependent repositories - 6,905 stars on GitHub
dagster 1.0.17
Dagster lets you define pipelines in terms of the data flow between reusable, logical components,...119 versions - Latest release: over 1 year ago - 60 dependent packages - 2 dependent repositories - 6,905 stars on GitHub
dagster-dbt 1.0.17
An orchestration platform for the development, production, and observation of data assets.106 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-snowflake-pandas 1.0.17
An orchestration platform for the development, production, and observation of data assets.12 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-gcp 1.0.17
An orchestration platform for the development, production, and observation of data assets.112 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-ssh 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_twilio 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-prometheus 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-mlflow 1.0.17
An orchestration platform for the development, production, and observation of data assets.70 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-fivetran 1.0.17
An orchestration platform for the development, production, and observation of data assets.56 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-pandera 1.0.17
An orchestration platform for the development, production, and observation of data assets.34 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-duckdb 1.0.17
An orchestration platform for the development, production, and observation of data assets.5 versions - Latest release: over 1 year ago - 2 dependent packages - 6,905 stars on GitHub
dagster-pyspark 1.0.17
An orchestration platform for the development, production, and observation of data assets.112 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-mysql 1.0.17
An orchestration platform for the development, production, and observation of data assets.82 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-snowflake 1.0.17
An orchestration platform for the development, production, and observation of data assets.108 versions - Latest release: over 1 year ago - 1 dependent package - 6,905 stars on GitHub
dagster_papertrail 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-papertrail 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-cron 0.11.15
An orchestration platform for the development, production, and observation of data assets.44 versions - Latest release: almost 3 years ago - 6,905 stars on GitHub
dagster-graphql 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6 dependent packages - 6,905 stars on GitHub
dagster-datadog 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_dask 0.6.5
An orchestration platform for the development, production, and observation of data assets.1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-docker 1.0.17
An orchestration platform for the development, production, and observation of data assets.86 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-slack 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-twilio 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
r-uptasticsearch 0.4.0
An Elasticsearch client tailored to data science workflows.1 version - Latest release: about 4 years ago - 47 stars on GitHub
data-diff 0.2.8
Efficiently diff data in or across relational databases6 versions - Latest release: over 1 year ago - 1,766 stars on GitHub
pyjanitor 0.23.1
Clean APIs for data cleaning. Python implementation of R package Janitor51 versions - Latest release: over 1 year ago - 1 dependent package - 14 dependent repositories - 1,124 stars on GitHub
feast 0.26.0
Feature Store for Machine Learning1 version - Latest release: over 1 year ago - 4,088 stars on GitHub
pyspark-test 0.2.0
Testing library for pyspark, inspired from pandas testing module but for pyspark, to help users w...2 versions - Latest release: over 2 years ago - 14 stars on GitHub
Top 4.9% on conda-forge.org
133 versions - Latest release: over 1 year ago - 8 dependent packages - 41 dependent repositories - 11,520 stars on GitHub
prefect 2.6.7
Prefect is a workflow management system, designed for modern infrastructure and powered by the op...133 versions - Latest release: over 1 year ago - 8 dependent packages - 41 dependent repositories - 11,520 stars on GitHub
quilt 3.0.6
Quilt is infrastructure for data-driven teams to store, version, deploy and iterate on models and...2 versions - Latest release: over 4 years ago - 1,226 stars on GitHub
quilt3 5.0.0
Quilt is infrastructure for data-driven teams to store, version, deploy and iterate on models and...21 versions - Latest release: about 2 years ago - 4 dependent packages - 7 dependent repositories - 1,226 stars on GitHub
mage-ai 0.7.5
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and t...22 versions - Latest release: over 1 year ago - 3,631 stars on GitHub
great-expectations 0.15.32
Great Expectations helps teams save time and promote analytic integrity by offering a unique appr...144 versions - Latest release: over 1 year ago - 2 dependent packages - 1 dependent repositories - 8,121 stars on GitHub
superset 2.0.0
Apache Superset is a Data Visualization and Data Exploration Platform12 versions - Latest release: almost 2 years ago - 51,076 stars on GitHub
apache-airflow-providers-samba 4.0.0
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows7 versions - Latest release: almost 2 years ago - 33,057 stars on GitHub
sf-hamilton 1.11.0
A scalable general purpose micro-framework for defining dataflows. You can use it to build datafr...12 versions - Latest release: over 1 year ago - 688 stars on GitHub
ploomber 0.21.7
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️60 versions - Latest release: over 1 year ago - 2 dependent repositories - 3,017 stars on GitHub
ploomber-scaffold 0.3.1
The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️10 versions - Latest release: about 2 years ago - 1 dependent package - 3,021 stars on GitHub
gspread-pandas 2.2.3
A package to easily open an instance of a Google spreadsheet and interact with worksheets through...3 versions - Latest release: about 4 years ago - 357 stars on GitHub
awswrangler 2.17.0
An open-source Python package that extends the power of Pandas library to AWS connecting DataFram...81 versions - Latest release: over 1 year ago - 1 dependent repositories - 3,363 stars on GitHub
dagster-msteams 1.0.17
An orchestration platform for the development, production, and observation of data assets.56 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagit 1.0.17
An orchestration platform for the development, production, and observation of data assets.118 versions - Latest release: over 1 year ago - 2 dependent repositories - 6,905 stars on GitHub
dagster_pagerduty 0.6.4
An orchestration platform for the development, production, and observation of data assets.1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-ge 1.0.17
An orchestration platform for the development, production, and observation of data assets.103 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster-pagerduty 1.0.17
An orchestration platform for the development, production, and observation of data assets.114 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagster_spark 0.6.5
An orchestration platform for the development, production, and observation of data assets.2 versions - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster_gcp 0.6.5
An orchestration platform for the development, production, and observation of data assets.1 version - Latest release: over 4 years ago - 6,905 stars on GitHub
dagster-airflow 1.0.17
An orchestration platform for the development, production, and observation of data assets.110 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
dagstermill 1.0.17
An orchestration platform for the development, production, and observation of data assets.112 versions - Latest release: over 1 year ago - 6,905 stars on GitHub
Related Keywords
data-science
74
python
73
etl
67
mlops
66
workflow
65
orchestration
64
data-integration
63
data-pipelines
63
analytics
62
data-orchestrator
62
scheduler
62
workflow-automation
61
dagster
61
metadata
61
machine-learning
6
data
6
pandas
4
pipeline
3
data-quality
3
dataframe
3
pipelines
3
dag
2
sql
2
jupyter
2
spark
2
vscode
2
pycharm
2
papermill
2
notebooks
2
jupyter-notebooks
2
elt
2
automation
2
mysql
2
dataquality
2
serialization
2
parquet
2
data-analytics
2
data-analysis
2
data-versioning
2
data-version-control
2
apache
2
workflow-engine
2
etl-framework
1
etl-pipeline
1
data-platform
1
workflow-orchestration
1
apache-airflow
1
airflow
1
superset
1
sql-editor
1
react
1
flask
1
data-viz
1
data-visualization
1
business-intelligence
1
business-analytics
1
bi
1
asf
1
redshift
1
lambda
1
glue-catalog
1
emr
1
aws-lambda
1
aws-glue
1
aws
1
athena
1
apache-parquet
1
apache-arrow
1
amazon-sagemaker-notebook
1
amazon-athena
1
sheets
1
gspread
1
google-spreadsheets
1
google-sheets
1
google
1
dataframes
1
stitch-fix
1
software-engineering
1
numpy
1
lineage
1
hamiltonian
1
hamilton
1
featurization
1
feature-engineering
1
apache-superset
1
pyspark
1
ml
1
features
1
feature-store
1
big-data
1
pydata
1
hacktoberfest
1
cleaning-data
1
trino
1
snowflake
1
rdbms
1
postgresql
1
postgres
1
oracle-database
1
dataengineering
1