Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "dataengineering" keyword

aws-orbit-custom-cfn 1.4.0
Launch a CloudFormation stack for the team space
19 versions - Latest release: over 2 years ago - 1 dependent repositories - 149 downloads last month - 127 stars on GitHub - 4 maintainers
cz-data-diff 0.0.4
Command-line tool and Python library to efficiently diff rows across two different databases.
3 versions - Latest release: 4 months ago - 26 downloads last month - 2,657 stars on GitHub - 4 maintainers
aws-orbit-sm-operator 1.4.0
Launch a Pod for the team space that executes a script given by the user
11 versions - Latest release: over 2 years ago - 1 dependent repositories - 85 downloads last month - 127 stars on GitHub - 4 maintainers
aws-orbit-team-script-launcher 1.4.0
Launch a Pod for the team space that executes a script given by the user
19 versions - Latest release: over 2 years ago - 1 dependent repositories - 174 downloads last month - 127 stars on GitHub - 5 maintainers
aws-orbit-hello-world 1.4.0
Minimal Orbit Workbench Plugin.
19 versions - Latest release: over 2 years ago - 1 dependent repositories - 168 downloads last month - 127 stars on GitHub - 4 maintainers
aws-orbit-overprovisioning 1.4.0
Launch a Pod for the team space that executes a script given by the user
13 versions - Latest release: over 2 years ago - 1 dependent repositories - 105 downloads last month - 127 stars on GitHub - 4 maintainers
aws-orbit 1.4.0
Data & ML Unified Development and Production Environment.
19 versions - Latest release: over 2 years ago - 1 dependent repositories - 131 downloads last month - 127 stars on GitHub - 4 maintainers
aws-orbit-jupyterlab-orbit 1.4.0
AWS Orbit Workbench JupyterLab extension.
11 versions - Latest release: over 2 years ago - 1 dependent repositories - 89 downloads last month - 127 stars on GitHub - 4 maintainers
Top 7.3% on pypi.org
sqlmesh 0.91.2
Efficient data transformation and modeling framework that is backwards compatible with dbt.
332 versions - Latest release: 2 days ago - 1 dependent package - 1 dependent repositories - 21 thousand downloads last month - 1,222 stars on GitHub - 7 maintainers
aws-orbit-lustre 1.4.0
Add support for FSX Lustre for high-performance file system
13 versions - Latest release: over 2 years ago - 1 dependent repositories - 87 downloads last month - 127 stars on GitHub - 4 maintainers
aws-orbit-sdk 1.4.0
AWS Orbit Workbench SDK
19 versions - Latest release: over 2 years ago - 1 dependent repositories - 151 downloads last month - 127 stars on GitHub - 4 maintainers
aws-orbit-ray 1.4.0
Launch a Pod for the team space that executes a script given by the user
13 versions - Latest release: over 2 years ago - 1 dependent repositories - 52 downloads last month - 126 stars on GitHub - 5 maintainers
aws-orbit-voila 1.4.0
Launch a Pod for the team space that executes a script given by the user
11 versions - Latest release: over 2 years ago - 1 dependent repositories - 103 downloads last month - 127 stars on GitHub - 4 maintainers
aws-orbit-emr-on-eks 1.4.0
Allow users to run EMR jobs on their EKS namespace
18 versions - Latest release: over 2 years ago - 1 dependent repositories - 130 downloads last month - 127 stars on GitHub - 4 maintainers
aws-orbit-redshift 1.4.0
Orbit Workbench Redshift Plugin.
19 versions - Latest release: over 2 years ago - 1 dependent repositories - 148 downloads last month - 127 stars on GitHub - 5 maintainers
aws-orbit-code-commit 1.4.0
Orbit Workbench CodeCommit Plugin.
19 versions - Latest release: over 2 years ago - 1 dependent repositories - 131 downloads last month - 127 stars on GitHub - 5 maintainers
Top 9.4% on pypi.org
openmetadata-managed-apis 1.3.3.0
Airflow REST APIs to create and manage DAGS
120 versions - Latest release: 8 days ago - 3.79 thousand downloads last month - 4,124 stars on GitHub - 1 maintainer
openmetadata-sqlalchemy-bigquery 1.2.0
SQLAlchemy dialect for BigQuery by OpenMetadata
4 versions - Latest release: over 2 years ago - 1 dependent package - 1 dependent repositories - 28 downloads last month - 4,124 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
openmetadata-ingestion 0.10.1
Ingestion Framework for OpenMetadata
271 versions - Latest release: almost 2 years ago - 2 dependent packages - 2 dependent repositories - 24.7 thousand downloads last month - 4,124 stars on GitHub - 1 maintainer
idg-metadata-client 1.0.2.0
Ingestion Framework for OpenMetadata
1 version - Latest release: 10 months ago - 27 downloads last month - 3,365 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
openmetadata-airflow-managed-apis 0.10.1
Airflow REST APIs to create and manage DAGS
31 versions - Latest release: almost 2 years ago - 1 dependent repositories - 398 downloads last month - 3,365 stars on GitHub - 1 maintainer
Top 9.7% on pypi.org
openmetadata-ingestion-core 0.10.0
These are the generated Python classes from JSON Schema
12 versions - Latest release: about 2 years ago - 1 dependent package - 1 dependent repositories - 150 downloads last month - 3,365 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
grai-graph 0.2.5
24 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 208 downloads last month - 269 stars on GitHub - 4 maintainers
grai_source_mysql 0.1.1
19 versions - Latest release: 7 months ago - 220 downloads last month - 269 stars on GitHub - 4 maintainers
grai_source_dbt 0.3.5
42 versions - Latest release: 2 months ago - 308 downloads last month - 269 stars on GitHub - 2 maintainers
grai_source_redshift 0.1.1
18 versions - Latest release: 7 months ago - 182 downloads last month - 269 stars on GitHub - 4 maintainers
Top 9.6% on pypi.org
grai-source-postgres 0.2.4
30 versions - Latest release: 6 months ago - 1 dependent package - 1 dependent repositories - 295 downloads last month - 269 stars on GitHub - 4 maintainers
grai-cli 0.2.6
19 versions - Latest release: 7 months ago - 95 downloads last month - 269 stars on GitHub - 4 maintainers
grai_source_fivetran 0.1.2
18 versions - Latest release: 7 months ago - 174 downloads last month - 269 stars on GitHub - 4 maintainers
grai_source_snowflake 0.1.2
29 versions - Latest release: 8 months ago - 234 downloads last month - 269 stars on GitHub - 4 maintainers
grai_schemas 0.2.11
61 versions - Latest release: 5 months ago - 785 downloads last month - 269 stars on GitHub - 4 maintainers
grai_source_bigquery 0.2.4
29 versions - Latest release: 7 months ago - 208 downloads last month - 269 stars on GitHub - 4 maintainers
the-guide 0.1.36
1 version - Latest release: 9 months ago - 4 downloads last month - 269 stars on GitHub - 4 maintainers
grai_source_cube 0.0.2
4 versions - Latest release: about 2 months ago - 138 downloads last month - 269 stars on GitHub - 2 maintainers
grai_source_looker 0.0.3
11 versions - Latest release: 7 months ago - 151 downloads last month - 269 stars on GitHub - 4 maintainers
Top 6.1% on pypi.org
grai-client 0.3.5
61 versions - Latest release: 5 months ago - 18 dependent packages - 3 dependent repositories - 584 downloads last month - 269 stars on GitHub - 4 maintainers
grai_source_mssql 0.1.3
21 versions - Latest release: 2 months ago - 201 downloads last month - 269 stars on GitHub - 4 maintainers
grai-source-openlineage 0.1.0a1
1 version - Latest release: 7 months ago - 110 downloads last month - 269 stars on GitHub - 4 maintainers
grai_source_dbt_cloud 0.1.5
18 versions - Latest release: 2 months ago - 185 downloads last month - 269 stars on GitHub - 4 maintainers
grai_source_flat_file 0.2.2
18 versions - Latest release: 5 months ago - 183 downloads last month - 269 stars on GitHub - 4 maintainers
recohut 0.0.11
A python library for building recommender systems.
4 versions - Latest release: over 2 years ago - 1 dependent repositories - 27 downloads last month - 9 stars on GitHub - 1 maintainer
Top 7.3% on pypi.org
aws-ddk 0.6.2
AWS DataOps Development Kit - CLI
15 versions - Latest release: about 1 year ago - 3 dependent repositories - 412 downloads last month - 244 stars on GitHub - 8 maintainers
Top 7.0% on pypi.org
aws-ddk-core 1.4.0
The AWS DataOps Development Kit is an open source development framework for customers that build ...
22 versions - Latest release: 4 months ago - 3 dependent repositories - 2.11 thousand downloads last month - 244 stars on GitHub - 9 maintainers
rawbuilder 0.0.7
an elegant datasets factory
7 versions - Latest release: about 2 years ago - 1 dependent repositories - 36 downloads last month - 7 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
zingg 0.4.0
Zingg Entity Resolution, Data Mastering and Deduplication
3 versions - Latest release: 4 months ago - 1 dependent repositories - 1.05 thousand downloads last month - 877 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
data-diff 0.11.1
Command-line tool and Python library to efficiently diff rows across two different databases.
74 versions - Latest release: 2 months ago - 1 dependent package - 2 dependent repositories - 56.8 thousand downloads last month - 2,657 stars on GitHub - 10 maintainers
metadata-guardian 0.2.7
MetadataGuardian is used to protect data by searching the source metadata.
12 versions - Latest release: over 1 year ago - 1 dependent repositories - 50 downloads last month - 20 stars on GitHub - 1 maintainer
livyc 0.0.14 💰
Apache Livy Client
11 versions - Latest release: almost 2 years ago - 15 downloads last month - 3 stars on GitHub - 2 maintainers
dftools-snowflake 0.1.1
Data Flooder Tools - Snowflake Package
7 versions - Latest release: 4 months ago - 33 downloads last month - 2 maintainers
dftools-core 0.1.1
Data Flooder Tools - Core Package
18 versions - Latest release: 4 months ago - 1 dependent package - 97 downloads last month - 2 maintainers
data-diff-customize 1.0.3
Command-line tool and Python library to efficiently diff rows across two different databases.
5 versions - Latest release: 6 months ago - 52 downloads last month - 2,657 stars on GitHub - 2 maintainers
camelcasing 0.1.3
Converts PascalCase or snake_case strings to camelCase.
13 versions - Latest release: about 1 year ago - 2 thousand downloads last month - 2 stars on GitHub - 1 maintainer
kedro-static-viz 0.4.4 💰
Creates a static visualization of your pipeline
12 versions - Latest release: about 3 years ago - 1 dependent repositories - 30 downloads last month - 27 stars on GitHub - 2 maintainers
etlworkers 0.0.6
A Data Engineering package
6 versions - Latest release: over 2 years ago - 1 dependent repositories - 3 downloads last month - 2 stars on GitHub - 2 maintainers
prodmodel 0.4.3
Build data science pipelines and models
25 versions - Latest release: almost 5 years ago - 1 dependent repositories - 143 downloads last month - 56 stars on GitHub - 2 maintainers
zenopy 2022.10.29
zenopy: A Python wrapper package for Zenodo API
3 versions - Latest release: over 1 year ago - 1 dependent repositories - 16 downloads last month - 5 stars on GitHub - 1 maintainer
pandas-aws 0.1.6
AWS helpers for data engineers and data scientists. Easily interacts with AWS from and to pandas....
7 versions - Latest release: about 3 years ago - 1 dependent repositories - 32 downloads last month - 1 stars on GitHub - 2 maintainers
athenasql 0.1.0a13
SQL builder for AWS Athena, inspired by sparkSQL
12 versions - Latest release: about 2 months ago - 7.76 thousand downloads last month - 4 stars on GitHub - 2 maintainers
sparkdataset 1.0.0
Provides instant access to many popular datasets right from Pyspark (in dataframe structure).
1 version - Latest release: over 2 years ago - 1 dependent repositories - 8 downloads last month - 34 stars on GitHub - 2 maintainers
sysxtract 1.0.0
Extract logs based off events from sysmon. Comes as a package, cli and ui.
1 version - Latest release: almost 4 years ago - 1 dependent repositories - 8 downloads last month - 3 stars on GitHub - 2 maintainers
normandy 0.2.3
A data pipeline framework.
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 10 downloads last month - 0 stars on GitHub - 2 maintainers