Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data engineering" keyword

aiscalator 0.1.18
AIscalate your Jupyter Notebook Prototypes into Airflow Data Products
22 versions - Latest release: almost 4 years ago - 277 downloads last month - 5 stars on GitHub - 1 maintainer
covsirphy 3.1.1 💰
COVID-19 data analysis with phase-dependent SIR-derived ODE models
59 versions - Latest release: 3 months ago - 1 dependent repositories - 354 downloads last month - 101 stars on GitHub - 2 maintainers
dbt_coves 1.7.8
CLI tool for dbt users adopting analytics engineering best practices.
198 versions - Latest release: 14 days ago - 5.88 thousand downloads last month - 204 stars on GitHub - 1 maintainer
Top 2.5% on pypi.org
kedro-viz 9.0.0
Kedro-Viz helps visualise Kedro data and analytics pipelines
71 versions - Latest release: 26 days ago - 4 dependent packages - 131 dependent repositories - 89.5 thousand downloads last month - 635 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
kedro 0.19.5
Kedro helps you build production-ready data and analytics pipelines
50 versions - Latest release: 21 days ago - 35 dependent packages - 402 dependent repositories - 504 thousand downloads last month - 9,337 stars on GitHub - 1 maintainer
deepcoreml 0.3.4
A collection of Machine Learning techniques for data management, engineering and augmentation.
9 versions - Latest release: 3 days ago - 44 downloads last month - 0 stars on GitHub - 1 maintainer
goodcrap 0.2.5
goodcrap creates tables, databases and csv files and fill them with random data
8 versions - Latest release: about 1 month ago - 151 downloads last month - 1 stars on GitHub - 2 maintainers
datajob 0.11.0
Build and deploy a serverless data pipeline with no effort on AWS.
13 versions - Latest release: over 1 year ago - 1 dependent repositories - 116 downloads last month - 106 stars on GitHub - 2 maintainers
dhcdatacleaner 0.1.0
DHC Python tool that automatically cleans data sets and readies them for analysis.
1 version - Latest release: about 6 years ago - 1 dependent repositories - 8 downloads last month - 1 maintainer
Top 4.4% on pypi.org
kedro-mlflow 0.12.2
A kedro-plugin to use mlflow in your kedro projects
32 versions - Latest release: 26 days ago - 3 dependent packages - 21 dependent repositories - 14.7 thousand downloads last month - 187 stars on GitHub - 1 maintainer
aws-json-dataset 0.1.0
Send JSON datasets to various AWS services.
1 version - Latest release: 3 months ago - 17 downloads last month - 0 stars on GitHub - 1 maintainer
plug-email-chase 1.0.1
Plug - Email Chase Pipeline
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 8 downloads last month - 1 maintainer
kedro-graphql 0.4.0
A kedro plugin for serving any kedro project as a GraphQL api
13 versions - Latest release: 6 months ago - 103 downloads last month - 5 stars on GitHub - 2 maintainers
datasaurus 0.0.2.dev4
Data Engineering framework based on Polars.rs
5 versions - Latest release: 5 months ago - 48 downloads last month - 14 stars on GitHub - 2 maintainers
dsutils-ms 1.10
My Data Science Utils
11 versions - Latest release: 10 months ago - 321 downloads last month - 2 maintainers
kedro-diff 0.1.1 💰
diff commits to your kedro pipeline
2 versions - Latest release: almost 3 years ago - 1 dependent repositories - 11 downloads last month - 10 stars on GitHub - 2 maintainers
dtflw 0.6.7
dtflw is a Python framework for building modular data pipelines based on Databricks dbutils.noteb...
7 versions - Latest release: 7 months ago - 1.97 thousand downloads last month - 4 stars on GitHub - 4 maintainers
dataset-shuffler 0.1.1
Data engineering tool for learning-based computer vision.
1 version - Latest release: over 1 year ago - 20 downloads last month - 24 stars on GitHub - 2 maintainers
dhcstat 0.1.1
DHC Python tool that automatically analysis.
1 version - Latest release: almost 6 years ago - 1 dependent repositories - 23 downloads last month - 2 maintainers
sheetwork 1.0.7 💰
A handy CLI tool to ingest GoogleSheets into your database without writing a single line of code
27 versions - Latest release: over 3 years ago - 1 dependent repositories - 105 downloads last month - 16 stars on GitHub - 2 maintainers
dbt-sugar 0.2.0 💰
A sweet CLI tool to help dbt users enforce documentation and testing on their dbt projects.
17 versions - Latest release: over 2 years ago - 1 dependent repositories - 150 downloads last month - 149 stars on GitHub - 2 maintainers
mjooln 0.7.0
63 versions - Latest release: over 1 year ago - 1 dependent repositories - 203 downloads last month - 0 stars on GitLab.com - 2 maintainers
data-science-kit 0.0.1
Data Science Basic Functions
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 16 downloads last month - 1 stars on GitHub - 1 maintainer
Top 3.1% on pypi.org
skrebate 0.3.4
Relief-based feature selection algorithms
13 versions - Latest release: about 7 years ago - 8 dependent packages - 51 dependent repositories - 4.77 thousand downloads last month - 394 stars on GitHub - 4 maintainers
tdprepview 1.4.1
Python Package that creates Data Preparation Pipeline in Teradata-SQL in Views
16 versions - Latest release: 20 days ago - 275 downloads last month - 2 maintainers
kedro-boot 0.2.2
A kedro plugin that streamlines the integration between Kedro projects and external applications,...
5 versions - Latest release: 22 days ago - 112 downloads last month - 19 stars on GitHub - 2 maintainers
atalert 0.1.10
Atalert slack alerting service helper module
10 versions - Latest release: over 1 year ago - 75 downloads last month - 2 stars on GitHub - 2 maintainers
dcw 0.0.11
Data Collection and Wrangling
6 versions - Latest release: 4 months ago - 35 downloads last month - 0 stars on GitHub - 2 maintainers
spooq 3.4.0
Spooq is a PySpark based helper library for ETL data ingestion pipeline in Data Lakes.
11 versions - Latest release: about 2 months ago - 1 dependent repositories - 19.3 thousand downloads last month - 8 stars on GitHub - 2 maintainers
pycurie 0.1.16
16 versions - Latest release: 7 months ago - 14 downloads last month - 1 stars on GitHub - 2 maintainers
Top 7.4% on pypi.org
xedro 0.17.6
Kedro helps you build production-ready data and analytics pipelines
1 version - Latest release: over 2 years ago - 1 dependent repositories - 3.98 thousand downloads last month - 9,337 stars on GitHub - 2 maintainers
Top 8.6% on pypi.org
pipelinex 0.7.9
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
48 versions - Latest release: 6 months ago - 3 dependent repositories - 121 downloads last month - 218 stars on GitHub - 2 maintainers
kedex 0.1.0
Kedro extension for rapid prototyping and experimentation
6 versions - Latest release: over 4 years ago - 1 dependent repositories - 5 downloads last month - 0 stars on GitHub - 1 maintainer
fahr 0.0.1
Tool for running remote machine learning jobs remotely.
1 version - Latest release: about 5 years ago - 1 dependent repositories - 8 downloads last month - 4 stars on GitHub - 2 maintainers
parallelfileconcatenator 0.1 removed
ParallelFileConcatenator is a robust tool designed to efficiently combine data files of various f...
1 version - Latest release: 9 months ago