Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "data engineering" keyword
aiscalator 0.1.18
AIscalate your Jupyter Notebook Prototypes into Airflow Data Products22 versions - Latest release: almost 4 years ago - 277 downloads last month - 5 stars on GitHub - 1 maintainer
covsirphy 3.1.1 💰
COVID-19 data analysis with phase-dependent SIR-derived ODE models59 versions - Latest release: 3 months ago - 1 dependent repositories - 354 downloads last month - 101 stars on GitHub - 2 maintainers
dbt_coves 1.7.8
CLI tool for dbt users adopting analytics engineering best practices.198 versions - Latest release: 14 days ago - 5.88 thousand downloads last month - 204 stars on GitHub - 1 maintainer
Top 2.5% on pypi.org
71 versions - Latest release: 26 days ago - 4 dependent packages - 131 dependent repositories - 89.5 thousand downloads last month - 635 stars on GitHub - 1 maintainer
kedro-viz 9.0.0
Kedro-Viz helps visualise Kedro data and analytics pipelines71 versions - Latest release: 26 days ago - 4 dependent packages - 131 dependent repositories - 89.5 thousand downloads last month - 635 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
50 versions - Latest release: 21 days ago - 35 dependent packages - 402 dependent repositories - 504 thousand downloads last month - 9,337 stars on GitHub - 1 maintainer
kedro 0.19.5
Kedro helps you build production-ready data and analytics pipelines50 versions - Latest release: 21 days ago - 35 dependent packages - 402 dependent repositories - 504 thousand downloads last month - 9,337 stars on GitHub - 1 maintainer
deepcoreml 0.3.4
A collection of Machine Learning techniques for data management, engineering and augmentation.9 versions - Latest release: 3 days ago - 44 downloads last month - 0 stars on GitHub - 1 maintainer
goodcrap 0.2.5
goodcrap creates tables, databases and csv files and fill them with random data8 versions - Latest release: about 1 month ago - 151 downloads last month - 1 stars on GitHub - 2 maintainers
datajob 0.11.0
Build and deploy a serverless data pipeline with no effort on AWS.13 versions - Latest release: over 1 year ago - 1 dependent repositories - 116 downloads last month - 106 stars on GitHub - 2 maintainers
dhcdatacleaner 0.1.0
DHC Python tool that automatically cleans data sets and readies them for analysis.1 version - Latest release: about 6 years ago - 1 dependent repositories - 8 downloads last month - 1 maintainer
Top 4.4% on pypi.org
32 versions - Latest release: 26 days ago - 3 dependent packages - 21 dependent repositories - 14.7 thousand downloads last month - 187 stars on GitHub - 1 maintainer
kedro-mlflow 0.12.2
A kedro-plugin to use mlflow in your kedro projects32 versions - Latest release: 26 days ago - 3 dependent packages - 21 dependent repositories - 14.7 thousand downloads last month - 187 stars on GitHub - 1 maintainer
aws-json-dataset 0.1.0
Send JSON datasets to various AWS services.1 version - Latest release: 3 months ago - 17 downloads last month - 0 stars on GitHub - 1 maintainer
plug-email-chase 1.0.1
Plug - Email Chase Pipeline1 version - Latest release: almost 3 years ago - 1 dependent repositories - 8 downloads last month - 1 maintainer
kedro-graphql 0.4.0
A kedro plugin for serving any kedro project as a GraphQL api13 versions - Latest release: 6 months ago - 103 downloads last month - 5 stars on GitHub - 2 maintainers
datasaurus 0.0.2.dev4
Data Engineering framework based on Polars.rs5 versions - Latest release: 5 months ago - 48 downloads last month - 14 stars on GitHub - 2 maintainers
dsutils-ms 1.10
My Data Science Utils11 versions - Latest release: 10 months ago - 321 downloads last month - 2 maintainers
kedro-diff 0.1.1 💰
diff commits to your kedro pipeline2 versions - Latest release: almost 3 years ago - 1 dependent repositories - 11 downloads last month - 10 stars on GitHub - 2 maintainers
dtflw 0.6.7
dtflw is a Python framework for building modular data pipelines based on Databricks dbutils.noteb...7 versions - Latest release: 7 months ago - 1.97 thousand downloads last month - 4 stars on GitHub - 4 maintainers
dataset-shuffler 0.1.1
Data engineering tool for learning-based computer vision.1 version - Latest release: over 1 year ago - 20 downloads last month - 24 stars on GitHub - 2 maintainers
dhcstat 0.1.1
DHC Python tool that automatically analysis.1 version - Latest release: almost 6 years ago - 1 dependent repositories - 23 downloads last month - 2 maintainers
sheetwork 1.0.7 💰
A handy CLI tool to ingest GoogleSheets into your database without writing a single line of code27 versions - Latest release: over 3 years ago - 1 dependent repositories - 105 downloads last month - 16 stars on GitHub - 2 maintainers
dbt-sugar 0.2.0 💰
A sweet CLI tool to help dbt users enforce documentation and testing on their dbt projects.17 versions - Latest release: over 2 years ago - 1 dependent repositories - 150 downloads last month - 149 stars on GitHub - 2 maintainers
mjooln 0.7.0
63 versions - Latest release: over 1 year ago - 1 dependent repositories - 203 downloads last month - 0 stars on GitLab.com - 2 maintainersdata-science-kit 0.0.1
Data Science Basic Functions1 version - Latest release: almost 3 years ago - 1 dependent repositories - 16 downloads last month - 1 stars on GitHub - 1 maintainer
Top 3.1% on pypi.org
13 versions - Latest release: about 7 years ago - 8 dependent packages - 51 dependent repositories - 4.77 thousand downloads last month - 394 stars on GitHub - 4 maintainers
skrebate 0.3.4
Relief-based feature selection algorithms13 versions - Latest release: about 7 years ago - 8 dependent packages - 51 dependent repositories - 4.77 thousand downloads last month - 394 stars on GitHub - 4 maintainers
tdprepview 1.4.1
Python Package that creates Data Preparation Pipeline in Teradata-SQL in Views16 versions - Latest release: 20 days ago - 275 downloads last month - 2 maintainers
kedro-boot 0.2.2
A kedro plugin that streamlines the integration between Kedro projects and external applications,...5 versions - Latest release: 22 days ago - 112 downloads last month - 19 stars on GitHub - 2 maintainers
atalert 0.1.10
Atalert slack alerting service helper module10 versions - Latest release: over 1 year ago - 75 downloads last month - 2 stars on GitHub - 2 maintainers
dcw 0.0.11
Data Collection and Wrangling6 versions - Latest release: 4 months ago - 35 downloads last month - 0 stars on GitHub - 2 maintainers
spooq 3.4.0
Spooq is a PySpark based helper library for ETL data ingestion pipeline in Data Lakes.11 versions - Latest release: about 2 months ago - 1 dependent repositories - 19.3 thousand downloads last month - 8 stars on GitHub - 2 maintainers
pycurie 0.1.16
16 versions - Latest release: 7 months ago - 14 downloads last month - 1 stars on GitHub - 2 maintainers
Top 7.4% on pypi.org
1 version - Latest release: over 2 years ago - 1 dependent repositories - 3.98 thousand downloads last month - 9,337 stars on GitHub - 2 maintainers
xedro 0.17.6
Kedro helps you build production-ready data and analytics pipelines1 version - Latest release: over 2 years ago - 1 dependent repositories - 3.98 thousand downloads last month - 9,337 stars on GitHub - 2 maintainers
Top 8.6% on pypi.org
48 versions - Latest release: 6 months ago - 3 dependent repositories - 121 downloads last month - 218 stars on GitHub - 2 maintainers
pipelinex 0.7.9
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more48 versions - Latest release: 6 months ago - 3 dependent repositories - 121 downloads last month - 218 stars on GitHub - 2 maintainers
kedex 0.1.0
Kedro extension for rapid prototyping and experimentation6 versions - Latest release: over 4 years ago - 1 dependent repositories - 5 downloads last month - 0 stars on GitHub - 1 maintainer
fahr 0.0.1
Tool for running remote machine learning jobs remotely.1 version - Latest release: about 5 years ago - 1 dependent repositories - 8 downloads last month - 4 stars on GitHub - 2 maintainers
parallelfileconcatenator 0.1 removed
ParallelFileConcatenator is a robust tool designed to efficiently combine data files of various f...1 version - Latest release: 9 months ago
Related Keywords
data science
21
machine learning
18
python
10
data pipelines
10
pipelines
10
data-engineering
8
machine-learning
6
data
6
data-science
6
etl
6
pipeline
5
mlops
5
data analysis
5
kedro
5
kedro-plugin
4
hacktoberfest
3
experiment-tracking
3
data pipeline
3
framework
3
machine-learning-engineering
2
sql
2
snowflake
2
data management
2
deep learning
2
aws
2
data cleaning
2
csv
2
preprocessing
2
kedro-hook
2
json
2
dataset
2
streaming
2
databricks
2
etl-pipeline
2
cli
2
database
2
big-data
2
data modelling
2
ETL
2
dbt
2
big data
2
data wrangling
2
data preparation
1
md5 checksum
1
data merging
1
data lake
1
microservice
1
data kit
1
data science kit
1
data aggregation
1
parallel file concatenation
1
science
1
science kit
1
data engineer
1
exploratory data analysis
1
model-training
1
aws-sagemaker
1
eda
1
measurement units
1
file processing
1
google-sheets
1
dbt-sugar
1
documentation
1
path
1
file
1
folder
1
file handling
1
encryption
1
file management
1
data compression
1
compression
1
aes
1
gzip
1
data deduplication
1
uuid
1
utc
1
extract
1
tdprepview
1
teradata
1
data apps
1
model serving
1
batch
1
data ingestion
1
dataops
1
hadoop
1
alert
1
slack webhook
1
cloudera
1
hive
1
spark
1
spooq
1
data quality
1
data pipeline monitoring
1
experimentation
1
data operations
1
data preprocessing
1
deep-learning
1
data-analysis
1
devops
1
transform
1