Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "pipeline" keyword

Top 0.7% on pypi.org
jina 3.23.3
Multimodal AI services & pipelines with cloud-native stack: gRPC, Kubernetes, Docker, OpenTelemet...
2,450 versions - Latest release: 3 months ago - 18 dependent packages - 687 dependent repositories - 139 thousand downloads last month - 19,512 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
coconut-develop 3.1.0.post0.dev11 💰
Simple, elegant, Pythonic functional programming.
655 versions - Latest release: 22 days ago - 2 dependent repositories - 3.58 thousand downloads last month - 3,951 stars on GitHub - 2 maintainers
Top 9.1% on pypi.org
quickstart-vdk 0.2.1287193961
Versatile Data Kit SDK packaging containing common plugins to get started quickly using it.
614 versions - Latest release: 6 days ago - 1 dependent repositories - 6 thousand downloads last month - 409 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
great-expectations-experimental 0.1.20240513031
Always know what to expect from your data.
502 versions - Latest release: 4 days ago - 1 dependent package - 1 dependent repositories - 255 thousand downloads last month - 9,129 stars on GitHub - 4 maintainers
Top 2.6% on pypi.org
toil 6.1.0
Pipeline management software for clusters.
450 versions - Latest release: 3 months ago - 4 dependent packages - 37 dependent repositories - 7.64 thousand downloads last month - 857 stars on GitHub - 4 maintainers
Top 6.3% on pypi.org
km3pipe 9.13.11
"An analysis framework for KM3NeT"
420 versions - Latest release: 9 months ago - 1 dependent package - 5 dependent repositories - 1.28 thousand downloads last month - 3 maintainers
naas-drivers 0.121.8 💰
Drivers made to easy connect to any services
419 versions - Latest release: 18 days ago - 1 dependent repositories - 1.6 thousand downloads last month - 53 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
airbyte-cdk 0.90.0
A framework for writing Airbyte Connectors.
410 versions - Latest release: about 24 hours ago - 266 dependent packages - 234 dependent repositories - 113 thousand downloads last month - 12,703 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
mage-ai 0.9.70
Mage is a tool for building and deploying data pipelines.
333 versions - Latest release: 21 days ago - 2 dependent repositories - 46.2 thousand downloads last month - 6,940 stars on GitHub - 2 maintainers
Top 0.4% on pypi.org
prefect 2.18.3
Workflow orchestration and management.
276 versions - Latest release: 15 days ago - 123 dependent packages - 767 dependent repositories - 1.2 million downloads last month - 13,720 stars on GitHub - 3 maintainers
Top 0.7% on pypi.org
great-expectations 0.18.13
Always know what to expect from your data.
265 versions - Latest release: 18 days ago - 58 dependent packages - 284 dependent repositories - 19.9 million downloads last month - 9,420 stars on GitHub - 8 maintainers
Top 3.6% on pypi.org
vdk-core 0.3.1284553848
Versatile Data Kit SDK Core
210 versions - Latest release: 9 days ago - 35 dependent packages - 9 dependent repositories - 6.01 thousand downloads last month - 409 stars on GitHub - 1 maintainer
weaverbird 0.44.2
A visual data pipeline builder with various backends
169 versions - Latest release: 3 days ago - 1 dependent repositories - 2.01 thousand downloads last month - 1 maintainer
kts 0.4.0
A framework for fast and interactive conducting machine learning experiments on tabular data
146 versions - Latest release: about 4 years ago - 1 dependent repositories - 929 downloads last month - 17 stars on GitHub - 1 maintainer
Top 2.7% on pypi.org
zenml 0.57.1
ZenML: Write production-ready ML code.
138 versions - Latest release: 4 days ago - 2 dependent packages - 44 dependent repositories - 20 thousand downloads last month - 3,632 stars on GitHub - 1 maintainer
Top 1.3% on pypi.org
django-pipeline 3.1.0 💰
Pipeline is an asset packaging library for Django.
133 versions - Latest release: 7 days ago - 9 dependent packages - 957 dependent repositories - 164 thousand downloads last month - 1,494 stars on GitHub - 3 maintainers
marimo 0.6.0
A library for making reactive notebooks and apps
132 versions - Latest release: about 24 hours ago - 3 dependent packages - 1 dependent repositories - 31 thousand downloads last month - 3,902 stars on GitHub - 1 maintainer
Top 0.7% on pypi.org
kfp 2.7.0
Kubeflow Pipelines SDK
132 versions - Latest release: 3 months ago - 48 dependent packages - 714 dependent repositories - 5.76 million downloads last month - 3,360 stars on GitHub - 2 maintainers
libreflow 2.4.3
An example flow for kabaret
120 versions - Latest release: 1 day ago - 1 dependent package - 3 dependent repositories - 609 downloads last month - 2 maintainers
jenkins-epo 1.160
Leverage Jenkins features for GitHub repositories.
113 versions - Latest release: about 7 years ago - 1 dependent repositories - 720 downloads last month - 6 stars on GitHub - 5 maintainers
dearwatson 0.10.4
Visual Vetting and Analysis of Transits from Space ObservatioNs
105 versions - Latest release: about 18 hours ago - 2 dependent packages - 1 dependent repositories - 1.03 thousand downloads last month - 2 stars on GitHub - 1 maintainer
cpg-workflows 1.24.4
CPG workflows for Hail Batch
103 versions - Latest release: 1 day ago - 1 dependent repositories - 4.32 thousand downloads last month - 0 stars on GitHub - 2 maintainers
Top 5.8% on pypi.org
pipen 0.14.6
A pipeline framework for python
97 versions - Latest release: about 1 month ago - 18 dependent packages - 9 dependent repositories - 1.27 thousand downloads last month - 100 stars on GitHub - 1 maintainer
Top 2.0% on pypi.org
google-cloud-pipeline-components 2.14.1
This SDK enables a set of First Party (Google owned) pipeline components that allow users to take...
96 versions - Latest release: about 5 hours ago - 2 dependent packages - 28 dependent repositories - 842 thousand downloads last month - 3,457 stars on GitHub - 1 maintainer
ekorpkit 0.1.40
eKorpkit provides a flexible interface for NLP and ML research pipelines such as extraction, tran...
94 versions - Latest release: over 1 year ago - 1 dependent repositories - 26 downloads last month - 5 stars on GitHub - 1 maintainer
Top 3.1% on pypi.org
fluids 1.0.25
Fluid dynamics component of Chemical Engineering Design Library (ChEDL)
94 versions - Latest release: 8 months ago - 7 dependent packages - 34 dependent repositories - 17.9 thousand downloads last month - 335 stars on GitHub - 1 maintainer
Top 6.7% on pypi.org
pdpipe 0.3.2
Easy pipelines for pandas.
86 versions - Latest release: over 1 year ago - 1 dependent package - 12 dependent repositories - 2.44 thousand downloads last month - 715 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
pypyr 5.9.1
task-runner for automation pipelines defined in yaml. cli & api.
86 versions - Latest release: 8 months ago - 3 dependent packages - 12 dependent repositories - 2.36 thousand downloads last month - 567 stars on GitHub - 1 maintainer
Top 9.8% on pypi.org
vdk-jupyterlab-extension 0.1.1230061130
A Jupyterlab extension for using VDK
83 versions - Latest release: about 2 months ago - 1 dependent repositories - 804 downloads last month - 409 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
vdk-control-cli 1.3.1220258461
VDK Control CLI allows user to create, delete, manage and their Data Jobs in Kubernetes runtime.
82 versions - Latest release: about 2 months ago - 5 dependent packages - 5 dependent repositories - 3.64 thousand downloads last month - 408 stars on GitHub - 1 maintainer
ndmg 0.2.1
Neuro Data MRI to Graphs Pipeline
79 versions - Latest release: over 4 years ago - 2 dependent repositories - 265 downloads last month - 59 stars on GitHub - 2 maintainers
Top 1.1% on pypi.org
kfp-server-api 2.2.0
Kubeflow Pipelines API
79 versions - Latest release: 17 days ago - 6 dependent packages - 235 dependent repositories - 3.2 million downloads last month - 3,378 stars on GitHub - 2 maintainers
twyn 2.6.39
Security tool against dependency typosquatting attacks
76 versions - Latest release: 5 days ago - 1.08 thousand downloads last month - 8 stars on GitHub - 1 maintainer
Top 0.8% on pypi.org
papermill 2.6.0 💰
Parameterize and run Jupyter and nteract Notebooks
75 versions - Latest release: 21 days ago - 129 dependent packages - 1,105 dependent repositories - 1.89 million downloads last month - 5,615 stars on GitHub - 4 maintainers
Top 10.0% on pypi.org
ascend-io-sdk 0.2.64
The Ascend.io SDK for Python
72 versions - Latest release: 2 months ago - 2 dependent packages - 1 dependent repositories - 5.12 thousand downloads last month - 11 maintainers
stdflow 0.0.73
Data flow tool that transform your notebooks and python files into pipeline steps by standardizin...
72 versions - Latest release: 7 months ago - 557 downloads last month - 1 stars on GitHub - 1 maintainer
sematic 0.38.0
An open-source ML pipeline development platform
71 versions - Latest release: about 1 month ago - 3.63 thousand downloads last month - 941 stars on GitHub - 5 maintainers
mercury-toolkit 0.9.86
Mercury: a framework for fluid ETL and data management
70 versions - Latest release: 11 months ago - 4 dependent repositories - 353 downloads last month - 4 stars on GitHub - 1 maintainer
Top 9.8% on pypi.org
gitlabci-local 9.1.0
Launch .gitlab-ci.yml jobs locally
69 versions - Latest release: 5 months ago - 3 dependent repositories - 1.73 thousand downloads last month - 54 stars on GitLab.com - 1 maintainer
Top 9.8% on pypi.org
vdk-impala 0.4.1245476944
Versatile Data Kit SDK plugin provides support for Impala database.
67 versions - Latest release: about 1 month ago - 1 dependent repositories - 383 downloads last month - 409 stars on GitHub - 1 maintainer
ph 1.1.5
ph - the tabular data shell tool
65 versions - Latest release: about 1 year ago - 3 dependent repositories - 274 downloads last month - 14 stars on GitHub - 1 maintainer
smashed 0.21.5
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields ext...
64 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repositories - 11.5 thousand downloads last month - 30 stars on GitHub - 2 maintainers
dawgie 1.4.2
Data and Algorithm Work-flow Generation, Introspection, and Execution (DAWGIE)
63 versions - Latest release: 4 months ago - 1 dependent repositories - 468 downloads last month - 2 stars on GitHub - 1 maintainer
unipipeline 1.9.4
simple way to build the declarative and distributed data pipelines with python. it supports rabbi...
62 versions - Latest release: 8 months ago - 5 dependent packages - 1 dependent repositories - 778 downloads last month - 0 stars on GitHub - 1 maintainer
pynb-dag-runner-snapshot 0.0.9.dev1672416330
Python library to run ML/data pipelines on stateless compute infrastructure (that may be ephemera...
62 versions - Latest release: over 1 year ago - 1 dependent repositories - 95 downloads last month - 17 stars on GitHub - 1 maintainer
plumbing 2.11.2
Helps with plumbing-type programing in python.
61 versions - Latest release: about 2 years ago - 3 dependent packages - 11 dependent repositories - 383 downloads last month - 5 stars on GitHub - 1 maintainer
hydra-genetics 2.0.0
Helper tools for use with hydra-genetics pipelines.
60 versions - Latest release: 23 days ago - 23 dependent repositories - 2.11 thousand downloads last month - 3 stars on GitHub - 1 maintainer
Top 8.6% on pypi.org
bodywork 3.0.12
ML pipeline orchestration and model deployments on Kubernetes, made really easy.
60 versions - Latest release: almost 2 years ago - 6 dependent repositories - 367 downloads last month - 431 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
galaxy-tool-util 24.0.2
Galaxy tool and tool dependency utilities
60 versions - Latest release: 11 days ago - 16 dependent packages - 43 dependent repositories - 19.6 thousand downloads last month - 1,315 stars on GitHub - 3 maintainers
Top 7.6% on pypi.org
law 0.1.18
Build large-scale task workflows using luigi, remote job submission, remote targets, and environment
59 versions - Latest release: 3 months ago - 18 dependent repositories - 583 downloads last month - 86 stars on GitHub - 1 maintainer
plynx 1.11.1
ML platform
57 versions - Latest release: 12 months ago - 1 dependent repositories - 454 downloads last month - 289 stars on GitHub - 1 maintainer
cemba-data 1.6.9
Pipelines for single nucleus methylome and multi-omic dataset.
55 versions - Latest release: about 1 year ago - 1 dependent repositories - 151 downloads last month - 14 stars on GitHub - 1 maintainer
graphtik 10.5.0
A Python lib for solving & executing graphs of functions, with `pandas` in mind
54 versions - Latest release: about 1 year ago - 1 dependent repositories - 228 downloads last month - 22 stars on GitHub - 1 maintainer
pipen-annotate 0.13.1
Use docstring to annotate pipen processes
53 versions - Latest release: 3 months ago - 5 dependent packages - 1 dependent repositories - 529 downloads last month - 100 stars on GitHub - 1 maintainer
Top 8.0% on pypi.org
vdk-trino 0.4.1290280218
Versatile Data Kit SDK plugin provides support for trino database and trino transformation templa...
51 versions - Latest release: 4 days ago - 11 dependent repositories - 456 downloads last month - 409 stars on GitHub - 1 maintainer
Top 1.0% on pypi.org
kedro 0.19.5
Kedro helps you build production-ready data and analytics pipelines
50 versions - Latest release: 26 days ago - 39 dependent packages - 402 dependent repositories - 501 thousand downloads last month - 9,337 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
doit 0.36.0 💰
doit - Automation Tool
48 versions - Latest release: about 2 years ago - 39 dependent packages - 477 dependent repositories - 801 thousand downloads last month - 1,747 stars on GitHub - 1 maintainer
Top 4.3% on pypi.org
nf-core 2.14.1
Helper tools for use with nf-core Nextflow pipelines.
48 versions - Latest release: 9 days ago - 1 dependent package - 4 dependent repositories - 9.42 thousand downloads last month - 218 stars on GitHub - 4 maintainers
Top 8.6% on pypi.org
pipelinex 0.7.9
PipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
48 versions - Latest release: 6 months ago - 3 dependent repositories - 121 downloads last month - 218 stars on GitHub - 1 maintainer
Top 9.3% on pypi.org
pyppl 3.2.2
A Python PiPeLine framework
47 versions - Latest release: almost 4 years ago - 4 dependent repositories - 409 downloads last month - 99 stars on GitHub - 1 maintainer
tsdat 0.7.7
A data processing framework used to convert time series data into standardized format.
47 versions - Latest release: 3 months ago - 1 dependent package - 3 dependent repositories - 730 downloads last month - 11 stars on GitHub - 2 maintainers
Top 5.7% on pypi.org
paddle-serving-server-gpu 0.3.1
Paddle Serving Package for saved model with PaddlePaddle
47 versions - Latest release: almost 4 years ago - 31 dependent repositories - 542 downloads last month - 877 stars on GitHub - 1 maintainer
pipe21 1.23.0
simple functional pipes
46 versions - Latest release: about 1 month ago - 4 dependent packages - 378 downloads last month - 13 stars on GitHub - 1 maintainer
fondant 1.0.0
Fondant - Large-scale data processing made easy and reusable
45 versions - Latest release: 4 months ago - 1 dependent repositories - 890 downloads last month - 316 stars on GitHub - 2 maintainers
vdk-heartbeat 0.6.1220258461
Versatile Data Kit Heartbeat and Health Test
44 versions - Latest release: about 2 months ago - 1 dependent repositories - 236 downloads last month - 379 stars on GitHub - 1 maintainer
pynot-redux 1.3.4
Data Reduction Pipeline for NOT/ALFOSC
44 versions - Latest release: 25 days ago - 1 dependent repositories - 316 downloads last month - 4 stars on GitHub - 1 maintainer
drf-pipeline-views 0.9.1
Django REST framework views using the pipeline pattern.
44 versions - Latest release: 10 months ago - 1 dependent repositories - 919 downloads last month - 1 stars on GitHub - 1 maintainer
Top 6.7% on pypi.org
pylivetrader 0.7.1
simple live trading framework
44 versions - Latest release: about 2 years ago - 14 dependent repositories - 241 downloads last month - 645 stars on GitHub - 1 maintainer
sequana-rnaseq 0.19.3
A RNAseq pipeline from raw reads to feature counts
44 versions - Latest release: 3 months ago - 1 dependent repositories - 232 downloads last month - 17 stars on GitHub - 2 maintainers
goodman-pipeline 1.3.7
Pipeline for reducing Goodman HTS data.
44 versions - Latest release: 9 months ago - 1 dependent repositories - 361 downloads last month - 15 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
ruffus 2.8.4
Light-weight Python Computational Pipeline Management
44 versions - Latest release: about 4 years ago - 3 dependent packages - 79 dependent repositories - 4.2 thousand downloads last month - 171 stars on GitHub - 3 maintainers
libreflow.pianoplayer 2.2.12
Tailor-made flow for a feature film production
43 versions - Latest release: over 1 year ago - 50 downloads last month - 2 maintainers
Top 2.6% on pypi.org
nbclient 0.10.0 💰
A client library for executing notebooks. Formerly nbconvert's ExecutePreprocessor.
41 versions - Latest release: 2 months ago - 149 dependent packages - 22,956 dependent repositories - 21.2 million downloads last month - 135 stars on GitHub - 4 maintainers
prefect-client 2.19.1
Workflow orchestration and management.
41 versions - Latest release: 1 day ago - 9.98 thousand downloads last month - 13,585 stars on GitHub - 1 maintainer
Top 3.6% on pypi.org
coconut 3.1.0 💰
Simple, elegant, Pythonic functional programming.
41 versions - Latest release: 3 months ago - 3 dependent packages - 22 dependent repositories - 2.81 thousand downloads last month - 3,951 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
prefect-aws 0.4.17
Prefect integrations for interacting with Amazon Web Services.
41 versions - Latest release: 1 day ago - 4 dependent packages - 82 dependent repositories - 1.12 million downloads last month - 14,512 stars on GitHub - 2 maintainers
acclimatise 1.2.0
Acclimatise is a Python library and command-line utility for parsing the help output of a command...
41 versions - Latest release: over 3 years ago - 348 downloads last month - 14 stars on GitHub - 1 maintainer
mlpipeline 2.0a7.post1
A framework to define a machine learning pipeline
41 versions - Latest release: over 3 years ago - 1 dependent repositories - 324 downloads last month - 6 stars on GitHub - 1 maintainer
libreflow.thesiren 0.4.14
A Kabaret flow tailor-made for the production of a feature film
40 versions - Latest release: over 2 years ago - 158 downloads last month - 2 maintainers
neumai 0.0.40
Package containing connectors for Neum AI.
40 versions - Latest release: 4 months ago - 1 dependent package - 449 downloads last month - 770 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
vdk-server 0.1.1256945357
Versatile Data Kit SDK plugin that facilitates the installation of a local Control Service.
40 versions - Latest release: about 1 month ago - 1 dependent package - 5 dependent repositories - 711 downloads last month - 408 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
prql-python 0.11.2
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
38 versions - Latest release: 3 months ago - 1 dependent package - 2 dependent repositories - 1.69 thousand downloads last month - 9,417 stars on GitHub - 2 maintainers
Top 4.6% on pypi.org
prefect-gcp 0.5.12
Prefect integrations for interacting with Google Cloud Platform.
38 versions - Latest release: 1 day ago - 2 dependent packages - 39 dependent repositories - 319 thousand downloads last month - 14,512 stars on GitHub - 2 maintainers
nanome-jax 2.0.11
NANOME (Nanopore methylation) pipeline developed by Li Lab at The Jackson Laboratory
37 versions - Latest release: over 1 year ago - 1 dependent repositories - 344 downloads last month - 27 stars on GitHub - 1 maintainer
Top 5.7% on pypi.org
vdk-test-utils 0.2.1239847314
Provides utilities for testing Versatile Data Kit SDK plugins.
37 versions - Latest release: about 1 month ago - 9 dependent repositories - 3.4 thousand downloads last month - 409 stars on GitHub - 1 maintainer
olympipe 1.4.5
A powerful parallel pipelining tool
37 versions - Latest release: 5 months ago - 1 dependent package - 205 downloads last month - 2 stars on GitLab.com - 1 maintainer
Top 1.9% on pypi.org
galaxy-util 24.0.2
Galaxy generic utilities
36 versions - Latest release: 11 days ago - 23 dependent packages - 23 dependent repositories - 19.8 thousand downloads last month - 1,315 stars on GitHub - 3 maintainers
Top 8.1% on pypi.org
peekingduck 1.3.0
A modular framework built to simplify Computer Vision inference workloads.
35 versions - Latest release: over 1 year ago - 3 dependent repositories - 176 downloads last month - 156 stars on GitHub - 1 maintainer
Top 4.7% on pypi.org
vdk-plugin-control-cli 0.1.1220499344
Versatile Data Kit SDK plugin exposing CLI commands for managing the lifecycle of a Data Jobs.
35 versions - Latest release: about 2 months ago - 4 dependent packages - 7 dependent repositories - 1.46 thousand downloads last month - 409 stars on GitHub - 1 maintainer
hoodat-vertex-components 1.6.6
Re-usable kfp components for hoodat
35 versions - Latest release: about 1 year ago - 1 dependent repositories - 336 downloads last month - 3,457 stars on GitHub - 1 maintainer
pandoo 0.3.3
Pandoo: a pipeline of tools for bacterial genomics.
34 versions - Latest release: almost 5 years ago - 1 dependent repositories - 38 downloads last month - 1 stars on GitHub - 1 maintainer
carriage 0.4.15
Enhanced collection classes for programming fluently
34 versions - Latest release: almost 5 years ago - 1 dependent repositories - 169 downloads last month - 9 stars on GitHub - 1 maintainer
ovl 2022.2.2
A modular and versatile Python package for computer vision object detection pipelines tailored fo...
34 versions - Latest release: almost 2 years ago - 1 dependent repositories - 157 downloads last month - 8 stars on GitHub - 1 maintainer
Top 4.4% on pypi.org
mlblocks 0.6.1
Pipelines and primitives for machine learning and data science.
33 versions - Latest release: 8 months ago - 6 dependent packages - 15 dependent repositories - 4.77 thousand downloads last month - 111 stars on GitHub - 4 maintainers
airbyte-source-salesforce 2.5.12
Source implementation for Salesforce.
33 versions - Latest release: about 24 hours ago - 7 dependent repositories - 1.11 thousand downloads last month - 13,821 stars on GitHub - 1 maintainer
persistable 1.2.2
Reproducible parameter based pipelines and persisting
33 versions - Latest release: over 1 year ago - 1 dependent repositories - 199 downloads last month - 10 stars on GitHub - 1 maintainer
my-data-toolkit 0.0.20
Face the engineering of data preprocessing.
33 versions - Latest release: over 1 year ago - 1 dependent repositories - 475 downloads last month - 2 stars on GitHub - 1 maintainer
ascend-io-cli 1.0.18
The Ascend CLI
32 versions - Latest release: 3 months ago - 379 downloads last month - 4 maintainers
pnp 0.28.0
Pull 'n' Push
32 versions - Latest release: about 3 years ago - 1 dependent repositories - 261 downloads last month - 4 stars on GitHub - 1 maintainer
cloudcheck 0.1.0
Check whether an IP address belongs to a cloud provider
31 versions - Latest release: almost 2 years ago - 1 dependent package - 2 dependent repositories - 12.4 thousand downloads last month - 27 stars on GitHub - 1 maintainer