Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "data-pipelines" keyword

datalines 0.1.0 💰
Datalines = Dataloaders + Pipelines, add simplicity
2 versions - Latest release: over 2 years ago - 1 dependent repository - 30 downloads last month - 0 stars on GitHub - 1 maintainer
py-dagger-contrib 0.4.0
Extensions for the Dagger library (py-dagger in PyPI).
4 versions - Latest release: over 2 years ago - 1 dependent repository - 50 downloads last month - 1 star on GitHub - 1 maintainer
imagepypelines-tools 0.4.0a0
accessory library to augment the image processing library imagepypelines
22 versions - Latest release: about 5 years ago - 1 dependent repository - 163 downloads last month - 1 star on GitHub - 3 maintainers
phx-filters 3.4.0
Validation and data pipelines made easy!
10 versions - Latest release: 8 months ago - 2 dependent packages - 16 dependent repositories - 341 downloads last month - 1 star on GitHub - 1 maintainer
bowline-streaming 0.5.0
Bowline: Easily build performant data stream processing pipelines in Python.
15 versions - Latest release: 3 months ago - 301 downloads last month - 1 star on GitHub - 1 maintainer
airflow-run 0.4.9
Simplified Airflow CLI Tool for Launching CeleryExecutor Deployment
28 versions - Latest release: almost 3 years ago - 222 downloads last month - 2 stars on GitHub - 1 maintainer
ipynta 0.0.40 💰
A Python library for different image processing tasks.
32 versions - Latest release: over 2 years ago - 1 dependent repository - 8 downloads last month - 2 stars on GitHub - 1 maintainer
visarchpy 1.0.4
Data pipelines for extraction, transformation and visualization of architectural visuals in Python.
5 versions - Latest release: 4 months ago - 29 downloads last month - 3 stars on GitHub - 1 maintainer
light-pipe 0.3.1
A high-level syntax for data pipelines, designed to make pipeline development quick and painless.
5 versions - Latest release: 12 months ago - 29 downloads last month - 3 stars on GitHub - 1 maintainer
epiphyte 0.1.1
a Python toolkit for high-dimensional neural data recorded during naturalistic, continuous stimuli
4 versions - Latest release: over 3 years ago - 1 dependent repository - 23 downloads last month - 3 stars on GitHub - 1 maintainer
mag-cli 0.1.1
Eases the tasks of managing a magasin instance. The glue between components.
3 versions - Latest release: 3 months ago - 25 downloads last month - 4 stars on GitHub - 1 maintainer
chariots 0.2.4
machine learning pipelines
3 versions - Latest release: over 4 years ago - 1 dependent repository - 32 downloads last month - 6 stars on GitHub - 1 maintainer
imagepypelines 0.2.0a0
data pipeline and convenience library targeted at accelerating the development of imaging project...
7 versions - Latest release: about 4 years ago - 1 dependent repository - 52 downloads last month - 6 stars on GitHub - 3 maintainers
tsdat 0.7.7
A data processing framework used to convert time series data into standardized format.
47 versions - Latest release: 3 months ago - 1 dependent package - 3 dependent repositories - 730 downloads last month - 11 stars on GitHub - 2 maintainers
marshmallow-pyspark 0.2.4
PySpark data serializer
6 versions - Latest release: 5 months ago - 1 dependent repository - 2.14 thousand downloads last month - 12 stars on GitHub - 1 maintainer
py-dagger 0.4.2
Define sophisticated data pipelines with Python and run them on different distributed systems (su...
8 versions - Latest release: over 2 years ago - 1 dependent repository - 79 downloads last month - 13 stars on GitHub - 1 maintainer
iterabledata 1.0.2
Iterable data processing Python library
1 version - Latest release: over 1 year ago - 7 downloads last month - 13 stars on GitHub - 1 maintainer
rivery-cli 0.4.0
Rivery CLI
17 versions - Latest release: about 2 years ago - 1 dependent repository - 79 downloads last month - 17 stars on GitHub - 1 maintainer
smartpipeline 0.7.3
A framework for rapidly developing scalable data pipelines following a simple design pattern
11 versions - Latest release: 5 months ago - 1 dependent repository - 68 downloads last month - 21 stars on GitHub - 1 maintainer
stepist 0.1.5
Data process utils
9 versions - Latest release: about 5 years ago - 1 dependent repository - 50 downloads last month - 27 stars on GitHub - 1 maintainer
kedro-pandera 0.2.1
A kedro plugin to use pandera in your kedro projects
3 versions - Latest release: 14 days ago - 283 downloads last month - 30 stars on GitHub - 1 maintainer
datatiles 0.0.0
Data transformations and pipelines for data science and machine learning.
1 version - Latest release: 8 months ago - 1 dependent repository - 23 downloads last month - 31 stars on GitHub - 1 maintainer
streams-explorer 2.3.2
Explore Data Pipelines in Apache Kafka.
48 versions - Latest release: 7 months ago - 1 dependent repository - 310 downloads last month - 43 stars on GitHub - 1 maintainer
conductor-python 1.1.5
Netflix Conductor Python SDK
75 versions - Latest release: 7 days ago - 1 dependent package - 1 dependent repository - 6.75 thousand downloads last month - 50 stars on GitHub - 1 maintainer
beneath 1.4.2
Python client and CLI for Beneath (https://beneath.dev/)
39 versions - Latest release: over 2 years ago - 2 dependent repositories - 373 downloads last month - 79 stars on GitHub - 2 maintainers
patterns-components 0.1.3
Patterns open-source components
4 versions - Latest release: about 1 year ago - 33 downloads last month - 106 stars on GitHub - 1 maintainer
pureml 0.4.6
Developer platform for production ML.
31 versions - Latest release: about 2 months ago - 2 dependent packages - 267 downloads last month - 171 stars on GitHub - 3 maintainers
dataplane 0.1.3
The data engineering library to build robust, reliable, and on-time data pipelines in Python.
22 versions - Latest release: 10 months ago - 1 dependent package - 1 dependent repository - 415 downloads last month - 181 stars on GitHub - 1 maintainer
recap-core 0.12.0
Recap reads and writes schemas from web services, databases, and schema registries in a standard ...
41 versions - Latest release: 3 months ago - 1 dependent repository - 601 downloads last month - 306 stars on GitHub - 1 maintainer
Top 8.8% on pypi.org
vdk-data-sources 0.1.1190994517
Enables Versatile Data Kit (VDK) to integrate with various data sources by providing a unified in...
5 versions - Latest release: 3 months ago - 5 dependent packages - 1 dependent repository - 394 downloads last month - 379 stars on GitHub - 1 maintainer
vdk-heartbeat 0.6.1220258461
Versatile Data Kit Heartbeat and Health Test
44 versions - Latest release: 2 months ago - 1 dependent repository - 236 downloads last month - 379 stars on GitHub - 1 maintainer
plugin-package-template 0.1.477708478
Plugin template project used to quick-start development of a new Versatile Data Kit SDK plugin.
10 versions - Latest release: about 2 years ago - 1 dependent repository - 8 downloads last month - 408 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
vdk-server 0.1.1256945357
Versatile Data Kit SDK plugin that facilitates the installation of a local Control Service.
40 versions - Latest release: about 1 month ago - 1 dependent package - 5 dependent repositories - 711 downloads last month - 408 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
vdk-control-cli 1.3.1220258461
VDK Control CLI allows users to create, delete, and manage their Data Jobs in a Kubernetes runtime.
82 versions - Latest release: 2 months ago - 5 dependent packages - 5 dependent repositories - 3.64 thousand downloads last month - 408 stars on GitHub - 1 maintainer
vdk-meta-jobs 0.1.1190994517 removed
Express dependencies between data jobs.
29 versions - Latest release: 3 months ago - 1 dependent repository - 39 downloads last month - 408 stars on GitHub - 1 maintainer
vdk-duckdb 0.2.1216772137
DuckDB Plugin for VDK.
7 versions - Latest release: 2 months ago - 946 downloads last month - 408 stars on GitHub - 1 maintainer
vdk-plugin-name 0.1.0
Simple description of my project.
1 version - Latest release: over 1 year ago - 7 downloads last month - 409 stars on GitHub - 1 maintainer
Top 7.0% on pypi.org
vdk-csv 0.1.1190994517
Versatile Data Kit SDK CSV plugin to ingest, export, or manipulate csv files.
21 versions - Latest release: 3 months ago - 5 dependent repositories - 716 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-jobs-troubleshooting 0.2.1190994517
Versatile Data Kit SDK troubleshooting plugin to assist in troubleshooting deployed data jobs.
9 versions - Latest release: 3 months ago - 232 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-notebook 0.1.1190994517
A VDK plugin for working with notebooks
17 versions - Latest release: 3 months ago - 1 dependent repository - 93 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-huggingface 0.1.1190994517
Integrate VDK with Huggingface as both data source and target
4 versions - Latest release: 3 months ago - 31 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-storage 0.1.1190994517
Library for access to different managed storages
4 versions - Latest release: 3 months ago - 15 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-postgres 0.0.1285721801
Versatile Data Kit SDK plugin provides support for PostgreSQL database and postgres transformatio...
23 versions - Latest release: 10 days ago - 2 dependent repositories - 135 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-logging-ltsv 0.1.693641831
Versatile Data Kit SDK plugin that changes logging output to LTSV format.
11 versions - Latest release: over 1 year ago - 1 dependent repository - 28 downloads last month - 409 stars on GitHub - 1 maintainer
Top 9.8% on pypi.org
vdk-kerberos-auth 0.3.1190994517
Versatile Data Kit SDK plugin adds Kerberos/GSSAPI support.
26 versions - Latest release: 3 months ago - 1 dependent repository - 758 downloads last month - 409 stars on GitHub - 1 maintainer
Top 9.8% on pypi.org
vdk-jupyterlab-extension 0.1.1230061130
A JupyterLab extension for using VDK
83 versions - Latest release: about 2 months ago - 1 dependent repository - 804 downloads last month - 409 stars on GitHub - 1 maintainer
Top 8.0% on pypi.org
vdk-trino 0.4.1290280218
Versatile Data Kit SDK plugin provides support for trino database and trino transformation templa...
52 versions - Latest release: 6 days ago - 11 dependent repositories - 510 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-snowflake 0.2.1190994517
Versatile Data Kit SDK plugin provides support for snowflake databases.
13 versions - Latest release: 3 months ago - 2 dependent repositories - 48 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-confluence-data-source 0.1.1190994517
VDK data source plugin for Confluence
3 versions - Latest release: 3 months ago - 24 downloads last month - 409 stars on GitHub - 1 maintainer
Top 9.8% on pypi.org
vdk-impala 0.4.1245476944
Versatile Data Kit SDK plugin provides support for Impala database.
67 versions - Latest release: about 1 month ago - 1 dependent repository - 383 downloads last month - 409 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
vdk-logging-format 0.1.1184833162
Versatile Data Kit SDK plugin that configures logging output format.
7 versions - Latest release: 3 months ago - 1 dependent package - 4 dependent repositories - 23 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-oracle 0.1.1284357447
Support for VDK Managed Oracle connection
21 versions - Latest release: 11 days ago - 330 downloads last month - 409 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
vdk-ingest-http 0.2.1184833162
Versatile Data Kit SDK ingestion plugin to ingest data via http requests.
31 versions - Latest release: 3 months ago - 1 dependent package - 7 dependent repositories - 912 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-singer 0.1.1184833162
The plugin provides seamless configuration and execution of Singer Taps and Targets.
2 versions - Latest release: 3 months ago - 8 downloads last month - 409 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
vdk-ingest-file 0.1.1190994517
Versatile Data Kit SDK ingestion plugin to ingest data into a file.
17 versions - Latest release: 3 months ago - 1 dependent package - 7 dependent repositories - 806 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-smarter 0.1.1227661975
Making VDK smarter by employing ML/AI.
6 versions - Latest release: about 2 months ago - 32 downloads last month - 409 stars on GitHub - 1 maintainer
Top 4.6% on pypi.org
vdk-control-api-auth 0.1.1190994517
Versatile Data Kit plugin library provides support for authentication.
15 versions - Latest release: 3 months ago - 3 dependent packages - 5 dependent repositories - 3.06 thousand downloads last month - 409 stars on GitHub - 1 maintainer
vdk-ipython 0.2.1269113113
IPython extension for VDK
16 versions - Latest release: 24 days ago - 1 dependent repository - 98 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-data-source-git 0.1.1190994517
Read Git repository data source
3 versions - Latest release: 3 months ago - 10 downloads last month - 409 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
vdk-lineage-model 0.0.1190994517
VDK Lineage Model plugin defines common lineage model and classes used for managing lineage inform...
7 versions - Latest release: 3 months ago - 3 dependent packages - 5 dependent repositories - 1.37 thousand downloads last month - 409 stars on GitHub - 1 maintainer
vdk-structlog 0.1.1190994517
Structured logging for versatile data kit
17 versions - Latest release: 3 months ago - 1 dependent package - 636 downloads last month - 409 stars on GitHub - 1 maintainer
Top 3.6% on pypi.org
vdk-core 0.3.1284553848
Versatile Data Kit SDK Core
210 versions - Latest release: 11 days ago - 35 dependent packages - 9 dependent repositories - 6.01 thousand downloads last month - 409 stars on GitHub - 1 maintainer
vdk-audit 0.1.1190994517
Versatile Data Kit SDK Audit plugin restricts forbidden operations.
11 versions - Latest release: 3 months ago - 210 downloads last month - 409 stars on GitHub - 1 maintainer
Top 5.6% on pypi.org
vdk-tino 10.7.8
Simple description of my project.
2 versions - Latest release: over 1 year ago - 6 downloads last month - 409 stars on GitHub - 1 maintainer
airflow-provider-vdk 0.0.1184833162
Airflow provider for Versatile Data Kit.
16 versions - Latest release: 3 months ago - 1 dependent repository - 52 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-greenplum 0.0.1245476944
Versatile Data Kit SDK plugin provides support for Greenplum database and greenplum transformatio...
19 versions - Latest release: about 1 month ago - 1 dependent repository - 88 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-properties-fs 0.0.1190994517
Versatile Data Kit SDK plugin provides support for Properties API client that uses local FS storage.
8 versions - Latest release: 3 months ago - 24 downloads last month - 409 stars on GitHub - 1 maintainer
Top 4.7% on pypi.org
vdk-plugin-control-cli 0.1.1220499344
Versatile Data Kit SDK plugin exposing CLI commands for managing the lifecycle of Data Jobs.
35 versions - Latest release: 2 months ago - 4 dependent packages - 7 dependent repositories - 1.46 thousand downloads last month - 409 stars on GitHub - 1 maintainer
vdk-lineage 0.3.1227661975
VDK Lineage plugin collects lineage (input -> job -> output) information and sends it to a pre-con...
19 versions - Latest release: about 2 months ago - 1 dependent repository - 78 downloads last month - 409 stars on GitHub - 1 maintainer
Top 9.9% on pypi.org
vdk-dag 0.1.1236886565
Express dependencies between data jobs.
17 versions - Latest release: about 2 months ago - 1 dependent repository - 236 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-logging-json 0.1.693641831
Versatile Data Kit SDK plugin that changes logging output to JSON format.
17 versions - Latest release: over 1 year ago - 2 dependent repositories - 80 downloads last month - 409 stars on GitHub - 1 maintainer
Top 5.7% on pypi.org
vdk-test-utils 0.2.1239847314
Provides utilities for testing Versatile Data Kit SDK plugins.
37 versions - Latest release: about 2 months ago - 9 dependent repositories - 3.4 thousand downloads last month - 409 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
quickstart-vdk 0.2.1287193961
Versatile Data Kit SDK packaging containing common plugins to get started quickly using it.
614 versions - Latest release: 9 days ago - 1 dependent repository - 6 thousand downloads last month - 409 stars on GitHub - 1 maintainer
vdk-control-cli-name 0.1.0
Simple description of my project.
1 version - Latest release: over 1 year ago - 13 downloads last month - 409 stars on GitHub - 1 maintainer
vdk-gdp-execution-id 0.0.1190994517
This Versatile Data Kit SDK plugin is a Generative Data Pack that expands each ingested dataset ...
9 versions - Latest release: 3 months ago - 1 dependent repository - 79 downloads last month - 409 stars on GitHub - 1 maintainer
Top 4.7% on pypi.org
vdk-control-service-api 1.0.11
Versatile Data Kit Control Service API
11 versions - Latest release: 12 months ago - 3 dependent packages - 5 dependent repositories - 3.13 thousand downloads last month - 409 stars on GitHub - 1 maintainer
Top 1.7% on pypi.org
mleap 0.23.1
MLeap Python API
15 versions - Latest release: 6 months ago - 3 dependent packages - 61 dependent repositories - 158 thousand downloads last month - 1,494 stars on GitHub - 2 maintainers
mleap-splice 0.17.0a0
MLeap Python API
1 version - Latest release: over 3 years ago - 1 dependent repository - 12 downloads last month - 1,494 stars on GitHub - 1 maintainer
Top 3.4% on pypi.org
meltano 3.4.2
Meltano is your CLI for ELT+: Open Source, Flexible, and Scalable. Move, transform, and test your...
273 versions - Latest release: 5 days ago - 1 dependent package - 42 dependent repositories - 35.4 thousand downloads last month - 1,598 stars on GitHub - 1 maintainer
Top 6.3% on pypi.org
elementary-data 0.15.1
Data monitoring and lineage
72 versions - Latest release: 11 days ago - 1 dependent repository - 400 thousand downloads last month - 1,725 stars on GitHub - 1 maintainer
elementary-lineage 0.1.4
elementary-lineage is deprecated and moved to elementary-data
26 versions - Latest release: over 2 years ago - 1 dependent repository - 187 downloads last month - 1,725 stars on GitHub - 1 maintainer
Top 7.8% on pypi.org
orchest 0.3.11
SDK for Orchest
14 versions - Latest release: over 1 year ago - 1 dependent repository - 427 downloads last month - 4,019 stars on GitHub - 2 maintainers
orchest-cli 0.8.1
CLI for Orchest
19 versions - Latest release: about 1 year ago - 1 dependent repository - 99 downloads last month - 4,019 stars on GitHub - 2 maintainers
Top 1.5% on pypi.org
unstructured 0.14.0
A library that prepares raw documents for downstream ML tasks.
133 versions - Latest release: 3 days ago - 113 dependent packages - 3,374 dependent repositories - 1.13 million downloads last month - 4,064 stars on GitHub - 1 maintainer
lftakakura-mage-ai 0.9.37a1
Mage is a tool for building and deploying data pipelines.
1 version - Latest release: 7 months ago - 15 downloads last month - 6,086 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
mage-ai 0.9.70
Mage is a tool for building and deploying data pipelines.
333 versions - Latest release: 24 days ago - 2 dependent repositories - 46.2 thousand downloads last month - 6,940 stars on GitHub - 2 maintainers
Top 9.3% on pypi.org
dagster-flyte 0.9.15
A Dagster integration for Flyte
72 versions - Latest release: over 3 years ago - 1 dependent repository - 448 downloads last month - 9,161 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
dagster-msteams 1.0.5
A Microsoft Teams client resource for posting to Microsoft Teams
239 versions - Latest release: over 1 year ago - 1 dependent repository - 20.3 thousand downloads last month - 9,174 stars on GitHub - 1 maintainer
Top 4.8% on pypi.org
dagster-pipes 1.7.6
Toolkit for Dagster integrations with transform logic outside of Dagster
42 versions - Latest release: 4 days ago - 2 dependent packages - 1 dependent repository - 738 thousand downloads last month - 9,174 stars on GitHub - 1 maintainer
dagster-deltalake-polars 0.23.6
Package for storing Polars DataFrames in Delta tables.
33 versions - Latest release: 4 days ago - 1.47 thousand downloads last month - 9,174 stars on GitHub - 1 maintainer
Top 2.8% on pypi.org
dagster-ext-process 1.4.17
Toolkit for Dagster integrations with transform logic outside of Dagster
6 versions - Latest release: 8 months ago - 2 dependent packages - 5 dependent repositories - 3.42 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
dagster-wandb 0.23.6
Package for wandb Dagster components.
92 versions - Latest release: 4 days ago - 5 dependent repositories - 1.6 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
dagster-deltalake 0.23.6
Package for Deltalake-specific Dagster framework op and resource components.
33 versions - Latest release: 4 days ago - 2 dependent packages - 2.72 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
Top 5.5% on pypi.org
dagster-azure 1.0.5
Package for Azure-specific Dagster framework op and resource components.
450 versions - Latest release: over 1 year ago - 1 dependent package - 1 dependent repository - 83.1 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
dagster-prometheus 1.0.5
A Dagster integration for Prometheus
492 versions - Latest release: over 1 year ago - 1 dependent repository - 16.8 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
Top 2.1% on pypi.org
dagster-snowflake 1.0.5
Package for Snowflake Dagster framework components.
539 versions - Latest release: over 1 year ago - 2 dependent packages - 10 dependent repositories - 90.5 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
dagster-ge 1.0.5
Package for GE-specific Dagster framework op and resource components.
508 versions - Latest release: over 1 year ago - 11 dependent repositories - 7 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
Top 4.1% on pypi.org
dagster-dask 1.0.5
Package for using Dask as Dagster's execution engine.
516 versions - Latest release: over 1 year ago - 8 dependent repositories - 6.75 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
Top 2.6% on pypi.org
dagster-pandera 1.0.5
Integration layer for dagster and pandera.
188 versions - Latest release: over 1 year ago - 1 dependent package - 9 dependent repositories - 23.7 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer
Top 3.7% on pypi.org
dagster-snowflake-pandas 1.0.5
Package for integrating Snowflake and Pandas with Dagster.
163 versions - Latest release: over 1 year ago - 8 dependent repositories - 48.7 thousand downloads last month - 9,191 stars on GitHub - 1 maintainer