An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data-processing-pipelines" keyword

View the packages on the pypi.org package registry that are tagged with the "data-processing-pipelines" keyword.

nemo-curator 0.7.1
Scalable Data Preprocessing Tool for Training Large Language Models
16 versions - Latest release: 22 days ago - 1.5 thousand downloads last month - 879 stars on GitHub - 5 maintainers
invisible-unicorn 0.4.0
Scalable Data Preprocessing Tool for Training Large Language Models
1 version - Latest release: 7 months ago - 54 downloads last month - 879 stars on GitHub - 1 maintainer
invisible-rabbit 0.5.0
Scalable Data Preprocessing Tool for Training Large Language Models
4 versions - Latest release: 6 months ago - 213 downloads last month - 879 stars on GitHub - 1 maintainer
convtools 1.14.4 💰
dynamic, declarative data transformations with automatic code generation
113 versions - Latest release: about 1 month ago - 3.04 thousand downloads last month - 40 stars on GitHub - 1 maintainer
artifician 0.6.4
Artifician is an event driven framework developed to simplify the process of preparation of the d...
35 versions - Latest release: about 1 year ago - 1.22 thousand downloads last month - 10 stars on GitHub - 1 maintainer
thepipe 1.3.8
A lightweight, general purpose pipeline framework.
15 versions - Latest release: almost 3 years ago - 1 dependent package - 2 dependent repositories - 1.25 thousand downloads last month - 14 stars on GitHub - 2 maintainers
graphbook 0.13.3
The AI-driven data pipeline and workflow framework for data scientists and machine learning engin...
23 versions - Latest release: 11 days ago - 957 downloads last month - 35 stars on GitHub - 1 maintainer
graphbook_huggingface 0.0.6
Graphbook Hugging Face Plugin for no-code Hugging Face AI pipelines
5 versions - Latest release: 19 days ago - 237 downloads last month - 35 stars on GitHub - 1 maintainer