pypi.org "data-processing" keyword
evolvishub-dataloader 1.0.0
A comprehensive data loading framework for Excel, CSV, and JSON files with async support and data...
1 version - Latest release: 10 months ago - 15 downloads last month - 1 maintainer
Top 8.1% on pypi.org
flowquery 1.0.48
A declarative query language for data processing pipelines
48 versions - Latest release: 3 days ago - 969 downloads last month - 9 stars on GitHub - 1 maintainer
cocoindex 999.0.0
With CocoIndex, users declare the transformation, CocoIndex creates & maintains an index, and kee...
201 versions - Latest release: 5 months ago - 37.4 thousand downloads last month - 6,704 stars on GitHub - 1 maintainer
geniescript 0.1.1
A Python package for generating and executing data processing scripts using AI language models
2 versions - Latest release: over 1 year ago - 18 downloads last month - 1 star on GitHub - 1 maintainer
openforis-whisp 0.0.1
Whisp (What is in that plot) is an open-source solution which helps to produce relevant forest mo...
33 versions - Latest release: about 1 year ago - 390 downloads last month - 30 stars on GitHub - 1 maintainer
gaojie-niceutils 0.1.0
A collection of handy Python utility functions for file I/O, data processing, and more.
1 version - Latest release: 1 day ago - 1 maintainer
lesslines 0.0.4
explore python with less lines of code
4 versions - Latest release: about 1 year ago - 7 downloads last month - 0 stars on GitHub - 1 maintainer
amanogawa 0.0.1a1
Flexible graph construction and data pre-processing engine
2 versions - Latest release: over 6 years ago - 1 dependent repository - 48 downloads last month - 5 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
pandera 0.31.0 💰
A light-weight and flexible data validation and testing tool for statistical data objects.
120 versions - Latest release: 1 day ago - 97 dependent packages - 229 dependent repositories - 8.56 million downloads last month - 3,009 stars on GitHub - 3 maintainers
uvvispy 0.1.1
Package for handling optical absorption data.
4 versions - Latest release: almost 5 years ago - 1 dependent repository - 33 downloads last month - 2 stars on GitHub - 1 maintainer
omforme 2023.6.18
Reshape (Danish: omforme).
2 versions - Latest release: almost 3 years ago - 17 downloads last month - 1 maintainer
ddeutil-extensions 0.0.1 💰
Extension functions and objects
1 version - Latest release: about 1 year ago - 15 downloads last month - 0 stars on GitHub - 1 maintainer
datasetops 0.0.6
Fluent dataset operations, compatible with your favorite libraries
4 versions - Latest release: about 6 years ago - 4 dependent repositories - 307 downloads last month - 11 stars on GitHub - 1 maintainer
nvidia-nvimgcodec-tegra-cu12 0.7.0.11
NVIDIA nvimgcodec tegra for CUDA 12.
7 versions - Latest release: 4 months ago - 194 downloads last month - 146 stars on GitHub - 1 maintainer
nvidia-nvimgcodec-cu11 0.6.1.37
NVIDIA nvimgcodec for CUDA 11. Git SHA:
7 versions - Latest release: 5 months ago - 2 dependent packages - 10.1 thousand downloads last month - 146 stars on GitHub - 1 maintainer
nvidia-nvimgcodec-cu13 0.7.0.11
NVIDIA nvimgcodec for CUDA 13.
4 versions - Latest release: 4 months ago - 4.71 thousand downloads last month - 146 stars on GitHub - 1 maintainer
nvidia-nvimgcodec-cu12 0.7.0.11
NVIDIA nvimgcodec for CUDA 12.
8 versions - Latest release: 4 months ago - 3 dependent packages - 74.4 thousand downloads last month - 145 stars on GitHub - 1 maintainer
nvidia-nvimgcodec-tegra-cu13 0.6.0.32
NVIDIA nvimgcodec tegra for CUDA 13. Git SHA:
2 versions - Latest release: 8 months ago - 166 downloads last month - 146 stars on GitHub - 1 maintainer
philiprehberger-data-pipeline 0.5.0
Composable data transformation pipeline with lazy evaluation.
12 versions - Latest release: 9 days ago - 958 downloads last month - 2 maintainers
eyecu-bumblebee 0.5.7
Advanced pipelines for video datasets
46 versions - Latest release: about 5 years ago - 1 dependent repository - 143 downloads last month - 8 stars on GitHub - 1 maintainer
ray-milvus 0.1.1
Ray Data datasource and datasink for Milvus Storage
2 versions - Latest release: 3 months ago - 9 downloads last month - 2 maintainers
linepipe 0.2.2
Composable, linear pipelines
5 versions - Latest release: 3 months ago - 514 downloads last month - 1 maintainer
pvdata 0.2.1
High-performance toolkit for photovoltaic data processing, PV simulation, and time series analysis
6 versions - Latest release: 4 months ago - 23 downloads last month - 1 maintainer
vaspy 0.8.12
A pure Python library designed to make it easy and quick to manipulate VASP files
19 versions - Latest release: about 7 years ago - 1 dependent repository - 173 downloads last month - 291 stars on GitHub - 1 maintainer
niamoto 0.14.4
Niamoto is a command-line application and library focused on processing and publishing botanical ...
46 versions - Latest release: 3 days ago - 1.11 thousand downloads last month - 3 stars on GitHub - 1 maintainer
eoir 0.0.1
EOIR FOIA data processing tools
1 version - Latest release: 9 months ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
pybcsv 1.5.4
High-performance Python bindings for the BCSV (Binary CSV) library with pandas integration
12 versions - Latest release: 4 days ago - 1.74 thousand downloads last month - 0 stars on GitHub - 1 maintainer
m-patternpy 2.0.1
Trading analysis with high-speed pattern recognition, leveraging Pandas & Numpy. Effortlessly spo...
2 versions - Latest release: about 1 year ago - 150 downloads last month - 326 stars on GitHub - 1 maintainer
unifiles 0.4.0
A unified file-operations library that provides a consistent interface across file types
3 versions - Latest release: 22 days ago - 193 downloads last month - 1 maintainer
excel-toolbox 1.0.0
All-in-one Excel data processing toolkit - an end-to-end solution for data consolidation, cleaning, and transformation
1 version - Latest release: 4 months ago - 19 downloads last month - 1 maintainer
eencijferho 0.1.3
Standalone backend processing toolkit for 1CijferHO
4 versions - Latest release: 7 days ago - 219 downloads last month - 7 stars on GitHub - 1 maintainer
pygaps 4.6.1
Framework for processing gas adsorption isotherms
32 versions - Latest release: about 1 year ago - 1 dependent repository - 835 downloads last month - 77 stars on GitHub - 1 maintainer
sl-shared-assets 7.0.0
Provides data acquisition and processing assets shared between Sun (NeuroAI) lab libraries.
92 versions - Latest release: 3 months ago - 453 downloads last month - 0 stars on GitHub - 1 maintainer
transmog 2.0.4
A data transformation library for flattening complex nested structures into tabular formats while...
19 versions - Latest release: about 1 month ago - 2.14 thousand downloads last month - 1 star on GitHub - 1 maintainer
pyadps 0.3.3
A Python package for ADCP data processing
19 versions - Latest release: 6 months ago - 51 downloads last month - 2 stars on GitHub - 1 maintainer
datagpu 0.1.1
Open-source data compiler for AI training datasets
2 versions - Latest release: 5 months ago - 13 downloads last month - 1 maintainer
px-processor 0.2.4
Process and validate JSON and CSV data with ease.
7 versions - Latest release: 8 months ago - 44 downloads last month - 1 maintainer
Top 6.6% on pypi.org
libertem 0.15.2
Open pixelated STEM framework
42 versions - Latest release: 9 months ago - 2 dependent packages - 3 dependent repositories - 1.51 thousand downloads last month - 122 stars on GitHub - 3 maintainers
streamchunk 2.0.1
Adaptive data stream chunker with CPU-aware parallel processing for real-time ETL pipelines
2 versions - Latest release: about 1 month ago - 38 downloads last month - 0 stars on GitHub - 1 maintainer
pytemporal 1.4.27
High-performance bitemporal data processing for Python
43 versions - Latest release: 4 months ago - 925 downloads last month - 1 star on GitHub - 1 maintainer
exonware-xwnode 0.9.0.21
Node-based data processing and graph computation library
30 versions - Latest release: 11 days ago - 2.54 thousand downloads last month - 1 maintainer
xwnode 0.9.0.21
Convenience wrapper for exonware-xwnode - provides 'import xwnode' alias
28 versions - Latest release: 11 days ago - 1.39 thousand downloads last month - 1 maintainer
bellapy 1.0.0
ML data toolkit - 29 features for dataset processing
1 version - Latest release: 3 months ago - 30 downloads last month - 1 maintainer
dagster-kafka 1.3.1
Enterprise-grade Kafka integration for Dagster with Confluent Connect, comprehensive serializatio...
9 versions - Latest release: 8 months ago - 534 downloads last month - 10 stars on GitHub - 1 maintainer
dagloom 0.1.0
A lightweight pipeline/workflow engine. Weave data processing nodes into DAG workflows with decor...
1 version - Latest release: 9 days ago - 1 maintainer
1cijferho 0.1.0
Professional tools for processing Dutch higher educational data (1CijferHO / ROD)
1 version - Latest release: 6 months ago - 53 downloads last month - 4 stars on GitHub - 1 maintainer
snowpark-checkpoints 0.4.0
Snowflake Snowpark Checkpoints
16 versions - Latest release: 10 months ago - 121 downloads last month - 5 stars on GitHub - 1 maintainer
qufe 0.5.18
A comprehensive Python utility library for data processing, file handling, database management, a...
24 versions - Latest release: about 1 month ago - 314 downloads last month - 0 stars on GitHub - 1 maintainer
cratedb-toolkit 0.0.46
CrateDB Toolkit
44 versions - Latest release: about 1 month ago - 3 dependent packages - 1 dependent repository - 10.5 thousand downloads last month - 12 stars on GitHub - 5 maintainers
dolma-rust-components 1.3.0
Rust components for Dolma - Toolkit for pre-processing LLM training data.
2 versions - Latest release: 7 months ago - 12 downloads last month - 1,314 stars on GitHub - 2 maintainers
noob 1000.0.1
A graph processing library for processing graphs
4 versions - Latest release: about 1 month ago - 253 downloads last month - 2 maintainers
lumpur 0.0.6
learn to use methods for processing unclear response
6 versions - Latest release: over 1 year ago - 26 downloads last month - 0 stars on GitHub - 1 maintainer
stardust-sdk 0.1.1
Stardust SDK for AI/ML data processing and annotation workflows
2 versions - Latest release: 7 months ago - 15 downloads last month - 0 stars on GitHub - 1 maintainer
lucid-pipeline 0.1.1
A clean, expressive pipeline pattern for Python.
2 versions - Latest release: 11 days ago - 1 maintainer
haashi-pkg 1.6.0
A modular Python toolkit for analytics workflows, including data processing, visualization, and r...
1 version - Latest release: 11 days ago - 97 downloads last month - 1 maintainer
flycatcher 0.1.0
Define your data schema once. Validate at scale. Stay columnar.
1 version - Latest release: 5 months ago - 143 downloads last month - 3 stars on GitHub - 1 maintainer
stagecraft 0.1.9
A Python library for building robust ETL pipelines with declarative stages and data flow management
10 versions - Latest release: about 2 months ago - 273 downloads last month - 0 stars on GitHub - 1 maintainer
tsqlike 1.1.8
SQL-like interface to tabular structured data
18 versions - Latest release: over 1 year ago - 138 downloads last month - 0 stars on GitHub - 1 maintainer
pylib-docgen 0.1.0
Generate README/API docs using AI summarization. AI-powered documentation. Perfect for AI agents ...
1 version - Latest release: 5 months ago - 8 downloads last month - 1 maintainer
pylib-streams 0.1.0
Chainable functional API for list processing (map/filter/reduce). Great for data pipelines. Perfe...
1 version - Latest release: 5 months ago - 11 downloads last month - 1 maintainer
pylib-timer 0.1.0
Code timers, profiling decorators. Performance measurement tools.
1 version - Latest release: 5 months ago - 12 downloads last month - 1 maintainer
pylib-serializer 0.1.0
Safe JSON/YAML serialization with circular-reference handling. Data processing utility.
1 version - Latest release: 5 months ago - 8 downloads last month - 1 maintainer
pylib-autogptkit 0.1.0
Build agent workflows with memory & tools. Autonomous AI agents toolkit. Perfect for AI agents an...
1 version - Latest release: 5 months ago - 12 downloads last month - 1 maintainer
pylib-authclient 0.1.0
OAuth2 / API token helper toolkit. Authentication utilities.
1 version - Latest release: 5 months ago - 11 downloads last month - 1 maintainer
pylib-checksum 0.1.0
Compute MD5/SHA file hashes and integrity checks. Security utilities.
1 version - Latest release: 5 months ago - 12 downloads last month - 1 maintainer
pylib-tzconvert 0.1.0
Time-zone conversions and aware datetimes. Timezone utilities.
1 version - Latest release: 5 months ago - 10 downloads last month - 1 maintainer
pylib-searchalgo 0.1.0
Search & sort algorithms with performance metrics. Essential for AI and ML applications. Perfect ...
1 version - Latest release: 5 months ago - 11 downloads last month - 1 maintainer
pylib-summarize 0.1.0
Summarize long text using frequency-based or AI-based summarizers. Perfect for AI agents. Perfect...
1 version - Latest release: 5 months ago - 10 downloads last month - 1 maintainer
pylib-datastruct 0.1.0
Educational DSA implementations (Stack, Queue, Graph, Tree). Core algorithms library.
1 version - Latest release: 5 months ago - 9 downloads last month - 1 maintainer
pylib-daterange 0.1.0
Generate date ranges, sequences, calendars. Date range utilities.
1 version - Latest release: 5 months ago - 10 downloads last month - 1 maintainer
pylib-taskqueue 0.1.0
In-memory async queue with retry & backoff. Task processing for AI agents. Perfect for AI agents ...
1 version - Latest release: 5 months ago - 10 downloads last month - 1 maintainer
pylib-compare 0.1.0
Deep diff for dicts/lists with patch generation. Data comparison utilities.
1 version - Latest release: 5 months ago - 16 downloads last month - 1 maintainer
pylib-fetcher 0.1.0
Async HTTP client with caching and metrics. Network utilities for AI agents. Perfect for AI agent...
1 version - Latest release: 5 months ago - 10 downloads last month - 1 maintainer
pylib-aibox 0.1.0
Prompt templates & LLM orchestration helpers. Essential for AI agents and LLMs. Perfect for AI ag...
1 version - Latest release: 5 months ago - 12 downloads last month - 1 maintainer
pylib-scheduler 0.1.0
Lightweight cron/job scheduler with async support. Task scheduling utilities.
1 version - Latest release: 5 months ago - 10 downloads last month - 1 maintainer
pylib-textai 0.1.0
Sentiment analysis, embeddings, summarization wrappers. AI text processing. Perfect for AI agents...
1 version - Latest release: 5 months ago - 11 downloads last month - 0 stars on GitHub - 1 maintainer
pylib-dateutils 0.1.0
Days-between, add/subtract, format/parse helpers. Date manipulation utilities.
1 version - Latest release: 5 months ago - 13 downloads last month - 1 maintainer
pylib-validator 0.1.0
Validate Python objects with schema definitions. Perfect for API validation.
1 version - Latest release: 5 months ago - 19 downloads last month - 1 maintainer
pylib-restmock 0.1.0
Mock external REST APIs for tests. Testing utilities.
1 version - Latest release: 5 months ago - 11 downloads last month - 1 maintainer
pylib-optimize 0.1.0
Simple optimization solvers (gradient descent, LP). Machine learning utilities.
1 version - Latest release: 5 months ago - 13 downloads last month - 1 maintainer
pylib-dictutils 0.1.0
Deep merge, flatten, pick, omit, and transform nested dicts. Essential for data processing.
1 version - Latest release: 5 months ago - 12 downloads last month - 1 maintainer
pylib-compress 0.1.0
Compress/decompress files (zip, gzip, tar). File compression utilities.
1 version - Latest release: 5 months ago - 10 downloads last month - 1 maintainer
pylib-codefixer 0.1.0
AI-based code refactoring & linting assistant. Code quality AI tools. Perfect for AI agents and L...
1 version - Latest release: 5 months ago - 11 downloads last month - 1 maintainer
databus 0.1.0
Python SDK and command-line toolkit for GTFS data processing, validation, and analysis. Provides ...
1 version - Latest release: 8 months ago - 25 downloads last month - 1 maintainer
data-science-kit 0.0.1
Data Science Basic Functions
1 version - Latest release: almost 5 years ago - 1 dependent repository - 13 downloads last month - 1 star on GitHub - 1 maintainer
nvidia-dali-nightly-cuda120 1.53.0.dev20251017
NVIDIA DALI nightly for CUDA 12.0. Git SHA: e2db4d795524dd2274dec9fbe479d2e8e50c6f23
276 versions - Latest release: 6 months ago - 1.46 thousand downloads last month - 4,992 stars on GitHub - 1 maintainer
nvidia-dali-nightly-cuda110 1.52.0.dev20250626
NVIDIA DALI nightly for CUDA 11.0. Git SHA: 2be08c56f2be9ec8055256256039eb534ab7a080
241 versions - Latest release: 10 months ago - 1 dependent repository - 622 downloads last month - 4,992 stars on GitHub - 1 maintainer
nvidia-dali-tf-plugin-nightly-cuda120 1.43.0.dev20240919
NVIDIA DALI nightly TensorFlow plugin for CUDA 12.0. Git SHA: 94f02ad69abe149f345684ef2aba3e13d2...
170 versions - Latest release: over 1 year ago - 366 downloads last month - 4,992 stars on GitHub - 1 maintainer
nvidia-dali-tf-plugin-nightly-cuda110 1.43.0.dev20240919
NVIDIA DALI nightly TensorFlow plugin for CUDA 11.0. Git SHA: 94f02ad69abe149f345684ef2aba3e13d2...
152 versions - Latest release: over 1 year ago - 430 downloads last month - 4,992 stars on GitHub - 1 maintainer
nvidia-dali-weekly-cuda120 1.52.0.dev20250720
NVIDIA DALI weekly for CUDA 12.0. Git SHA: 67f2c79cbb2488d43757d94e30369464f2a516eb
37 versions - Latest release: 9 months ago - 110 downloads last month - 5,531 stars on GitHub - 1 maintainer
nvidia-dali-cuda130 2.0.0
NVIDIA DALI for CUDA 13.0. Git SHA: a807a5a11d234580f6857bc4b3206ab8d7080f27
5 versions - Latest release: about 1 month ago - 706 downloads last month - 5,531 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
nvidia-dali-tf-plugin-cuda110 1.50.0
NVIDIA DALI TensorFlow plugin for CUDA 11.0. Git SHA: d5c7b54f776fcba58944048f984f5645e7d7d1bb
37 versions - Latest release: 11 months ago - 6 dependent repositories - 141 downloads last month - 5,578 stars on GitHub - 1 maintainer
nvidia-dali-tf-plugin-cuda120 2.0.0
NVIDIA DALI TensorFlow plugin for CUDA 12.0. Git SHA: a807a5a11d234580f6857bc4b3206ab8d7080f27
34 versions - Latest release: about 1 month ago - 272 downloads last month - 5,564 stars on GitHub - 1 maintainer
Top 2.1% on pypi.org
nvidia-dali-cuda110 1.50.0
NVIDIA DALI for CUDA 11.0. Git SHA: d5c7b54f776fcba58944048f984f5645e7d7d1bb
37 versions - Latest release: 11 months ago - 5 dependent packages - 95 dependent repositories - 2.36 thousand downloads last month - 5,551 stars on GitHub - 1 maintainer
nvidia-dali-tf-plugin-cuda130 2.0.0
NVIDIA DALI TensorFlow plugin for CUDA 13.0. Git SHA: a807a5a11d234580f6857bc4b3206ab8d7080f27
5 versions - Latest release: about 1 month ago - 33 downloads last month - 5,637 stars on GitHub - 1 maintainer
nvidia-dali-nightly-cuda130 1.53.0.dev20251017
NVIDIA DALI nightly for CUDA 13.0. Git SHA: e2db4d795524dd2274dec9fbe479d2e8e50c6f23
18 versions - Latest release: 6 months ago - 85 downloads last month - 5,637 stars on GitHub - 1 maintainer
nvidia-dali-tf-plugin-weekly-cuda120 1.42.0.dev20240915
NVIDIA DALI weekly TensorFlow plugin for CUDA 12.0. Git SHA: 408c18bb0d8a7c1b300e02fd7f6bb58369f...
27 versions - Latest release: over 1 year ago - 73 downloads last month - 5,551 stars on GitHub - 1 maintainer
nvidia-dali-weekly-cuda130 1.52.0.dev20251005
NVIDIA DALI weekly for CUDA 13.0. Git SHA: 4da8adfb6b58c3a3c352f98c6f431b49323ac518
3 versions - Latest release: 6 months ago - 18 downloads last month - 5,539 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
nvidia-dali-cuda120 2.0.0
NVIDIA DALI for CUDA 12.0. Git SHA: a807a5a11d234580f6857bc4b3206ab8d7080f27
35 versions - Latest release: about 1 month ago - 1 dependent package - 11 dependent repositories - 48.4 thousand downloads last month - 5,531 stars on GitHub - 1 maintainer
Top 8.3% on pypi.org
texar-pytorch 0.1.4
Toolkit for Machine Learning and Text Generation
5 versions - Latest release: about 4 years ago - 1 dependent package - 14 dependent repositories - 3.22 thousand downloads last month - 747 stars on GitHub - 1 maintainer
Related Keywords
machine-learning (90)
python (89)
data-science (65)
ai (50)
pipeline (41)
ml (37)
etl (36)
deep-learning (36)
pytorch (33)
nlp (33)
utilities (31)
pandas (31)
data (30)
data-analysis (28)
csv (25)
image-processing (25)
json (24)
gpu (21)
workflow (20)
llm (19)
fast-data-pipeline (19)
excel (18)
analytics (15)
audio-processing (15)
data-engineering (15)
async (15)
streaming (15)
data-cleaning (14)
automation (14)
gpu-tensorflow (14)
image-augmentation (14)
neural-network (14)
mxnet (14)
data-augmentation (14)
paddle (14)
data-visualization (13)
data-pipeline (13)
pipelines (11)
database (11)
python3 (10)
polars (10)
dataset (9)
natural-language-processing (9)
validation (9)
numpy (9)
parquet (9)
deduplication (8)
api (8)
cli (8)
visualization (8)
data-processing-pipelines (8)
spark (8)
big-data (7)
real-time (7)
performance (7)
data-analytics (7)
kubernetes (7)
distributed (7)
data-preparation (7)
data-preprocessing (7)
research (7)
multiprocessing (7)
data-validation (7)
mcp (7)
computer-vision (7)
rust (7)
data-transformation (6)
tensorflow (6)
cuda (6)
stream-processing (6)
parallel (6)
framework (6)
dataframe (6)
large-language-models (6)
data-pipelines (6)
matplotlib (5)
openpyxl (5)
dali (5)
kafka (5)
machine learning (5)
converter (5)
postgresql (5)
business-intelligence (5)
text-processing (5)
nvidia (5)
data-quality (5)
python-library (5)
mlops (5)
ray (5)
compression (5)
graph (5)
data science (5)
data-management (5)
sql (5)
preprocessing (5)
duckdb (5)
cpp (5)
sqlite (5)
text-data (4)
data-curation (4)