An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data processing" keyword

View the packages on the pypi.org package registry that are tagged with the "data processing" keyword.

netcdf-scm 2.1.1
Processing netCDF files for use with simple climate models
43 versions - Latest release: over 4 years ago - 1 dependent repositories - 2.33 thousand downloads last month - 6 stars on GitHub - 2 maintainers
giga-spatial 0.8.0 💰
A package for spatial data download & processing
18 versions - Latest release: 3 days ago - 537 downloads last month - 17 stars on GitHub - 1 maintainer
abconnect 0.2.1
A set of tools for connecting and processing data for Annex Brands, featuring API pack and ship q...
10 versions - Latest release: about 1 month ago - 499 downloads last month - 1 stars on GitHub - 1 maintainer
yalab-procedures 0.0.1
The yalab-procedures repository is dedicated to managing and executing data processing procedures...
1 version - Latest release: over 1 year ago - 10 downloads last month - 1 maintainer
Top 1.9% on pypi.org
datatable 1.1.0
Python library for fast multi-threaded data manipulation and munging.
15 versions - Latest release: about 2 years ago - 18 dependent packages - 78 dependent repositories - 43.9 thousand downloads last month - 1,879 stars on GitHub - 1 maintainer
featransform 1.6.65
Featransform is an automated feature engineering framework for supervised machine learning
15 versions - Latest release: 3 days ago - 322 downloads last month - 4 stars on GitHub - 1 maintainer
narrator 0.0.0.4
A set of functions that process and create descriptive summary visualizations to help develop a b...
3 versions - Latest release: over 6 years ago - 1 dependent repositories - 30 downloads last month - 0 stars on GitHub - 1 maintainer
gavicore 0.0.8
Pydantic data models and common utilities for other Eozilla packages
6 versions - Latest release: 3 months ago - 60 downloads last month - 3 stars on GitHub - 1 maintainer
mirobody 1.0.3
Mirobody is a Python package for processing and analyzing health data.
1 version - Latest release: 14 days ago
llm-datagen 1.0.0
极简高性能流式数据加工库
1 version - Latest release: 17 days ago
datalab-kernel 0.2.10
A standalone Xeus-Python-based Jupyter kernel for DataLab with optional live synchronization
12 versions - Latest release: 3 days ago - 1 maintainer
keprep 0.2.2
Minimally preprocessing TheBase dMRI data.
9 versions - Latest release: over 1 year ago - 211 downloads last month - 1 stars on GitHub - 1 maintainer
ml-dataloader 0.9.0
dataloader for machelearning
24 versions - Latest release: over 3 years ago - 1 dependent repositories - 1.14 thousand downloads last month - 0 stars on GitHub - 1 maintainer
pi-folder-organizer 2.2.2
A Python package for cleaning up cluttered files and organizing them into respective folders.
6 versions - Latest release: over 1 year ago - 25 downloads last month - 1 maintainer
fast-writer 1.0.3
CLI Tools for writing data
2 versions - Latest release: 9 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
sreader 0.0.1
space-reader: Convert any file path into LLM-friendly inputs
1 version - Latest release: over 1 year ago - 23 downloads last month - 1 maintainer
irt-data-utils 0.0.1a2
Infrared Thermal Data Utils
3 versions - Latest release: over 2 years ago - 23 downloads last month - 1 maintainer
Top 9.0% on pypi.org
csv-detective 0.10.12674
Detect tabular files column content
196 versions - Latest release: 29 days ago - 2 dependent packages - 3 dependent repositories - 2.21 thousand downloads last month - 48 stars on GitHub - 1 maintainer
checkpointer 2.14.10
checkpointer adds code-aware caching to Python functions, maintaining correctness and speeding up...
50 versions - Latest release: 3 months ago - 2 dependent repositories - 950 downloads last month - 6 stars on GitHub - 1 maintainer
bolster 0.4.0
Bolster's Brain, you've been warned
9 versions - Latest release: about 2 months ago - 1 dependent repositories - 122 downloads last month - 2 stars on GitHub - 1 maintainer
chunklet-py 2.1.1
Advanced text, code, and document chunking for LLM applications. Split content semantically, visu...
7 versions - Latest release: about 2 months ago - 263 downloads last month - 36 stars on GitHub - 1 maintainer
constelation-astronomer 1.1.4
constelation-astronomer: results processing package for CONSTELATION coupled model
1 version - Latest release: almost 2 years ago - 15 downloads last month - 0 stars on GitHub - 1 maintainer
space-packet-parser 6.1.0
A CCSDS telemetry packet decoding library based on the XTCE packet format description standard.
26 versions - Latest release: 22 days ago - 6.19 thousand downloads last month - 31 stars on GitHub - 1 maintainer
tensorneko-tool 0.3.22
The CLI Tools for Library TensorNeko.
9 versions - Latest release: 3 months ago - 46 downloads last month - 11 stars on GitHub - 1 maintainer
python-datatable 1.1.3
Python library for fast multi-threaded data manipulation and munging.
4 versions - Latest release: about 3 years ago - 3 dependent packages - 428 downloads last month - 1,878 stars on GitHub - 1 maintainer
fibphoflow 0.1.8
Python package to process and visualize TDT fiber photometry data
3 versions - Latest release: over 2 years ago - 15 downloads last month - 1 maintainer
geniusrise-databases 0.1.4
listeners bolts for geniusrise
4 versions - Latest release: over 2 years ago - 26 downloads last month - 2 stars on GitHub - 1 maintainer
scmcallib 0.5.1
Perform calibration for simple climate models
20 versions - Latest release: almost 6 years ago - 1 dependent repositories - 170 downloads last month - 0 stars on gitlab.com - 2 maintainers
llmbuilder 2.0.0
A comprehensive toolkit for building, training, and deploying language models
10 versions - Latest release: 3 months ago - 49 downloads last month - 0 stars on GitHub - 1 maintainer
geniusrise 0.1.7
An LLM framework
50 versions - Latest release: almost 2 years ago - 9 dependent packages - 1 dependent repositories - 154 downloads last month - 60 stars on GitHub - 1 maintainer
litdata 0.2.59
The Deep Learning framework to train, deploy, and ship AI products Lightning fast.
65 versions - Latest release: 2 months ago - 2 dependent packages - 188 thousand downloads last month - 544 stars on GitHub - 2 maintainers
invoice-generator-pdf 1.0.0
A Python package designed to effortlessly convert Excel-based invoices into professionally format...
1 version - Latest release: over 1 year ago - 9 downloads last month - 1 maintainer
arus 1.1.22
Activity Recognition with Ubiquitous Sensing
59 versions - Latest release: about 5 years ago - 1 dependent repositories - 335 downloads last month - 0 stars on GitHub - 1 maintainer
saqc 2.7.0
A timeseries data quality control and processing tool/framework
21 versions - Latest release: 5 months ago - 1 dependent repositories - 956 downloads last month - 8 stars on git.ufz.de - 2 maintainers
data-preprocessing-library-sevvalcucuk-asudesozcu 1.1.7
A comprehensive toolkit for data processing including handling dates, encoding categorical variab...
2 versions - Latest release: over 1 year ago - 18 downloads last month - 0 stars on GitHub - 2 maintainers
conveyor-streaming 1.2.1
A Python library for streamlining asynchronous streaming tasks and pipelines.
5 versions - Latest release: 6 months ago - 52 downloads last month - 2 stars on GitHub - 1 maintainer
crystflow 0.0.1
Name reservation for WIP package
1 version - Latest release: about 2 months ago - 29 downloads last month - 1 maintainer
chunklet 1.4.0
A smart multilingual text chunker for LLMs, RAG, and beyond.
19 versions - Latest release: 6 months ago - 162 downloads last month - 23 stars on GitHub - 1 maintainer
scalary 0.1.3
Collection of practical tools for working with image data
3 versions - Latest release: over 5 years ago - 1 dependent repositories - 12 downloads last month - 0 stars on GitHub - 1 maintainer
adaptivebridge 1.1.0
Revolutionizing ML adaptive modelling for handling missing features and data. The model can predi...
5 versions - Latest release: about 2 years ago - 66 downloads last month - 1 stars on GitHub - 1 maintainer
rezolve-ai-ingestion 0.1.4
A private package for ingesting and processing SharePoint data with AI capabilities
4 versions - Latest release: over 1 year ago - 43 downloads last month - 6,176 stars on GitHub - 1 maintainer
urlcounter 0.0.3
A set of functions that tally URLs within an event-based corpus. It assumes that you have data di...
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 32 downloads last month - 0 stars on GitHub - 1 maintainer
hfutils 0.13.0
Useful utilities for huggingface
51 versions - Latest release: about 2 months ago - 1 dependent package - 1.05 million downloads last month - 21 stars on GitHub - 1 maintainer
geniusrise-listeners 0.1.7
listeners bolts for geniusrise
7 versions - Latest release: over 2 years ago - 51 downloads last month - 2 stars on GitHub - 1 maintainer
ersatz 1.0.0
Simple sentence segmentation toolkit for segmenting and scoring
2 versions - Latest release: over 4 years ago - 1 dependent repositories - 149 downloads last month - 34 stars on GitHub - 2 maintainers
sportradar-unofficial 0.1.15
An unofficial python package to access sportradar NFL APIs.
13 versions - Latest release: almost 2 years ago - 128 downloads last month - 0 stars on GitHub - 1 maintainer
hll 2.4.0
Fast HyperLogLog for Python
20 versions - Latest release: 6 months ago - 17.8 thousand downloads last month - 109 stars on GitHub - 1 maintainer
informatics 0.0.1
Framework of fast implementation data processing and operating pipelines
6 versions - Latest release: over 2 years ago - 1 dependent repositories - 222 downloads last month - 583 stars on GitHub - 1 maintainer
bertrand 0.0.1
(in development) Type-safe language bindings for Python/C++
1 version - Latest release: over 1 year ago - 25 downloads last month - 2 stars on GitHub - 1 maintainer
eseas 1.0.4
eseas is a Python package that serves as a wrapper for the jwsacruncher Java package. This tool a...
14 versions - Latest release: 11 months ago - 118 downloads last month - 1 stars on GitHub - 1 maintainer
csvuniondiff 0.0.0.dev1
A package for comparing CSV-like files through union and difference operations.
2 versions - Latest release: over 1 year ago - 27 downloads last month - 7 stars on GitHub - 1 maintainer
cuery 0.33.2
Prompt (cue) management and execution for tabular data.
78 versions - Latest release: about 2 months ago - 984 downloads last month - 1 stars on GitHub - 1 maintainer
outputty 0.3.2
Import, filter and export tabular data with Python easily
6 versions - Latest release: almost 13 years ago - 3 dependent repositories - 62 downloads last month - 36 stars on GitHub - 1 maintainer
acfortformat 0.1.3
Python library for reading and writing data in Fortran-style and native Python formats.
1 version - Latest release: 7 months ago - 20 downloads last month - 0 stars on GitHub - 1 maintainer
lidirl 0.0.1
LID toolkit to improve performance on spontaneous noisy text with data augmentation.
1 version - Latest release: almost 3 years ago - 24 downloads last month - 0 stars on GitHub - 1 maintainer
imaxt-mosaic 1.15.0
Image stitching
20 versions - Latest release: about 3 years ago - 155 downloads last month - 0 stars on gitlab.developers.cam.ac.uk - 1 maintainer
hysteresis 2.0.5
Hysteresis data processing tools.
25 versions - Latest release: over 1 year ago - 1 dependent repositories - 1.06 thousand downloads last month - 62 stars on GitHub - 1 maintainer
datacleanerpro 0.1.1
A simple and efficient data cleaning library for CSV files
1 version - Latest release: 3 months ago - 1 maintainer
framemerge 1.1.0
Lightweight tool to merge crystallographic frames
3 versions - Latest release: 2 months ago - 38 downloads last month - 1 maintainer
analytics_tasks 0.1.0
Automation including file search and slide deck preparation.
1 version - Latest release: 8 months ago - 17 downloads last month - 0 stars on GitHub - 1 maintainer
py-simple-flow 2020.8.23
Simple data processing (ETL) library with support for multi-processing
10 versions - Latest release: over 5 years ago - 3 dependent repositories - 80 downloads last month - 1 maintainer
augmently 1.0.9
A library for Data Augmentation of images in computer vision.
6 versions - Latest release: over 6 years ago - 58 downloads last month - 4 stars on GitHub - 1 maintainer
datasponge-monitoring 0.0.1
A real-time data processing pipeline
1 version - Latest release: over 1 year ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
opencf 0.3.3
A collection of Python scripts for file conversion tasks, built on top of the opencf-core framework.
7 versions - Latest release: over 1 year ago - 46 downloads last month - 0 stars on GitHub - 1 maintainer
adpa 1.5.0
Advanced Data Processing and Analytics Framework
3 versions - Latest release: 12 months ago - 34 downloads last month - 1 maintainer
arekit-ss 0.25.0
Low Resource Context Relation Sampler for contexts with relations for fact-checking and fine-tuni...
3 versions - Latest release: about 1 year ago - 42 downloads last month - 4 stars on GitHub - 1 maintainer
prosto 0.6.0
Data processing toolkit radically changing the way data is processed
5 versions - Latest release: about 4 years ago - 1 dependent repositories - 26 downloads last month - 91 stars on GitHub - 1 maintainer
datasponge-core 0.0.6
A real-time data processing pipeline
5 versions - Latest release: over 1 year ago - 23 downloads last month - 2 stars on GitHub - 1 maintainer
datasetplus 0.6.0
An enhanced wrapper for Hugging Face datasets with additional functionality
12 versions - Latest release: 5 months ago - 78 downloads last month - 1 stars on GitHub - 1 maintainer
blossom-data 0.5.0
A simple way to synthesize LLM training data.
6 versions - Latest release: 2 months ago - 41 downloads last month - 25 stars on GitHub - 1 maintainer
strahlenexposition-uba 1.0.0
Package for importing, processing and visualising radition exposure data
1 version - Latest release: 9 months ago - 10 downloads last month - 1 maintainer
geniusrise-audio 0.1.12
audio bolts for geniusrise
13 versions - Latest release: almost 2 years ago - 69 downloads last month - 2 stars on GitHub - 1 maintainer
eozilla 0.0.8
Comprises all packages of the Eozilla suite
6 versions - Latest release: 3 months ago - 45 downloads last month - 0 stars on GitHub - 1 maintainer
pyees 2.4.8
EES but for python. Pyees can be used do perform uncertanty (error) propagation. Furthermore, it ...
153 versions - Latest release: about 1 month ago - 1 dependent repositories - 658 downloads last month - 1 stars on GitHub - 1 maintainer
pycfs 0.2.0
Python library for automating and data handling tasks for openCFS.
20 versions - Latest release: 3 months ago - 331 downloads last month - 4 stars on gitlab.com - 1 maintainer
proadv 2.1.5
Process Acoustic Doppler Velocimeter data with advanced despiking and analysis tools
6 versions - Latest release: over 1 year ago - 25 downloads last month - 11 stars on GitHub - 1 maintainer
rlylutils 0.1.23
General file and data processing tools
15 versions - Latest release: almost 3 years ago - 19 downloads last month - 1 stars on GitHub - 1 maintainer
csv2mne 0.0.1
Data formater
2 versions - Latest release: about 3 years ago - 9 downloads last month - 1 maintainer
pipd 0.2.2
Utility functions for python data pipelines.
20 versions - Latest release: over 2 years ago - 272 downloads last month - 15 stars on GitHub - 1 maintainer
dagstd 0.1.3
Dagstd
4 versions - Latest release: over 3 years ago - 148 downloads last month - 2 stars on GitHub - 1 maintainer
image-features-extract 0.4.19
toolbox for extracting features from an image
10 versions - Latest release: about 5 years ago - 1 dependent repositories - 33 downloads last month - 1 maintainer
imsciences 1.0.9
IMS Data Processing Package
169 versions - Latest release: about 1 month ago - 769 downloads last month - 5 maintainers
ds11mltoolkit 1.9
Helper functions for all stages of the machine learning model building process
8 versions - Latest release: almost 3 years ago - 44 downloads last month - 3 stars on GitHub - 2 maintainers
procodile 0.0.8
A light-weight processor development framework
5 versions - Latest release: 3 months ago - 177 downloads last month - 0 stars on GitHub - 1 maintainer
wraptile 0.0.8
FastAPI server that implements the OGC API - Processes
5 versions - Latest release: 3 months ago - 38 downloads last month - 0 stars on GitHub - 1 maintainer
cross_ml 2.0.1
⚠️ DEPRECATED: Please use BeaverFE instead (https://pypi.org/project/beaverfe/)
9 versions - Latest release: 7 months ago - 38 downloads last month - 0 stars on GitHub - 1 maintainer
cuiman 0.0.8
Provides a client Python API, GUI, and CLI for servers compliant with OGC API - Processes
5 versions - Latest release: 3 months ago - 34 downloads last month - 0 stars on GitHub - 1 maintainer
meowmotion 0.1.2
Mobile phone GPS data processor for trip generation and travel mode detection
3 versions - Latest release: 9 months ago - 13 downloads last month - 0 stars on GitHub - 1 maintainer
rivusio 0.2.0
A type-safe, async-first data processing pipeline framework
2 versions - Latest release: about 1 year ago - 12 downloads last month - 1 stars on GitHub - 1 maintainer
arus-stream-metawear 1.0.4
arus plugin that helps creating stream for metawear devices
2 versions - Latest release: over 6 years ago - 1 dependent repositories - 24 downloads last month - 1 stars on GitHub - 1 maintainer
textmining-module 2.1.2
A Python Module for Comprehensive Text Mining, including Keyword Extraction and Text Analysis.
7 versions - Latest release: about 1 year ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
dawgsml 0.0.3
A simple library for machine learning without a requirements.txt
2 versions - Latest release: almost 2 years ago - 28 downloads last month - 1 stars on GitHub - 1 maintainer
mgd-outliers 0.1.4
MGD_Outliers is a Pypyththon package for identifying and visualizing outliers in a DataFrame.
5 versions - Latest release: almost 3 years ago - 32 downloads last month - 1 maintainer
vre-eoles 0.2.1
toolbox for computing charge factor used in EOLES model
9 versions - Latest release: about 5 years ago - 1 dependent repositories - 41 downloads last month - 0 stars on GitHub - 1 maintainer
binaryrain-helper-data-processing 0.1.2
Aims to simplify and help with commonly used functions in the data processing areas.
14 versions - Latest release: 4 months ago - 73 downloads last month - 0 stars on GitHub - 3 maintainers
quiffen 4.0.1
Quiffen
30 versions - Latest release: about 1 month ago - 1 dependent repositories - 409 downloads last month - 38 stars on GitHub - 1 maintainer
opencf-core 0.3.4
A robust framework for handling file conversion tasks in Python
12 versions - Latest release: about 1 year ago - 1 dependent package - 56 downloads last month - 0 stars on GitHub - 1 maintainer
geniusrise-vision 0.1.5
Huggingface bolts for geniusrise
8 versions - Latest release: almost 2 years ago - 45 downloads last month - 7 stars on GitHub - 1 maintainer
biomdp 0.8.0
Usefull set of functions for analyzing time series records, particularly for biomechanical data
20 versions - Latest release: 3 months ago - 168 downloads last month - 1 stars on GitHub - 1 maintainer
nullaxe 0.4.2
A data cleaning library for Pandas and Polars DataFrames with a simple, chainable API.
3 versions - Latest release: 5 months ago - 11 downloads last month - 2 stars on GitHub - 1 maintainer