An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data processing" keyword

View the packages on the pypi.org package registry that are tagged with the "data processing" keyword.

tensorneko 0.3.21
Tensor Neural Engine Kompanion. A util library based on PyTorch and PyTorch Lightning.
91 versions - Latest release: 9 months ago - 1 dependent repositories - 263 downloads last month - 11 stars on GitHub - 1 maintainer
vre-eoles 0.2.1
toolbox for computing charge factor used in EOLES model
9 versions - Latest release: over 4 years ago - 1 dependent repositories - 19 downloads last month - 0 stars on GitHub - 1 maintainer
python-datatable 1.1.3
Python library for fast multi-threaded data manipulation and munging.
4 versions - Latest release: over 2 years ago - 3 dependent packages - 193 downloads last month - 1,871 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
datatable 1.1.0
Python library for fast multi-threaded data manipulation and munging.
15 versions - Latest release: almost 2 years ago - 18 dependent packages - 78 dependent repositories - 39.7 thousand downloads last month - 1,871 stars on GitHub - 3 maintainers
fast-writer 1.0.3
CLI Tools for writing data
2 versions - Latest release: 3 months ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
informatics 0.0.1
Framework of fast implementation data processing and operating pipelines
6 versions - Latest release: almost 2 years ago - 1 dependent repositories - 133 downloads last month - 582 stars on GitHub - 1 maintainer
saqc 2.6.0
A timeseries data quality control and processing tool/framework
16 versions - Latest release: over 1 year ago - 1 dependent repositories - 1.18 thousand downloads last month - 8 stars on git.ufz.de - 2 maintainers
pyees 2.3.2
EES but for Python. Pyees can be used to perform uncertainty (error) propagation. Furthermore, it ...
137 versions - Latest release: about 1 month ago - 1 dependent repositories - 344 downloads last month - 1 stars on GitHub - 1 maintainer
imsciences 1.0.6
IMS Data Processing Package
168 versions - Latest release: about 1 month ago - 514 downloads last month - 5 maintainers
litdata 0.2.52
The Deep Learning framework to train, deploy, and ship AI products Lightning fast.
58 versions - Latest release: 26 days ago - 2 dependent packages - 156 thousand downloads last month - 536 stars on GitHub - 2 maintainers
joinem 0.10.0
CLI for fast, flexible concatenation of tabular data using Polars.
19 versions - Latest release: 4 months ago - 5.8 thousand downloads last month - 16 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
csv-detective 0.9.2
Detect tabular files column content
142 versions - Latest release: 12 days ago - 2 dependent packages - 3 dependent repositories - 1.2 thousand downloads last month - 48 stars on GitHub - 1 maintainer
atlantic 1.1.80
Atlantic: Automated Preprocessing Framework for Supervised Machine Learning
46 versions - Latest release: 8 months ago - 2 dependent packages - 154 downloads last month - 29 stars on GitHub - 1 maintainer
cuery 0.19.0
Prompt (cue) management and execution for tabular data.
51 versions - Latest release: 15 days ago - 2.23 thousand downloads last month - 1 stars on GitHub - 1 maintainer
bolster 0.3.4
Bolster's Brain, you've been warned
8 versions - Latest release: 5 months ago - 1 dependent repositories - 61 downloads last month - 3 stars on GitHub - 1 maintainer
textmining-module 2.1.2
A Python Module for Comprehensive Text Mining, including Keyword Extraction and Text Analysis.
7 versions - Latest release: 9 months ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
space-packet-parser 5.0.1
A CCSDS telemetry packet decoding library based on the XTCE packet format description standard.
23 versions - Latest release: 11 months ago - 8.76 thousand downloads last month - 31 stars on GitHub - 1 maintainer
giga-spatial 0.6.9 💰
A package for spatial data download & processing
10 versions - Latest release: about 1 month ago - 67 downloads last month - 16 stars on GitHub - 1 maintainer
data-preprocessing-library-sevvalcucuk-asudesozcu 1.1.7
A comprehensive toolkit for data processing including handling dates, encoding categorical variab...
2 versions - Latest release: over 1 year ago - 13 downloads last month - 0 stars on GitHub - 2 maintainers
geniusrise-text 0.1.13
Text bolts for geniusrise
14 versions - Latest release: over 1 year ago - 45 downloads last month - 5 stars on GitHub - 1 maintainer
dagstd 0.1.3
Dagstd
4 versions - Latest release: about 3 years ago - 84 downloads last month - 2 stars on GitHub - 1 maintainer
quiffen 4.0.0
Quiffen
29 versions - Latest release: 14 days ago - 1 dependent repositories - 434 downloads last month - 38 stars on GitHub - 1 maintainer
bertrand 0.0.1
(in development) Type-safe language bindings for Python/C++
1 version - Latest release: over 1 year ago - 24 downloads last month - 2 stars on GitHub - 1 maintainer
netcdf-scm 2.1.1
Processing netCDF files for use with simple climate models
43 versions - Latest release: about 4 years ago - 1 dependent repositories - 3.32 thousand downloads last month - 6 stars on GitHub - 2 maintainers
datasetplus 0.6.0
An enhanced wrapper for Hugging Face datasets with additional functionality
12 versions - Latest release: 4 days ago - 684 downloads last month - 1 stars on GitHub - 1 maintainer
pysetl 1.2.1
A PySpark ETL Framework
14 versions - Latest release: about 1 month ago - 55 downloads last month - 5 stars on GitHub - 1 maintainer
urlcounter 0.0.3
A set of functions that tally URLs within an event-based corpus. It assumes that you have data di...
2 versions - Latest release: about 5 years ago - 1 dependent repositories - 9 downloads last month - 0 stars on GitHub - 1 maintainer
analysts-tool-share 0.0.1
Tools for analyzing data, using Python.
1 version - Latest release: over 5 years ago - 1 dependent repositories - 26 downloads last month - 0 stars on GitHub - 1 maintainer
analytics_tasks 0.1.0
Automation including file search and slide deck preparation.
1 version - Latest release: 2 months ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
mapminer 0.1.59
An advanced geospatial data extraction and processing toolkit for Earth observation datasets.
58 versions - Latest release: 5 days ago - 799 downloads last month - 43 stars on GitHub - 1 maintainer
nttc 0.6.1
A set of functions that process and create topic models from a sample of community-detected Twitt...
52 versions - Latest release: over 4 years ago - 1 dependent repositories - 53 downloads last month - 3 stars on GitHub - 1 maintainer
pycfs 0.1.9
Python library for automating and data handling tasks for openCFS.
19 versions - Latest release: 6 days ago - 104 downloads last month - 4 stars on gitlab.com - 1 maintainer
datasponge-monitoring 0.0.1
A real-time data processing pipeline
1 version - Latest release: 11 months ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
adpa 1.5.0
Advanced Data Processing and Analytics Framework
3 versions - Latest release: 7 months ago - 40 downloads last month - 1 maintainer
logicsponge 0.0.9
A real-time data processing pipeline
1 version - Latest release: 11 months ago - 17 downloads last month - 3 stars on GitHub - 3 maintainers
imaxt-mosaic 1.15.0
Image stitching
20 versions - Latest release: over 2 years ago - 30 downloads last month - 0 stars on gitlab.developers.cam.ac.uk - 1 maintainer
arus-stream-metawear 1.0.4
arus plugin that helps creating stream for metawear devices
2 versions - Latest release: almost 6 years ago - 1 dependent repositories - 34 downloads last month - 1 stars on GitHub - 1 maintainer
hysteresis 2.0.5
Hysteresis data processing tools.
25 versions - Latest release: 11 months ago - 1 dependent repositories - 152 downloads last month - 62 stars on GitHub - 1 maintainer
rezolve-ai-ingestion 0.1.4
A private package for ingesting and processing SharePoint data with AI capabilities
4 versions - Latest release: 12 months ago - 17 downloads last month - 6,176 stars on GitHub - 1 maintainer
constelation-astronomer 1.1.4
constelation-astronomer: results processing package for CONSTELATION coupled model
1 version - Latest release: over 1 year ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
tensorneko-tool 0.3.21
The CLI Tools for Library TensorNeko.
8 versions - Latest release: 9 months ago - 34 downloads last month - 11 stars on GitHub - 1 maintainer
ds11mltoolkit 1.9
Helper functions for all stages of the machine learning model building process
8 versions - Latest release: over 2 years ago - 13 downloads last month - 3 stars on GitHub - 2 maintainers
damast 0.1.12
Package to improve the development of transparent, replicable data processing pipelines
13 versions - Latest release: 12 days ago - 379 downloads last month - 1 stars on GitHub - 1 maintainer
arachnea 0.0.5
A Python library for efficient array operations using a fluent API.
4 versions - Latest release: about 1 year ago - 44 downloads last month - 1 maintainer
scmcallib 0.5.1
Perform calibration for simple climate models
20 versions - Latest release: over 5 years ago - 1 dependent repositories - 47 downloads last month - 0 stars on gitlab.com - 2 maintainers
logicsponge-monitoring 0.0.5
A real-time data processing pipeline
5 versions - Latest release: 5 months ago - 33 downloads last month - 0 stars on GitHub - 3 maintainers
invoice-generator-pdf 1.0.0
A Python package designed to effortlessly convert Excel-based invoices into professionally format...
1 version - Latest release: 10 months ago - 9 downloads last month - 1 maintainer
sas-to-polars 1.0.0
Convert SAS datasets (.sas7bdat files) to Polars DataFrames.
1 version - Latest release: 4 months ago - 25 downloads last month - 1 stars on GitHub - 1 maintainer
blossom-data 0.4.3
A simple way to synthesize LLM training data.
5 versions - Latest release: 2 months ago - 24 downloads last month - 25 stars on GitHub - 1 maintainer
eseas 1.0.4
eseas is a Python package that serves as a wrapper for the jwsacruncher Java package. This tool a...
14 versions - Latest release: 6 months ago - 61 downloads last month - 1 stars on GitHub - 1 maintainer
chunklet 1.4.0
A smart multilingual text chunker for LLMs, RAG, and beyond.
19 versions - Latest release: 10 days ago - 1.31 thousand downloads last month - 23 stars on GitHub - 1 maintainer
chunklet-py 1.4.0
A smart multilingual text chunker for LLMs, RAG, and beyond.
1 version - Latest release: 10 days ago - 23 stars on GitHub - 1 maintainer
binaryrain-helper-data-processing 0.0.12
Aims to simplify and help with commonly used functions in the data processing areas.
12 versions - Latest release: about 2 months ago - 86 downloads last month - 0 stars on GitHub - 3 maintainers
arus 1.1.22
Activity Recognition with Ubiquitous Sensing
59 versions - Latest release: over 4 years ago - 1 dependent repositories - 455 downloads last month - 0 stars on GitHub - 1 maintainer
adaptivebridge 1.1.0
Revolutionizing ML adaptive modelling for handling missing features and data. The model can predi...
5 versions - Latest release: over 1 year ago - 92 downloads last month - 1 stars on GitHub - 1 maintainer
memories-dev 2.0.8
Collective Memory Infrastructure for AGI
10 versions - Latest release: 4 months ago - 41 downloads last month - 8 stars on GitHub - 1 maintainer
biomdp 0.7.27
Useful set of functions for analyzing time series records, particularly for biomechanical data
19 versions - Latest release: 20 days ago - 292 downloads last month - 1 stars on GitHub - 1 maintainer
particle-pack-tools 0.0.4
A toolkit for processing and visualizing particle pack data.
4 versions - Latest release: 12 days ago - 306 downloads last month - 0 stars on GitHub - 1 maintainer
strahlenexposition-uba 1.0.0
Package for importing, processing and visualising radiation exposure data
1 version - Latest release: 4 months ago - 7 downloads last month - 1 maintainer
scalary 0.1.3
Collection of practical tools for working with image data
3 versions - Latest release: over 5 years ago - 1 dependent repositories - 9 downloads last month - 0 stars on GitHub - 1 maintainer
keprep 0.2.2
Minimally preprocessing TheBase dMRI data.
9 versions - Latest release: 12 months ago - 51 downloads last month - 1 stars on GitHub - 1 maintainer
pipd 0.2.2
Utility functions for python data pipelines.
20 versions - Latest release: about 2 years ago - 1.01 thousand downloads last month - 15 stars on GitHub - 1 maintainer
hfutils 0.11.1
Useful utilities for huggingface
47 versions - Latest release: 4 months ago - 1 dependent package - 416 thousand downloads last month - 21 stars on GitHub - 1 maintainer
uniform 0.2.2
Uniform - dress your form processing endpoints 📋
4 versions - Latest release: over 5 years ago - 6 dependent repositories - 254 downloads last month - 1 stars on GitLab.com - 1 maintainer
cross_ml 2.0.1
⚠️ DEPRECATED: Please use BeaverFE instead (https://pypi.org/project/beaverfe/)
9 versions - Latest release: about 2 months ago - 55 downloads last month - 0 stars on GitHub - 1 maintainer
csvuniondiff 0.0.0.dev1
A package for comparing CSV-like files through union and difference operations.
2 versions - Latest release: about 1 year ago - 15 downloads last month - 7 stars on GitHub - 1 maintainer
fibphoflow 0.1.8
Python package to process and visualize TDT fiber photometry data
3 versions - Latest release: over 2 years ago - 18 downloads last month - 1 maintainer
geniusrise-vision 0.1.5
Huggingface bolts for geniusrise
8 versions - Latest release: over 1 year ago - 55 downloads last month - 7 stars on GitHub - 1 maintainer
meowmotion 0.1.2
Mobile phone GPS data processor for trip generation and travel mode detection
3 versions - Latest release: 4 months ago - 24 downloads last month - 0 stars on GitHub - 1 maintainer
hll 2.3.0
Fast HyperLogLog for Python
19 versions - Latest release: 9 months ago - 18.1 thousand downloads last month - 109 stars on GitHub - 1 maintainer
datasponge 0.0.1
A real-time data processing pipeline
1 version - Latest release: 11 months ago - 11 downloads last month - 3 stars on GitHub - 1 maintainer
logicsponge-processmining 0.0.5
A real-time data processing pipeline
5 versions - Latest release: 3 months ago - 73 downloads last month - 1 stars on GitHub - 4 maintainers
py-simple-flow 2020.8.23
Simple data processing (ETL) library with support for multi-processing
10 versions - Latest release: about 5 years ago - 3 dependent repositories - 29 downloads last month - 1 maintainer
checkpointer 2.14.6
checkpointer adds code-aware caching to Python functions, maintaining correctness and speeding up...
46 versions - Latest release: about 2 months ago - 2 dependent repositories - 1.12 thousand downloads last month - 6 stars on GitHub - 1 maintainer
geniusrise-listeners 0.1.7
listeners bolts for geniusrise
7 versions - Latest release: almost 2 years ago - 48 downloads last month - 2 stars on GitHub - 1 maintainer
acfortformat 0.1.3
Python library for reading and writing data in Fortran-style and native Python formats.
1 version - Latest release: about 2 months ago - 0 stars on GitHub - 1 maintainer
fiiireflyyy 0.3.0
A python package covering miscellaneous uses, from system management to machine learning and imag...
35 versions - Latest release: 8 months ago - 1 dependent repositories - 58 downloads last month - 1 maintainer
geniusrise-databases 0.1.4
listeners bolts for geniusrise
4 versions - Latest release: almost 2 years ago - 34 downloads last month - 2 stars on GitHub - 1 maintainer
kmeans-tjdwill 1.0.4
A function-based implementation of k-means clustering that maintains data association.
5 versions - Latest release: about 1 year ago - 36 downloads last month - 0 stars on GitHub - 1 maintainer
opencf-core 0.3.4
A robust framework for handling file conversion tasks in Python
12 versions - Latest release: 9 months ago - 1 dependent package - 81 downloads last month - 0 stars on GitHub - 1 maintainer
lidirl 0.0.1
LID toolkit to improve performance on spontaneous noisy text with data augmentation.
1 version - Latest release: over 2 years ago - 26 downloads last month - 0 stars on GitHub - 1 maintainer
django-crunch 0.1.12
A data processing orchestration tool.
1 version - Latest release: over 2 years ago - 9 downloads last month - 3 stars on GitHub - 1 maintainer
rlylutils 0.1.23
General file and data processing tools
15 versions - Latest release: over 2 years ago - 11 downloads last month - 1 stars on GitHub - 1 maintainer
geniusrise-audio 0.1.12
audio bolts for geniusrise
13 versions - Latest release: over 1 year ago - 89 downloads last month - 2 stars on GitHub - 1 maintainer
abconnect 0.1.9
A set of tools for connecting and processing data for Annex Brands, featuring API pack and ship q...
8 versions - Latest release: 3 months ago - 311 downloads last month - 1 stars on GitHub - 1 maintainer
pi-folder-organizer 2.2.2
A Python package for cleaning up cluttered files and organizing them into respective folders.
6 versions - Latest release: about 1 year ago - 34 downloads last month - 1 maintainer
yalab-procedures 0.0.1
The yalab-procedures repository is dedicated to managing and executing data processing procedures...
1 version - Latest release: about 1 year ago - 8 downloads last month - 1 maintainer
ctxpro 0.0.5
Simple toolkit that extracts ambiguities in documents that require context to resolve.
5 versions - Latest release: over 1 year ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
sreader 0.0.1
space-reader: Convert any file path into LLM-friendly inputs
1 version - Latest release: over 1 year ago - 10 downloads last month - 1 maintainer
image-features-extract 0.4.19
toolbox for extracting features from an image
10 versions - Latest release: over 4 years ago - 1 dependent repositories - 38 downloads last month - 1 maintainer
featransform 0.9.21
Featransform is an automated feature engineering framework for Supervised Machine Learning
7 versions - Latest release: 11 months ago - 56 downloads last month - 4 stars on GitHub - 1 maintainer
prosto 0.6.0
Data processing toolkit radically changing the way data is processed
5 versions - Latest release: almost 4 years ago - 1 dependent repositories - 21 downloads last month - 91 stars on GitHub - 1 maintainer
outputty 0.3.2
Import, filter and export tabular data with Python easily
6 versions - Latest release: over 12 years ago - 3 dependent repositories - 17 downloads last month - 36 stars on GitHub - 1 maintainer
geniusrise 0.1.7
An LLM framework
50 versions - Latest release: over 1 year ago - 9 dependent packages - 1 dependent repositories - 319 downloads last month - 60 stars on GitHub - 1 maintainer
rivusio 0.2.0
A type-safe, async-first data processing pipeline framework
2 versions - Latest release: 7 months ago - 16 downloads last month - 1 stars on GitHub - 1 maintainer
ml-dataloader 0.9.0
dataloader for machine learning
24 versions - Latest release: almost 3 years ago - 1 dependent repositories - 28 downloads last month - 0 stars on GitHub - 1 maintainer
narrator 0.0.0.4
A set of functions that process and create descriptive summary visualizations to help develop a b...
3 versions - Latest release: almost 6 years ago - 1 dependent repositories - 46 downloads last month - 0 stars on GitHub - 1 maintainer
dawgsml 0.0.3
A simple library for machine learning without a requirements.txt
2 versions - Latest release: over 1 year ago - 13 downloads last month - 1 stars on GitHub - 1 maintainer
augmently 1.0.9
A library for Data Augmentation of images in computer vision.
6 versions - Latest release: almost 6 years ago - 26 downloads last month - 4 stars on GitHub - 1 maintainer
csv2mne 0.0.1
Data formatter
2 versions - Latest release: almost 3 years ago - 4 downloads last month - 1 maintainer