pypi.org "data processing" keyword
View the packages on the pypi.org package registry that are tagged with the "data processing" keyword.
netcdf-scm 2.1.1
Processing netCDF files for use with simple climate models
43 versions - Latest release: over 4 years ago - 1 dependent repository - 2.33 thousand downloads last month - 6 stars on GitHub - 2 maintainers
giga-spatial 0.8.0 💰
A package for spatial data download & processing
18 versions - Latest release: 3 days ago - 537 downloads last month - 17 stars on GitHub - 1 maintainer
abconnect 0.2.1
A set of tools for connecting and processing data for Annex Brands, featuring API pack and ship q...
10 versions - Latest release: about 1 month ago - 499 downloads last month - 1 star on GitHub - 1 maintainer
yalab-procedures 0.0.1
The yalab-procedures repository is dedicated to managing and executing data processing procedures...
1 version - Latest release: over 1 year ago - 10 downloads last month - 1 maintainer
datatable 1.1.0
Python library for fast multi-threaded data manipulation and munging.
Top 1.9% on pypi.org - 15 versions - Latest release: about 2 years ago - 18 dependent packages - 78 dependent repositories - 43.9 thousand downloads last month - 1,879 stars on GitHub - 1 maintainer
featransform 1.6.65
Featransform is an automated feature engineering framework for supervised machine learning
15 versions - Latest release: 3 days ago - 322 downloads last month - 4 stars on GitHub - 1 maintainer
narrator 0.0.0.4
A set of functions that process and create descriptive summary visualizations to help develop a b...
3 versions - Latest release: over 6 years ago - 1 dependent repository - 30 downloads last month - 0 stars on GitHub - 1 maintainer
gavicore 0.0.8
Pydantic data models and common utilities for other Eozilla packages
6 versions - Latest release: 3 months ago - 60 downloads last month - 3 stars on GitHub - 1 maintainer
mirobody 1.0.3
Mirobody is a Python package for processing and analyzing health data.
1 version - Latest release: 14 days ago
datalab-kernel 0.2.10
A standalone Xeus-Python-based Jupyter kernel for DataLab with optional live synchronization
12 versions - Latest release: 3 days ago - 1 maintainer
keprep 0.2.2
Minimally preprocessing TheBase dMRI data.
9 versions - Latest release: over 1 year ago - 211 downloads last month - 1 star on GitHub - 1 maintainer
ml-dataloader 0.9.0
Dataloader for machine learning
24 versions - Latest release: over 3 years ago - 1 dependent repository - 1.14 thousand downloads last month - 0 stars on GitHub - 1 maintainer
pi-folder-organizer 2.2.2
A Python package for cleaning up cluttered files and organizing them into respective folders.
6 versions - Latest release: over 1 year ago - 25 downloads last month - 1 maintainer
fast-writer 1.0.3
CLI Tools for writing data
2 versions - Latest release: 9 months ago - 21 downloads last month - 0 stars on GitHub - 1 maintainer
sreader 0.0.1
space-reader: Convert any file path into LLM-friendly inputs
1 version - Latest release: over 1 year ago - 23 downloads last month - 1 maintainer
irt-data-utils 0.0.1a2
Infrared Thermal Data Utils
3 versions - Latest release: over 2 years ago - 23 downloads last month - 1 maintainer
csv-detective 0.10.12674
Detect tabular files column content
Top 9.0% on pypi.org - 196 versions - Latest release: 29 days ago - 2 dependent packages - 3 dependent repositories - 2.21 thousand downloads last month - 48 stars on GitHub - 1 maintainer
checkpointer 2.14.10
checkpointer adds code-aware caching to Python functions, maintaining correctness and speeding up...
50 versions - Latest release: 3 months ago - 2 dependent repositories - 950 downloads last month - 6 stars on GitHub - 1 maintainer
bolster 0.4.0
Bolster's Brain, you've been warned
9 versions - Latest release: about 2 months ago - 1 dependent repository - 122 downloads last month - 2 stars on GitHub - 1 maintainer
chunklet-py 2.1.1
Advanced text, code, and document chunking for LLM applications. Split content semantically, visu...
7 versions - Latest release: about 2 months ago - 263 downloads last month - 36 stars on GitHub - 1 maintainer
constelation-astronomer 1.1.4
constelation-astronomer: results processing package for CONSTELATION coupled model
1 version - Latest release: almost 2 years ago - 15 downloads last month - 0 stars on GitHub - 1 maintainer
space-packet-parser 6.1.0
A CCSDS telemetry packet decoding library based on the XTCE packet format description standard.
26 versions - Latest release: 22 days ago - 6.19 thousand downloads last month - 31 stars on GitHub - 1 maintainer
tensorneko-tool 0.3.22
The CLI Tools for Library TensorNeko.
9 versions - Latest release: 3 months ago - 46 downloads last month - 11 stars on GitHub - 1 maintainer
python-datatable 1.1.3
Python library for fast multi-threaded data manipulation and munging.
4 versions - Latest release: about 3 years ago - 3 dependent packages - 428 downloads last month - 1,878 stars on GitHub - 1 maintainer
fibphoflow 0.1.8
Python package to process and visualize TDT fiber photometry data
3 versions - Latest release: over 2 years ago - 15 downloads last month - 1 maintainer
geniusrise-databases 0.1.4
listeners bolts for geniusrise
4 versions - Latest release: over 2 years ago - 26 downloads last month - 2 stars on GitHub - 1 maintainer
scmcallib 0.5.1
Perform calibration for simple climate models
20 versions - Latest release: almost 6 years ago - 1 dependent repository - 170 downloads last month - 0 stars on gitlab.com - 2 maintainers
llmbuilder 2.0.0
A comprehensive toolkit for building, training, and deploying language models
10 versions - Latest release: 3 months ago - 49 downloads last month - 0 stars on GitHub - 1 maintainer
geniusrise 0.1.7
An LLM framework
50 versions - Latest release: almost 2 years ago - 9 dependent packages - 1 dependent repository - 154 downloads last month - 60 stars on GitHub - 1 maintainer
litdata 0.2.59
The Deep Learning framework to train, deploy, and ship AI products Lightning fast.
65 versions - Latest release: 2 months ago - 2 dependent packages - 188 thousand downloads last month - 544 stars on GitHub - 2 maintainers
invoice-generator-pdf 1.0.0
A Python package designed to effortlessly convert Excel-based invoices into professionally format...
1 version - Latest release: over 1 year ago - 9 downloads last month - 1 maintainer
arus 1.1.22
Activity Recognition with Ubiquitous Sensing
59 versions - Latest release: about 5 years ago - 1 dependent repository - 335 downloads last month - 0 stars on GitHub - 1 maintainer
saqc 2.7.0
A timeseries data quality control and processing tool/framework
21 versions - Latest release: 5 months ago - 1 dependent repository - 956 downloads last month - 8 stars on git.ufz.de - 2 maintainers
data-preprocessing-library-sevvalcucuk-asudesozcu 1.1.7
A comprehensive toolkit for data processing including handling dates, encoding categorical variab...
2 versions - Latest release: over 1 year ago - 18 downloads last month - 0 stars on GitHub - 2 maintainers
conveyor-streaming 1.2.1
A Python library for streamlining asynchronous streaming tasks and pipelines.
5 versions - Latest release: 6 months ago - 52 downloads last month - 2 stars on GitHub - 1 maintainer
crystflow 0.0.1
Name reservation for WIP package
1 version - Latest release: about 2 months ago - 29 downloads last month - 1 maintainer
chunklet 1.4.0
A smart multilingual text chunker for LLMs, RAG, and beyond.
19 versions - Latest release: 6 months ago - 162 downloads last month - 23 stars on GitHub - 1 maintainer
scalary 0.1.3
Collection of practical tools for working with image data
3 versions - Latest release: over 5 years ago - 1 dependent repository - 12 downloads last month - 0 stars on GitHub - 1 maintainer
adaptivebridge 1.1.0
Revolutionizing ML adaptive modelling for handling missing features and data. The model can predi...
5 versions - Latest release: about 2 years ago - 66 downloads last month - 1 star on GitHub - 1 maintainer
rezolve-ai-ingestion 0.1.4
A private package for ingesting and processing SharePoint data with AI capabilities
4 versions - Latest release: over 1 year ago - 43 downloads last month - 6,176 stars on GitHub - 1 maintainer
urlcounter 0.0.3
A set of functions that tally URLs within an event-based corpus. It assumes that you have data di...
2 versions - Latest release: over 5 years ago - 1 dependent repository - 32 downloads last month - 0 stars on GitHub - 1 maintainer
hfutils 0.13.0
Useful utilities for huggingface
51 versions - Latest release: about 2 months ago - 1 dependent package - 1.05 million downloads last month - 21 stars on GitHub - 1 maintainer
geniusrise-listeners 0.1.7
listeners bolts for geniusrise
7 versions - Latest release: over 2 years ago - 51 downloads last month - 2 stars on GitHub - 1 maintainer
ersatz 1.0.0
Simple sentence segmentation toolkit for segmenting and scoring
2 versions - Latest release: over 4 years ago - 1 dependent repository - 149 downloads last month - 34 stars on GitHub - 2 maintainers
sportradar-unofficial 0.1.15
An unofficial python package to access sportradar NFL APIs.
13 versions - Latest release: almost 2 years ago - 128 downloads last month - 0 stars on GitHub - 1 maintainer
hll 2.4.0
Fast HyperLogLog for Python
20 versions - Latest release: 6 months ago - 17.8 thousand downloads last month - 109 stars on GitHub - 1 maintainer
informatics 0.0.1
Framework of fast implementation data processing and operating pipelines
6 versions - Latest release: over 2 years ago - 1 dependent repository - 222 downloads last month - 583 stars on GitHub - 1 maintainer
bertrand 0.0.1
(in development) Type-safe language bindings for Python/C++
1 version - Latest release: over 1 year ago - 25 downloads last month - 2 stars on GitHub - 1 maintainer
eseas 1.0.4
eseas is a Python package that serves as a wrapper for the jwsacruncher Java package. This tool a...
14 versions - Latest release: 11 months ago - 118 downloads last month - 1 star on GitHub - 1 maintainer
csvuniondiff 0.0.0.dev1
A package for comparing CSV-like files through union and difference operations.
2 versions - Latest release: over 1 year ago - 27 downloads last month - 7 stars on GitHub - 1 maintainer
cuery 0.33.2
Prompt (cue) management and execution for tabular data.
78 versions - Latest release: about 2 months ago - 984 downloads last month - 1 star on GitHub - 1 maintainer
outputty 0.3.2
Import, filter and export tabular data with Python easily
6 versions - Latest release: almost 13 years ago - 3 dependent repositories - 62 downloads last month - 36 stars on GitHub - 1 maintainer
acfortformat 0.1.3
Python library for reading and writing data in Fortran-style and native Python formats.
1 version - Latest release: 7 months ago - 20 downloads last month - 0 stars on GitHub - 1 maintainer
lidirl 0.0.1
LID toolkit to improve performance on spontaneous noisy text with data augmentation.
1 version - Latest release: almost 3 years ago - 24 downloads last month - 0 stars on GitHub - 1 maintainer
imaxt-mosaic 1.15.0
Image stitching
20 versions - Latest release: about 3 years ago - 155 downloads last month - 0 stars on gitlab.developers.cam.ac.uk - 1 maintainer
hysteresis 2.0.5
Hysteresis data processing tools.
25 versions - Latest release: over 1 year ago - 1 dependent repository - 1.06 thousand downloads last month - 62 stars on GitHub - 1 maintainer
datacleanerpro 0.1.1
A simple and efficient data cleaning library for CSV files
1 version - Latest release: 3 months ago - 1 maintainer
framemerge 1.1.0
Lightweight tool to merge crystallographic frames
3 versions - Latest release: 2 months ago - 38 downloads last month - 1 maintainer
analytics_tasks 0.1.0
Automation including file search and slide deck preparation.
1 version - Latest release: 8 months ago - 17 downloads last month - 0 stars on GitHub - 1 maintainer
py-simple-flow 2020.8.23
Simple data processing (ETL) library with support for multi-processing
10 versions - Latest release: over 5 years ago - 3 dependent repositories - 80 downloads last month - 1 maintainer
augmently 1.0.9
A library for Data Augmentation of images in computer vision.
6 versions - Latest release: over 6 years ago - 58 downloads last month - 4 stars on GitHub - 1 maintainer
datasponge-monitoring 0.0.1
A real-time data processing pipeline
1 version - Latest release: over 1 year ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
opencf 0.3.3
A collection of Python scripts for file conversion tasks, built on top of the opencf-core framework.
7 versions - Latest release: over 1 year ago - 46 downloads last month - 0 stars on GitHub - 1 maintainer
adpa 1.5.0
Advanced Data Processing and Analytics Framework
3 versions - Latest release: 12 months ago - 34 downloads last month - 1 maintainer
arekit-ss 0.25.0
Low Resource Context Relation Sampler for contexts with relations for fact-checking and fine-tuni...
3 versions - Latest release: about 1 year ago - 42 downloads last month - 4 stars on GitHub - 1 maintainer
prosto 0.6.0
Data processing toolkit radically changing the way data is processed
5 versions - Latest release: about 4 years ago - 1 dependent repository - 26 downloads last month - 91 stars on GitHub - 1 maintainer
datasponge-core 0.0.6
A real-time data processing pipeline
5 versions - Latest release: over 1 year ago - 23 downloads last month - 2 stars on GitHub - 1 maintainer
datasetplus 0.6.0
An enhanced wrapper for Hugging Face datasets with additional functionality
12 versions - Latest release: 5 months ago - 78 downloads last month - 1 star on GitHub - 1 maintainer
blossom-data 0.5.0
A simple way to synthesize LLM training data.
6 versions - Latest release: 2 months ago - 41 downloads last month - 25 stars on GitHub - 1 maintainer
strahlenexposition-uba 1.0.0
Package for importing, processing and visualising radiation exposure data
1 version - Latest release: 9 months ago - 10 downloads last month - 1 maintainer
geniusrise-audio 0.1.12
audio bolts for geniusrise
13 versions - Latest release: almost 2 years ago - 69 downloads last month - 2 stars on GitHub - 1 maintainer
eozilla 0.0.8
Comprises all packages of the Eozilla suite
6 versions - Latest release: 3 months ago - 45 downloads last month - 0 stars on GitHub - 1 maintainer
pyees 2.4.8
EES but for Python. Pyees can be used to perform uncertainty (error) propagation. Furthermore, it ...
153 versions - Latest release: about 1 month ago - 1 dependent repository - 658 downloads last month - 1 star on GitHub - 1 maintainer
pycfs 0.2.0
Python library for automating and data handling tasks for openCFS.
20 versions - Latest release: 3 months ago - 331 downloads last month - 4 stars on gitlab.com - 1 maintainer
proadv 2.1.5
Process Acoustic Doppler Velocimeter data with advanced despiking and analysis tools
6 versions - Latest release: over 1 year ago - 25 downloads last month - 11 stars on GitHub - 1 maintainer
rlylutils 0.1.23
General file and data processing tools
15 versions - Latest release: almost 3 years ago - 19 downloads last month - 1 star on GitHub - 1 maintainer
csv2mne 0.0.1
Data formatter
2 versions - Latest release: about 3 years ago - 9 downloads last month - 1 maintainer
pipd 0.2.2
Utility functions for python data pipelines.
20 versions - Latest release: over 2 years ago - 272 downloads last month - 15 stars on GitHub - 1 maintainer
dagstd 0.1.3
Dagstd
4 versions - Latest release: over 3 years ago - 148 downloads last month - 2 stars on GitHub - 1 maintainer
image-features-extract 0.4.19
toolbox for extracting features from an image
10 versions - Latest release: about 5 years ago - 1 dependent repository - 33 downloads last month - 1 maintainer
imsciences 1.0.9
IMS Data Processing Package
169 versions - Latest release: about 1 month ago - 769 downloads last month - 5 maintainers
ds11mltoolkit 1.9
Helper functions for all stages of the machine learning model building process
8 versions - Latest release: almost 3 years ago - 44 downloads last month - 3 stars on GitHub - 2 maintainers
procodile 0.0.8
A light-weight processor development framework
5 versions - Latest release: 3 months ago - 177 downloads last month - 0 stars on GitHub - 1 maintainer
wraptile 0.0.8
FastAPI server that implements the OGC API - Processes
5 versions - Latest release: 3 months ago - 38 downloads last month - 0 stars on GitHub - 1 maintainer
cross_ml 2.0.1
⚠️ DEPRECATED: Please use BeaverFE instead (https://pypi.org/project/beaverfe/)
9 versions - Latest release: 7 months ago - 38 downloads last month - 0 stars on GitHub - 1 maintainer
cuiman 0.0.8
Provides a client Python API, GUI, and CLI for servers compliant with OGC API - Processes
5 versions - Latest release: 3 months ago - 34 downloads last month - 0 stars on GitHub - 1 maintainer
meowmotion 0.1.2
Mobile phone GPS data processor for trip generation and travel mode detection
3 versions - Latest release: 9 months ago - 13 downloads last month - 0 stars on GitHub - 1 maintainer
rivusio 0.2.0
A type-safe, async-first data processing pipeline framework
2 versions - Latest release: about 1 year ago - 12 downloads last month - 1 star on GitHub - 1 maintainer
arus-stream-metawear 1.0.4
arus plugin that helps creating stream for metawear devices
2 versions - Latest release: over 6 years ago - 1 dependent repository - 24 downloads last month - 1 star on GitHub - 1 maintainer
textmining-module 2.1.2
A Python Module for Comprehensive Text Mining, including Keyword Extraction and Text Analysis.
7 versions - Latest release: about 1 year ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
dawgsml 0.0.3
A simple library for machine learning without a requirements.txt
2 versions - Latest release: almost 2 years ago - 28 downloads last month - 1 star on GitHub - 1 maintainer
mgd-outliers 0.1.4
MGD_Outliers is a Python package for identifying and visualizing outliers in a DataFrame.
5 versions - Latest release: almost 3 years ago - 32 downloads last month - 1 maintainer
vre-eoles 0.2.1
toolbox for computing charge factor used in EOLES model
9 versions - Latest release: about 5 years ago - 1 dependent repository - 41 downloads last month - 0 stars on GitHub - 1 maintainer
binaryrain-helper-data-processing 0.1.2
Aims to simplify and help with commonly used functions in the data processing areas.
14 versions - Latest release: 4 months ago - 73 downloads last month - 0 stars on GitHub - 3 maintainers
quiffen 4.0.1
Quiffen
30 versions - Latest release: about 1 month ago - 1 dependent repository - 409 downloads last month - 38 stars on GitHub - 1 maintainer
opencf-core 0.3.4
A robust framework for handling file conversion tasks in Python
12 versions - Latest release: about 1 year ago - 1 dependent package - 56 downloads last month - 0 stars on GitHub - 1 maintainer
geniusrise-vision 0.1.5
Huggingface bolts for geniusrise
8 versions - Latest release: almost 2 years ago - 45 downloads last month - 7 stars on GitHub - 1 maintainer
biomdp 0.8.0
Useful set of functions for analyzing time series records, particularly for biomechanical data
20 versions - Latest release: 3 months ago - 168 downloads last month - 1 star on GitHub - 1 maintainer
nullaxe 0.4.2
A data cleaning library for Pandas and Polars DataFrames with a simple, chainable API.
3 versions - Latest release: 5 months ago - 11 downloads last month - 2 stars on GitHub - 1 maintainer
Related Keywords
python (32)
machine learning (29)
data science (18)
data analysis (14)
llm (14)
pandas (11)
deep learning (10)
ai (10)
data cleaning (10)
mlops (9)
geniusrise (9)
data (9)
time series (8)
automation (7)
pipeline (7)
AI (7)
data-science (7)
real-time (7)
llmops (6)
data-analysis (6)
llm-framework (6)
agentops (6)
agent-based-framework (6)
etl (6)
analytics (6)
natural language processing (6)
ogc (6)
eo (6)
esa (6)
streaming (5)
nlp (5)
feature engineering (5)
preprocessing (5)
data visualization (5)
polars (5)
data transformation (5)
huggingface (5)
predictive modeling (4)
data manipulation (4)
scikit-learn (4)
data engineering (4)
pytorch (4)
ETL (4)
numpy (4)
data validation (4)
dataframe (4)
cli (4)
api (4)
data preprocessing (4)
fine-tuning (4)
inference (4)
async (4)
visualization (3)
data integrity (3)
file conversion (3)
sklearn (3)
machine-learning (3)
common (3)
help (3)
data-engineering (3)
regression (3)
functions (3)
classification (3)
validation (3)
NLP (3)
fast (3)
workflow (3)
clustering (3)
statistics (3)
data handling (3)
database (3)
neuroscience (3)
performance (3)
data pipeline (3)
data wrangling (3)
cloud (3)
artificial intelligence (3)
inference-server (3)
rag (2)
uncertanty (2)
ml (2)
image (2)
data collection (2)
multilingual (2)
text-splitting (2)
chunking (2)
file formats (2)
file-conversion (2)
big data (2)
dataset (2)
datasets (2)
crystallography (2)
optimization (2)
Python (2)
Image (2)
automl (2)
missing data (2)
data imputation (2)
data quality (2)
finance (2)