pypi.org "data processing" keyword
View the packages on the pypi.org package registry that are tagged with the "data processing" keyword.
tensorneko 0.3.21
Tensor Neural Engine Kompanion. A utility library based on PyTorch and PyTorch Lightning.
91 versions - Latest release: 9 months ago - 1 dependent repository - 263 downloads last month - 11 stars on GitHub - 1 maintainer
vre-eoles 0.2.1
toolbox for computing charge factor used in EOLES model
9 versions - Latest release: over 4 years ago - 1 dependent repository - 19 downloads last month - 0 stars on GitHub - 1 maintainer
python-datatable 1.1.3
Python library for fast multi-threaded data manipulation and munging.
4 versions - Latest release: over 2 years ago - 3 dependent packages - 193 downloads last month - 1,871 stars on GitHub - 1 maintainer
datatable 1.1.0
Python library for fast multi-threaded data manipulation and munging.
Top 1.9% on pypi.org
15 versions - Latest release: almost 2 years ago - 18 dependent packages - 78 dependent repositories - 39.7 thousand downloads last month - 1,871 stars on GitHub - 3 maintainers
fast-writer 1.0.3
CLI Tools for writing data
2 versions - Latest release: 3 months ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
informatics 0.0.1
Framework of fast implementation data processing and operating pipelines
6 versions - Latest release: almost 2 years ago - 1 dependent repository - 133 downloads last month - 582 stars on GitHub - 1 maintainer
saqc 2.6.0
A timeseries data quality control and processing tool/framework
16 versions - Latest release: over 1 year ago - 1 dependent repository - 1.18 thousand downloads last month - 8 stars on git.ufz.de - 2 maintainers
pyees 2.3.2
EES but for python. Pyees can be used to perform uncertainty (error) propagation. Furthermore, it ...
137 versions - Latest release: about 1 month ago - 1 dependent repository - 344 downloads last month - 1 star on GitHub - 1 maintainer
imsciences 1.0.6
IMS Data Processing Package
168 versions - Latest release: about 1 month ago - 514 downloads last month - 5 maintainers
litdata 0.2.52
The Deep Learning framework to train, deploy, and ship AI products Lightning fast.
58 versions - Latest release: 26 days ago - 2 dependent packages - 156 thousand downloads last month - 536 stars on GitHub - 2 maintainers
joinem 0.10.0
CLI for fast, flexible concatenation of tabular data using Polars.
19 versions - Latest release: 4 months ago - 5.8 thousand downloads last month - 16 stars on GitHub - 1 maintainer
csv-detective 0.9.2
Detect tabular files column content
Top 9.0% on pypi.org
142 versions - Latest release: 12 days ago - 2 dependent packages - 3 dependent repositories - 1.2 thousand downloads last month - 48 stars on GitHub - 1 maintainer
atlantic 1.1.80
Atlantic: Automated Preprocessing Framework for Supervised Machine Learning
46 versions - Latest release: 8 months ago - 2 dependent packages - 154 downloads last month - 29 stars on GitHub - 1 maintainer
cuery 0.19.0
Prompt (cue) management and execution for tabular data.
51 versions - Latest release: 15 days ago - 2.23 thousand downloads last month - 1 star on GitHub - 1 maintainer
bolster 0.3.4
Bolster's Brain, you've been warned
8 versions - Latest release: 5 months ago - 1 dependent repository - 61 downloads last month - 3 stars on GitHub - 1 maintainer
textmining-module 2.1.2
A Python Module for Comprehensive Text Mining, including Keyword Extraction and Text Analysis.
7 versions - Latest release: 9 months ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
space-packet-parser 5.0.1
A CCSDS telemetry packet decoding library based on the XTCE packet format description standard.
23 versions - Latest release: 11 months ago - 8.76 thousand downloads last month - 31 stars on GitHub - 1 maintainer
giga-spatial 0.6.9
A package for spatial data download & processing
10 versions - Latest release: about 1 month ago - 67 downloads last month - 16 stars on GitHub - 1 maintainer
data-preprocessing-library-sevvalcucuk-asudesozcu 1.1.7
A comprehensive toolkit for data processing including handling dates, encoding categorical variab...
2 versions - Latest release: over 1 year ago - 13 downloads last month - 0 stars on GitHub - 2 maintainers
geniusrise-text 0.1.13
Text bolts for geniusrise
14 versions - Latest release: over 1 year ago - 45 downloads last month - 5 stars on GitHub - 1 maintainer
dagstd 0.1.3
Dagstd
4 versions - Latest release: about 3 years ago - 84 downloads last month - 2 stars on GitHub - 1 maintainer
quiffen 4.0.0
Quiffen
29 versions - Latest release: 14 days ago - 1 dependent repository - 434 downloads last month - 38 stars on GitHub - 1 maintainer
bertrand 0.0.1
(in development) Type-safe language bindings for Python/C++
1 version - Latest release: over 1 year ago - 24 downloads last month - 2 stars on GitHub - 1 maintainer
netcdf-scm 2.1.1
Processing netCDF files for use with simple climate models
43 versions - Latest release: about 4 years ago - 1 dependent repository - 3.32 thousand downloads last month - 6 stars on GitHub - 2 maintainers
datasetplus 0.6.0
An enhanced wrapper for Hugging Face datasets with additional functionality
12 versions - Latest release: 4 days ago - 684 downloads last month - 1 star on GitHub - 1 maintainer
pysetl 1.2.1
A PySpark ETL Framework
14 versions - Latest release: about 1 month ago - 55 downloads last month - 5 stars on GitHub - 1 maintainer
urlcounter 0.0.3
A set of functions that tally URLs within an event-based corpus. It assumes that you have data di...
2 versions - Latest release: about 5 years ago - 1 dependent repository - 9 downloads last month - 0 stars on GitHub - 1 maintainer
analysts-tool-share 0.0.1
Tools for analyzing data, using Python.
1 version - Latest release: over 5 years ago - 1 dependent repository - 26 downloads last month - 0 stars on GitHub - 1 maintainer
analytics_tasks 0.1.0
Automation including file search and slide deck preparation.
1 version - Latest release: 2 months ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
mapminer 0.1.59
An advanced geospatial data extraction and processing toolkit for Earth observation datasets.
58 versions - Latest release: 5 days ago - 799 downloads last month - 43 stars on GitHub - 1 maintainer
nttc 0.6.1
A set of functions that process and create topic models from a sample of community-detected Twitt...
52 versions - Latest release: over 4 years ago - 1 dependent repository - 53 downloads last month - 3 stars on GitHub - 1 maintainer
pycfs 0.1.9
Python library for automating and data handling tasks for openCFS.
19 versions - Latest release: 6 days ago - 104 downloads last month - 4 stars on gitlab.com - 1 maintainer
datasponge-monitoring 0.0.1
A real-time data processing pipeline
1 version - Latest release: 11 months ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
adpa 1.5.0
Advanced Data Processing and Analytics Framework
3 versions - Latest release: 7 months ago - 40 downloads last month - 1 maintainer
logicsponge 0.0.9
A real-time data processing pipeline
1 version - Latest release: 11 months ago - 17 downloads last month - 3 stars on GitHub - 3 maintainers
imaxt-mosaic 1.15.0
Image stitching
20 versions - Latest release: over 2 years ago - 30 downloads last month - 0 stars on gitlab.developers.cam.ac.uk - 1 maintainer
arus-stream-metawear 1.0.4
arus plugin that helps create streams for metawear devices
2 versions - Latest release: almost 6 years ago - 1 dependent repository - 34 downloads last month - 1 star on GitHub - 1 maintainer
hysteresis 2.0.5
Hysteresis data processing tools.
25 versions - Latest release: 11 months ago - 1 dependent repository - 152 downloads last month - 62 stars on GitHub - 1 maintainer
rezolve-ai-ingestion 0.1.4
A private package for ingesting and processing SharePoint data with AI capabilities
4 versions - Latest release: 12 months ago - 17 downloads last month - 6,176 stars on GitHub - 1 maintainer
constelation-astronomer 1.1.4
constelation-astronomer: results processing package for CONSTELATION coupled model
1 version - Latest release: over 1 year ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
tensorneko-tool 0.3.21
The CLI Tools for Library TensorNeko.
8 versions - Latest release: 9 months ago - 34 downloads last month - 11 stars on GitHub - 1 maintainer
ds11mltoolkit 1.9
Helper functions for all stages of the machine learning model building process
8 versions - Latest release: over 2 years ago - 13 downloads last month - 3 stars on GitHub - 2 maintainers
damast 0.1.12
Package to improve the development of transparent, replicable data processing pipelines
13 versions - Latest release: 12 days ago - 379 downloads last month - 1 star on GitHub - 1 maintainer
arachnea 0.0.5
A Python library for efficient array operations using a fluent API.
4 versions - Latest release: about 1 year ago - 44 downloads last month - 1 maintainer
scmcallib 0.5.1
Perform calibration for simple climate models
20 versions - Latest release: over 5 years ago - 1 dependent repository - 47 downloads last month - 0 stars on gitlab.com - 2 maintainers
logicsponge-monitoring 0.0.5
A real-time data processing pipeline
5 versions - Latest release: 5 months ago - 33 downloads last month - 0 stars on GitHub - 3 maintainers
invoice-generator-pdf 1.0.0
A Python package designed to effortlessly convert Excel-based invoices into professionally format...
1 version - Latest release: 10 months ago - 9 downloads last month - 1 maintainer
sas-to-polars 1.0.0
Convert SAS datasets (.sas7bdat files) to Polars DataFrames.
1 version - Latest release: 4 months ago - 25 downloads last month - 1 star on GitHub - 1 maintainer
blossom-data 0.4.3
A simple way to synthesize LLM training data.
5 versions - Latest release: 2 months ago - 24 downloads last month - 25 stars on GitHub - 1 maintainer
eseas 1.0.4
eseas is a Python package that serves as a wrapper for the jwsacruncher Java package. This tool a...
14 versions - Latest release: 6 months ago - 61 downloads last month - 1 star on GitHub - 1 maintainer
chunklet 1.4.0
A smart multilingual text chunker for LLMs, RAG, and beyond.
19 versions - Latest release: 10 days ago - 1.31 thousand downloads last month - 23 stars on GitHub - 1 maintainer
chunklet-py 1.4.0
A smart multilingual text chunker for LLMs, RAG, and beyond.
1 version - Latest release: 10 days ago - 23 stars on GitHub - 1 maintainer
binaryrain-helper-data-processing 0.0.12
Aims to simplify and help with commonly used functions in the data processing areas.
12 versions - Latest release: about 2 months ago - 86 downloads last month - 0 stars on GitHub - 3 maintainers
arus 1.1.22
Activity Recognition with Ubiquitous Sensing
59 versions - Latest release: over 4 years ago - 1 dependent repository - 455 downloads last month - 0 stars on GitHub - 1 maintainer
adaptivebridge 1.1.0
Revolutionizing ML adaptive modelling for handling missing features and data. The model can predi...
5 versions - Latest release: over 1 year ago - 92 downloads last month - 1 star on GitHub - 1 maintainer
memories-dev 2.0.8
Collective Memory Infrastructure for AGI
10 versions - Latest release: 4 months ago - 41 downloads last month - 8 stars on GitHub - 1 maintainer
biomdp 0.7.27
Useful set of functions for analyzing time series records, particularly for biomechanical data
19 versions - Latest release: 20 days ago - 292 downloads last month - 1 star on GitHub - 1 maintainer
particle-pack-tools 0.0.4
A toolkit for processing and visualizing particle pack data.
4 versions - Latest release: 12 days ago - 306 downloads last month - 0 stars on GitHub - 1 maintainer
strahlenexposition-uba 1.0.0
Package for importing, processing and visualising radiation exposure data
1 version - Latest release: 4 months ago - 7 downloads last month - 1 maintainer
scalary 0.1.3
Collection of practical tools for working with image data
3 versions - Latest release: over 5 years ago - 1 dependent repository - 9 downloads last month - 0 stars on GitHub - 1 maintainer
keprep 0.2.2
Minimally preprocessing TheBase dMRI data.
9 versions - Latest release: 12 months ago - 51 downloads last month - 1 star on GitHub - 1 maintainer
pipd 0.2.2
Utility functions for python data pipelines.
20 versions - Latest release: about 2 years ago - 1.01 thousand downloads last month - 15 stars on GitHub - 1 maintainer
hfutils 0.11.1
Useful utilities for huggingface
47 versions - Latest release: 4 months ago - 1 dependent package - 416 thousand downloads last month - 21 stars on GitHub - 1 maintainer
uniform 0.2.2
Uniform - dress your form processing endpoints 📋
4 versions - Latest release: over 5 years ago - 6 dependent repositories - 254 downloads last month - 1 star on GitLab.com - 1 maintainer
cross_ml 2.0.1
⚠️ DEPRECATED: Please use BeaverFE instead (https://pypi.org/project/beaverfe/)
9 versions - Latest release: about 2 months ago - 55 downloads last month - 0 stars on GitHub - 1 maintainer
csvuniondiff 0.0.0.dev1
A package for comparing CSV-like files through union and difference operations.
2 versions - Latest release: about 1 year ago - 15 downloads last month - 7 stars on GitHub - 1 maintainer
fibphoflow 0.1.8
Python package to process and visualize TDT fiber photometry data
3 versions - Latest release: over 2 years ago - 18 downloads last month - 1 maintainer
geniusrise-vision 0.1.5
Huggingface bolts for geniusrise
8 versions - Latest release: over 1 year ago - 55 downloads last month - 7 stars on GitHub - 1 maintainer
meowmotion 0.1.2
Mobile phone GPS data processor for trip generation and travel mode detection
3 versions - Latest release: 4 months ago - 24 downloads last month - 0 stars on GitHub - 1 maintainer
hll 2.3.0
Fast HyperLogLog for Python
19 versions - Latest release: 9 months ago - 18.1 thousand downloads last month - 109 stars on GitHub - 1 maintainer
datasponge 0.0.1
A real-time data processing pipeline
1 version - Latest release: 11 months ago - 11 downloads last month - 3 stars on GitHub - 1 maintainer
logicsponge-processmining 0.0.5
A real-time data processing pipeline
5 versions - Latest release: 3 months ago - 73 downloads last month - 1 star on GitHub - 4 maintainers
py-simple-flow 2020.8.23
Simple data processing (ETL) library with support for multi-processing
10 versions - Latest release: about 5 years ago - 3 dependent repositories - 29 downloads last month - 1 maintainer
checkpointer 2.14.6
checkpointer adds code-aware caching to Python functions, maintaining correctness and speeding up...
46 versions - Latest release: about 2 months ago - 2 dependent repositories - 1.12 thousand downloads last month - 6 stars on GitHub - 1 maintainer
geniusrise-listeners 0.1.7
listeners bolts for geniusrise
7 versions - Latest release: almost 2 years ago - 48 downloads last month - 2 stars on GitHub - 1 maintainer
acfortformat 0.1.3
Python library for reading and writing data in Fortran-style and native Python formats.
1 version - Latest release: about 2 months ago - 0 stars on GitHub - 1 maintainer
fiiireflyyy 0.3.0
A python package covering miscellaneous uses, from system management to machine learning and imag...
35 versions - Latest release: 8 months ago - 1 dependent repository - 58 downloads last month - 1 maintainer
geniusrise-databases 0.1.4
listeners bolts for geniusrise
4 versions - Latest release: almost 2 years ago - 34 downloads last month - 2 stars on GitHub - 1 maintainer
kmeans-tjdwill 1.0.4
A function-based implementation of k-means clustering that maintains data association.
5 versions - Latest release: about 1 year ago - 36 downloads last month - 0 stars on GitHub - 1 maintainer
opencf-core 0.3.4
A robust framework for handling file conversion tasks in Python
12 versions - Latest release: 9 months ago - 1 dependent package - 81 downloads last month - 0 stars on GitHub - 1 maintainer
lidirl 0.0.1
LID toolkit to improve performance on spontaneous noisy text with data augmentation.
1 version - Latest release: over 2 years ago - 26 downloads last month - 0 stars on GitHub - 1 maintainer
django-crunch 0.1.12
A data processing orchestration tool.
1 version - Latest release: over 2 years ago - 9 downloads last month - 3 stars on GitHub - 1 maintainer
rlylutils 0.1.23
General file and data processing tools
15 versions - Latest release: over 2 years ago - 11 downloads last month - 1 star on GitHub - 1 maintainer
geniusrise-audio 0.1.12
audio bolts for geniusrise
13 versions - Latest release: over 1 year ago - 89 downloads last month - 2 stars on GitHub - 1 maintainer
abconnect 0.1.9
A set of tools for connecting and processing data for Annex Brands, featuring API pack and ship q...
8 versions - Latest release: 3 months ago - 311 downloads last month - 1 star on GitHub - 1 maintainer
pi-folder-organizer 2.2.2
A Python package for cleaning up cluttered files and organizing them into respective folders.
6 versions - Latest release: about 1 year ago - 34 downloads last month - 1 maintainer
yalab-procedures 0.0.1
The yalab-procedures repository is dedicated to managing and executing data processing procedures...
1 version - Latest release: about 1 year ago - 8 downloads last month - 1 maintainer
ctxpro 0.0.5
Simple toolkit that extracts ambiguities in documents that require context to resolve.
5 versions - Latest release: over 1 year ago - 14 downloads last month - 0 stars on GitHub - 1 maintainer
sreader 0.0.1
space-reader: Convert any file path into LLM-friendly inputs
1 version - Latest release: over 1 year ago - 10 downloads last month - 1 maintainer
image-features-extract 0.4.19
toolbox for extracting features from an image
10 versions - Latest release: over 4 years ago - 1 dependent repository - 38 downloads last month - 1 maintainer
featransform 0.9.21
Featransform is an automated feature engineering framework for Supervised Machine Learning
7 versions - Latest release: 11 months ago - 56 downloads last month - 4 stars on GitHub - 1 maintainer
prosto 0.6.0
Data processing toolkit radically changing the way data is processed
5 versions - Latest release: almost 4 years ago - 1 dependent repository - 21 downloads last month - 91 stars on GitHub - 1 maintainer
outputty 0.3.2
Import, filter and export tabular data with Python easily
6 versions - Latest release: over 12 years ago - 3 dependent repositories - 17 downloads last month - 36 stars on GitHub - 1 maintainer
geniusrise 0.1.7
An LLM framework
50 versions - Latest release: over 1 year ago - 9 dependent packages - 1 dependent repository - 319 downloads last month - 60 stars on GitHub - 1 maintainer
rivusio 0.2.0
A type-safe, async-first data processing pipeline framework
2 versions - Latest release: 7 months ago - 16 downloads last month - 1 star on GitHub - 1 maintainer
ml-dataloader 0.9.0
dataloader for machine learning
24 versions - Latest release: almost 3 years ago - 1 dependent repository - 28 downloads last month - 0 stars on GitHub - 1 maintainer
narrator 0.0.0.4
A set of functions that process and create descriptive summary visualizations to help develop a b...
3 versions - Latest release: almost 6 years ago - 1 dependent repository - 46 downloads last month - 0 stars on GitHub - 1 maintainer
dawgsml 0.0.3
A simple library for machine learning without a requirements.txt
2 versions - Latest release: over 1 year ago - 13 downloads last month - 1 star on GitHub - 1 maintainer
augmently 1.0.9
A library for Data Augmentation of images in computer vision.
6 versions - Latest release: almost 6 years ago - 26 downloads last month - 4 stars on GitHub - 1 maintainer
csv2mne 0.0.1
Data formatter
2 versions - Latest release: almost 3 years ago - 4 downloads last month - 1 maintainer
Related Keywords
python - 30
machine learning - 28
data science - 15
data analysis - 14
llm - 13
ai - 10
geniusrise - 9
mlops - 9
deep learning - 9
time series - 8
data - 8
automation - 7
pandas - 7
real-time - 7
agentops - 6
llm-framework - 6
llmops - 6
agent-based-framework - 6
analytics - 6
data-science - 6
pipeline - 6
AI - 6
data cleaning - 6
nlp - 5
natural language processing - 5
data visualization - 5
feature engineering - 5
preprocessing - 5
data transformation - 5
huggingface - 5
data-analysis - 5
streaming - 4
pytorch - 4
dataframe - 4
data preprocessing - 4
predictive modeling - 4
data manipulation - 4
numpy - 4
ETL - 4
etl - 4
data engineering - 4
api - 4
cli - 4
scikit-learn - 4
fine-tuning - 4
async - 4
NLP - 3
statistics - 3
visualization - 3
data-engineering - 3
functions - 3
help - 3
common - 3
artificial intelligence - 3
data wrangling - 3
data pipeline - 3
clustering - 3
sklearn - 3
performance - 3
polars - 3
file conversion - 3
inference - 3
inference-server - 3
fast - 3
validation - 3
neuroscience - 3
machine-learning - 3
data validation - 3
classification - 3
regression - 3
database - 3
data handling - 3
workflow - 3
cloud - 3
ftrl - 2
analysis - 2
dataset - 2
sensing - 2
type-safe - 2
ubiquitous computing - 2
xarray - 2
finance - 2
simple climate model - 2
sql - 2
spark - 2
reduced complexity climate model - 2
datasets - 2
optimization - 2
big data - 2
pyspark - 2
processing - 2
data completeness - 2
impute missing values - 2
data missingness - 2
missing data detection - 2
data quality assessment - 2
data pre-processing tool - 2
image - 2
data collection - 2
MRI - 2