pypi.org "data processing" keyword
View the packages on the pypi.org package registry that are tagged with the "data processing" keyword.
cuery 0.31.0
Prompt (cue) management and execution for tabular data.72 versions - Latest release: about 6 hours ago - 2.64 thousand downloads last month - 1 stars on GitHub - 1 maintainer
Top 1.9% on pypi.org
15 versions - Latest release: almost 2 years ago - 18 dependent packages - 78 dependent repositories - 57.6 thousand downloads last month - 1,873 stars on GitHub - 3 maintainers
datatable 1.1.0
Python library for fast multi-threaded data manipulation and munging.15 versions - Latest release: almost 2 years ago - 18 dependent packages - 78 dependent repositories - 57.6 thousand downloads last month - 1,873 stars on GitHub - 3 maintainers
fibphoflow 0.1.8
Python package to process and visualize TDT fiber photometry data3 versions - Latest release: over 2 years ago - 18 downloads last month - 1 maintainer
rezolve-ai-ingestion 0.1.4
A private package for ingesting and processing SharePoint data with AI capabilities4 versions - Latest release: about 1 year ago - 19 downloads last month - 6,176 stars on GitHub - 1 maintainer
saqc 2.7.0
A timeseries data quality control and processing tool/framework19 versions - Latest release: 23 days ago - 1 dependent repositories - 2.78 thousand downloads last month - 8 stars on git.ufz.de - 2 maintainers
Top 9.0% on pypi.org
143 versions - Latest release: about 1 month ago - 2 dependent packages - 3 dependent repositories - 1.03 thousand downloads last month - 48 stars on GitHub - 1 maintainer
csv-detective 0.9.2
Detect tabular files column content143 versions - Latest release: about 1 month ago - 2 dependent packages - 3 dependent repositories - 1.03 thousand downloads last month - 48 stars on GitHub - 1 maintainer
biomdp 0.7.27
Usefull set of functions for analyzing time series records, particularly for biomechanical data19 versions - Latest release: about 1 month ago - 251 downloads last month - 1 stars on GitHub - 1 maintainer
strahlenexposition-uba 1.0.0
Package for importing, processing and visualising radition exposure data1 version - Latest release: 5 months ago - 6 downloads last month - 1 maintainer
damast 0.2.0
Package to improve the development of transparent, replicable data processing pipelines14 versions - Latest release: 4 days ago - 91 downloads last month - 1 stars on GitHub - 1 maintainer
abconnect 0.1.9
A set of tools for connecting and processing data for Annex Brands, featuring API pack and ship q...8 versions - Latest release: 4 months ago - 337 downloads last month - 1 stars on GitHub - 1 maintainer
bertrand 0.0.1
(in development) Type-safe language bindings for Python/C++1 version - Latest release: over 1 year ago - 19 downloads last month - 2 stars on GitHub - 1 maintainer
cross_ml 2.0.1
⚠️ DEPRECATED: Please use BeaverFE instead (https://pypi.org/project/beaverfe/)9 versions - Latest release: 2 months ago - 60 downloads last month - 0 stars on GitHub - 1 maintainer
binaryrain-helper-data-processing 0.0.12
Aims to simplify and help with commonly used functions in the data processing areas.12 versions - Latest release: 3 months ago - 223 downloads last month - 0 stars on GitHub - 3 maintainers
logicsponge-core 0.0.17
A real-time data processing pipeline8 versions - Latest release: 4 months ago - 366 downloads last month - 2 stars on GitHub - 3 maintainers
acfortformat 0.1.3
Python library for reading and writing data in Fortran-style and native Python formats.1 version - Latest release: 2 months ago - 43 downloads last month - 0 stars on GitHub - 1 maintainer
phx-filters 3.4.0
Validation and data pipelines made easy!10 versions - Latest release: almost 2 years ago - 2 dependent packages - 16 dependent repositories - 349 downloads last month - 2 stars on GitHub - 1 maintainer
arachnea 0.0.5
A Python library for efficient array operations using a fluent API.4 versions - Latest release: about 1 year ago - 44 downloads last month - 0 stars on GitHub - 1 maintainer
space-packet-parser 6.0.0
A CCSDS telemetry packet decoding library based on the XTCE packet format description standard.24 versions - Latest release: 27 days ago - 8.76 thousand downloads last month - 31 stars on GitHub - 1 maintainer
particle-pack-tools 0.0.4
A toolkit for processing and visualizing particle pack data.4 versions - Latest release: about 1 month ago - 282 downloads last month - 0 stars on GitHub - 1 maintainer
pysetl 1.2.1
A PySpark ETL Framework14 versions - Latest release: 2 months ago - 66 downloads last month - 5 stars on GitHub - 1 maintainer
nttc 0.6.1
A set of functions that process and create topic models from a sample of community-detected Twitt...52 versions - Latest release: over 4 years ago - 1 dependent repositories - 272 downloads last month - 3 stars on GitHub - 1 maintainer
augmently 1.0.9
A library for Data Augmentation of images in computer vision.6 versions - Latest release: almost 6 years ago - 60 downloads last month - 4 stars on GitHub - 1 maintainer
quiffen 4.0.0
Quiffen29 versions - Latest release: about 1 month ago - 1 dependent repositories - 448 downloads last month - 38 stars on GitHub - 1 maintainer
geniusrise-databases 0.1.4
listeners bolts for geniusrise4 versions - Latest release: about 2 years ago - 17 downloads last month - 2 stars on GitHub - 1 maintainer
tensorneko-tool 0.3.21
The CLI Tools for Library TensorNeko.8 versions - Latest release: 10 months ago - 29 downloads last month - 11 stars on GitHub - 1 maintainer
geniusrise-vision 0.1.5
Huggingface bolts for geniusrise8 versions - Latest release: over 1 year ago - 45 downloads last month - 7 stars on GitHub - 1 maintainer
chunklet-py 1.4.0
A smart multilingual text chunker for LLMs, RAG, and beyond.1 version - Latest release: about 1 month ago - 212 downloads last month - 34 stars on GitHub - 1 maintainer
hll 2.4.0
Fast HyperLogLog for Python20 versions - Latest release: about 1 month ago - 20.7 thousand downloads last month - 109 stars on GitHub - 1 maintainer
informatics 0.0.1
Framework of fast implementation data processing and operating pipelines6 versions - Latest release: about 2 years ago - 1 dependent repositories - 203 downloads last month - 582 stars on GitHub - 1 maintainer
datasponge-core 0.0.6
A real-time data processing pipeline5 versions - Latest release: 12 months ago - 26 downloads last month - 2 stars on GitHub - 1 maintainer
pycfs 0.1.9
Python library for automating and data handling tasks for openCFS.19 versions - Latest release: about 1 month ago - 3.1 thousand downloads last month - 4 stars on gitlab.com - 1 maintainer
adaptivebridge 1.1.0
Revolutionizing ML adaptive modelling for handling missing features and data. The model can predi...5 versions - Latest release: over 1 year ago - 102 downloads last month - 1 stars on GitHub - 1 maintainer
sanex 0.3.0
A data cleaning library for Pandas and Polars DataFrames with a simple, chainable API.5 versions - Latest release: 22 days ago - 699 downloads last month - 2 stars on GitHub - 1 maintainer
nullaxe 0.4.2
A data cleaning library for Pandas and Polars DataFrames with a simple, chainable API.3 versions - Latest release: 21 days ago - 2 stars on GitHub - 1 maintainer
geniusrise-listeners 0.1.7
listeners bolts for geniusrise7 versions - Latest release: about 2 years ago - 33 downloads last month - 2 stars on GitHub - 1 maintainer
pyees 2.3.4
EES but for python. Pyees can be used do perform uncertanty (error) propagation. Furthermore, it ...139 versions - Latest release: 24 days ago - 1 dependent repositories - 517 downloads last month - 1 stars on GitHub - 1 maintainer
image-features-extract 0.4.19
toolbox for extracting features from an image10 versions - Latest release: over 4 years ago - 1 dependent repositories - 36 downloads last month - 1 maintainer
ctxpro 0.0.5
Simple toolkit that extracts ambiguities in documents that require context to resolve.5 versions - Latest release: over 1 year ago - 12 downloads last month - 0 stars on GitHub - 1 maintainer
logicsponge-monitoring 0.0.5
A real-time data processing pipeline5 versions - Latest release: 6 months ago - 39 downloads last month - 0 stars on GitHub - 3 maintainers
opencf 0.3.3
A collection of Python scripts for file conversion tasks, built on top of the opencf-core framework.7 versions - Latest release: over 1 year ago - 42 downloads last month - 0 stars on GitHub - 1 maintainer
rlylutils 0.1.23
General file and data processing tools15 versions - Latest release: over 2 years ago - 31 downloads last month - 1 stars on GitHub - 1 maintainer
analytics_tasks 0.1.0
Automation including file search and slide deck preparation.1 version - Latest release: 3 months ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
atlantic 1.1.80
Atlantic: Automated Preprocessing Framework for Supervised Machine Learning46 versions - Latest release: 9 months ago - 2 dependent packages - 154 downloads last month - 29 stars on GitHub - 1 maintainer
tensorneko-util 0.3.21
The Utils for Library TensorNeko.52 versions - Latest release: 10 months ago - 1 dependent package - 1 dependent repositories - 344 downloads last month - 11 stars on GitHub - 1 maintainer
hfutils 0.12.0
Useful utilities for huggingface48 versions - Latest release: 17 days ago - 1 dependent package - 476 thousand downloads last month - 21 stars on GitHub - 1 maintainer
tensorneko 0.3.21
Tensor Neural Engine Kompanion. An util library based on PyTorch and PyTorch Lightning.91 versions - Latest release: 10 months ago - 1 dependent repositories - 263 downloads last month - 11 stars on GitHub - 1 maintainer
arus-stream-metawear 1.0.4
arus plugin that helps creating stream for metawear devices2 versions - Latest release: almost 6 years ago - 1 dependent repositories - 34 downloads last month - 1 stars on GitHub - 1 maintainer
sas-to-polars 1.0.0
Convert SAS datasets (.sas7bdat files) to Polars DataFrames.1 version - Latest release: 5 months ago - 25 downloads last month - 1 stars on GitHub - 1 maintainer
conveyor-streaming 1.2.1
A Python library for streamlining asynchronous streaming tasks and pipelines.5 versions - Latest release: about 2 months ago - 57 downloads last month - 2 stars on GitHub - 1 maintainer
hysteresis 2.0.5
Hysteresis data processing tools.25 versions - Latest release: 12 months ago - 1 dependent repositories - 152 downloads last month - 62 stars on GitHub - 1 maintainer
memories-dev 2.0.8
Collective Memory Infrastructure for AGI10 versions - Latest release: 5 months ago - 26 downloads last month - 8 stars on GitHub - 1 maintainer
featransform 0.9.21
Featransform is an automated feature engineering framework for Supervised Machine Learning7 versions - Latest release: 11 months ago - 8 downloads last month - 4 stars on GitHub - 1 maintainer
ersatz 1.0.0
Simple sentence segmentation toolkit for segmenting and scoring2 versions - Latest release: over 4 years ago - 1 dependent repositories - 375 downloads last month - 34 stars on GitHub - 2 maintainers
casting-expert 0.1.7
A comprehensive Python package for type casting, conversion, and validation with advanced features7 versions - Latest release: 11 months ago - 28 downloads last month - 0 stars on GitHub - 1 maintainer
featurebridge 0.9.5
FeatureBridge: Revolutionizing ML adaptive modelling for handling missing features and data. The ...3 versions - Latest release: about 2 years ago - 191 downloads last month - 0 stars on GitHub - 1 maintainer
blossom-data 0.4.3
A simple way to synthesize LLM training data.5 versions - Latest release: 3 months ago - 24 downloads last month - 25 stars on GitHub - 1 maintainer
prosto 0.6.0
Data processing toolkit radically changing the way data is processed5 versions - Latest release: almost 4 years ago - 1 dependent repositories - 6 downloads last month - 91 stars on GitHub - 1 maintainer
irt-data-utils 0.0.1a2
Infrared Thermal Data Utils3 versions - Latest release: over 2 years ago - 18 downloads last month - 1 maintainer
data-preprocessing-library-sevvalcucuk-asudesozcu 1.1.7
A comprehensive toolkit for data processing including handling dates, encoding categorical variab...2 versions - Latest release: over 1 year ago - 13 downloads last month - 0 stars on GitHub - 2 maintainers
geniusrise-text 0.1.13
Text bolts for geniusrise14 versions - Latest release: over 1 year ago - 45 downloads last month - 5 stars on GitHub - 1 maintainer
rivusio 0.2.0
A type-safe, async-first data processing pipeline framework2 versions - Latest release: 8 months ago - 10 downloads last month - 1 stars on GitHub - 1 maintainer
csvuniondiff 0.0.0.dev1
A package for comparing CSV-like files through union and difference operations.2 versions - Latest release: about 1 year ago - 15 downloads last month - 7 stars on GitHub - 1 maintainer
dagstd 0.1.3
Dagstd4 versions - Latest release: over 3 years ago - 84 downloads last month - 2 stars on GitHub - 1 maintainer
vre-eoles 0.2.1
toolbox for computing charge factor used in EOLES model9 versions - Latest release: over 4 years ago - 1 dependent repositories - 19 downloads last month - 0 stars on GitHub - 1 maintainer
chunklet 1.4.0
A smart multilingual text chunker for LLMs, RAG, and beyond.19 versions - Latest release: about 1 month ago - 1.31 thousand downloads last month - 23 stars on GitHub - 1 maintainer
kmeans-tjdwill 1.0.4
A function-based implementation of k-means clustering that maintains data association.5 versions - Latest release: about 1 year ago - 18 downloads last month - 0 stars on GitHub - 1 maintainer
fast-writer 1.0.3
CLI Tools for writing data2 versions - Latest release: 4 months ago - 19 downloads last month - 0 stars on GitHub - 1 maintainer
dawgsml 0.0.3
A simple library for machine learning without a requirements.txt2 versions - Latest release: over 1 year ago - 10 downloads last month - 1 stars on GitHub - 1 maintainer
meowmotion 0.1.2
Mobile phone GPS data processor for trip generation and travel mode detection3 versions - Latest release: 5 months ago - 24 downloads last month - 0 stars on GitHub - 1 maintainer
arus 1.1.22
Activity Recognition with Ubiquitous Sensing59 versions - Latest release: over 4 years ago - 1 dependent repositories - 455 downloads last month - 0 stars on GitHub - 1 maintainer
arekit-ss 0.25.0
Low Resource Context Relation Sampler for contexts with relations for fact-checking and fine-tuni...3 versions - Latest release: 10 months ago - 32 downloads last month - 3 stars on GitHub - 1 maintainer
netcdf-scm 2.1.1
Processing netCDF files for use with simple climate models43 versions - Latest release: about 4 years ago - 1 dependent repositories - 3.32 thousand downloads last month - 6 stars on GitHub - 2 maintainers
giga-spatial 0.7.0 💰
A package for spatial data download & processing11 versions - Latest release: 14 days ago - 66 downloads last month - 16 stars on GitHub - 1 maintainer
geniusrise-openai 0.1.3
Openai bolts for geniusrise4 versions - Latest release: about 2 years ago - 69 downloads last month - 2 stars on GitHub - 1 maintainer
canoa-data-validate 0.7.47
O data_validate é um validador e processador de planilhas, que automatiza a checagem de integrida...11 versions - Latest release: 13 days ago - 1.21 thousand downloads last month - 0 stars on GitHub - 1 maintainer
scalary 0.1.3
Collection of practical tools for working with image data3 versions - Latest release: over 5 years ago - 1 dependent repositories - 9 downloads last month - 0 stars on GitHub - 1 maintainer
keprep 0.2.2
Minimally preprocessing TheBase dMRI data.9 versions - Latest release: about 1 year ago - 51 downloads last month - 1 stars on GitHub - 1 maintainer
geomagnetism 0.1.0
toolbox for geomagnetism computation8 versions - Latest release: about 5 years ago - 1 dependent repositories - 15 downloads last month - 1 stars on GitHub - 1 maintainer
joinem 0.10.0
CLI for fast, flexbile concatenation of tabular data using Polars.19 versions - Latest release: 5 months ago - 5.8 thousand downloads last month - 16 stars on GitHub - 1 maintainer
datasetplus 0.6.0
An enhanced wrapper for Hugging Face datasets with additional functionality12 versions - Latest release: 29 days ago - 684 downloads last month - 1 stars on GitHub - 1 maintainer
ds11mltoolkit 1.9
Helper functions for all stages of the machine learning model building process8 versions - Latest release: over 2 years ago - 13 downloads last month - 3 stars on GitHub - 2 maintainers
ml-dataloader 0.9.0
dataloader for machelearning24 versions - Latest release: almost 3 years ago - 1 dependent repositories - 15 downloads last month - 0 stars on GitHub - 1 maintainer
sportradar-unofficial 0.1.15
An unofficial python package to access sportradar NFL APIs.13 versions - Latest release: over 1 year ago - 23 downloads last month - 0 stars on GitHub - 1 maintainer
datatidy 1.0.1
A powerful, configuration-driven data processing and cleaning package4 versions - Latest release: 2 months ago - 34 downloads last month - 0 stars on GitHub - 1 maintainer
pipd 0.2.2
Utility functions for python data pipelines.20 versions - Latest release: about 2 years ago - 1.01 thousand downloads last month - 15 stars on GitHub - 1 maintainer
outputty 0.3.2
Import, filter and export tabular data with Python easily6 versions - Latest release: over 12 years ago - 3 dependent repositories - 30 downloads last month - 36 stars on GitHub - 1 maintainer
uniform 0.2.2
Uniform - dress your form processing endpoints📋4 versions - Latest release: over 5 years ago - 6 dependent repositories - 200 downloads last month - 1 stars on GitLab.com - 1 maintainer
geniusrise-huggingface 0.4.9
Huggingface bolts for geniusrise13 versions - Latest release: about 2 years ago - 286 downloads last month - 3 stars on GitHub - 1 maintainer
zeef 0.1.3
A Python Framework for Deep Active Learning4 versions - Latest release: over 3 years ago - 1 dependent repositories - 15 downloads last month - 1 maintainer
pysnom 0.2.1
Python tools for SNOM data processing.4 versions - Latest release: 3 months ago - 3.43 thousand downloads last month - 0 stars on GitHub - 2 maintainers
scmcallib 0.5.1
Perform calibration for simple climate models20 versions - Latest release: over 5 years ago - 1 dependent repositories - 47 downloads last month - 0 stars on gitlab.com - 2 maintainers
adpa 1.5.0
Advanced Data Processing and Analytics Framework3 versions - Latest release: 8 months ago - 40 downloads last month - 1 maintainer
narrator 0.0.0.4
A set of functions that process and create descriptive summary visualizations to help develop a b...3 versions - Latest release: almost 6 years ago - 1 dependent repositories - 38 downloads last month - 0 stars on GitHub - 1 maintainer
mapminer 0.1.64
An advanced geospatial data extraction and processing toolkit for Earth observation datasets.63 versions - Latest release: 20 days ago - 1.46 thousand downloads last month - 43 stars on GitHub - 1 maintainer
urlcounter 0.0.3
A set of functions that tally URLs within an event-based corpus. It assumes that you have data di...2 versions - Latest release: over 5 years ago - 1 dependent repositories - 9 downloads last month - 0 stars on GitHub - 1 maintainer
bolster 0.3.4
Bolster's Brain, you've been warned8 versions - Latest release: 5 months ago - 1 dependent repositories - 61 downloads last month - 3 stars on GitHub - 1 maintainer
eseas 1.0.4
eseas is a Python package that serves as a wrapper for the jwsacruncher Java package. This tool a...14 versions - Latest release: 7 months ago - 36 downloads last month - 1 stars on GitHub - 1 maintainer
textmining-module 2.1.2
A Python Module for Comprehensive Text Mining, including Keyword Extraction and Text Analysis.7 versions - Latest release: 10 months ago - 10 downloads last month - 0 stars on GitHub - 1 maintainer
proadv 2.1.5
Process Acoustic Doppler Velocimeter data with advanced despiking and analysis tools6 versions - Latest release: about 1 year ago - 32 downloads last month - 11 stars on GitHub - 1 maintainer
geniusrise-audio 0.1.12
audio bolts for geniusrise13 versions - Latest release: over 1 year ago - 39 downloads last month - 2 stars on GitHub - 1 maintainer
Related Keywords
python
30
machine learning
28
data science
17
data analysis
14
llm
13
ai
10
geniusrise
9
mlops
9
deep learning
9
pandas
9
data cleaning
8
time series
8
data
8
real-time
7
automation
7
AI
6
llmops
6
llm-framework
6
agentops
6
agent-based-framework
6
data-science
6
etl
6
analytics
6
pipeline
6
data transformation
5
natural language processing
5
data-analysis
5
feature engineering
5
nlp
5
polars
5
huggingface
5
preprocessing
5
data visualization
5
data engineering
4
data manipulation
4
streaming
4
predictive modeling
4
pytorch
4
async
4
cli
4
fine-tuning
4
data preprocessing
4
scikit-learn
4
api
4
data validation
4
dataframe
4
ETL
4
numpy
4
data wrangling
3
data integrity
3
fast
3
validation
3
data handling
3
machine-learning
3
regression
3
classification
3
artificial intelligence
3
statistics
3
common
3
help
3
functions
3
sklearn
3
inference-server
3
workflow
3
data-engineering
3
inference
3
performance
3
cloud
3
visualization
3
neuroscience
3
database
3
file conversion
3
NLP
3
data pipeline
3
clustering
3
spark
2
type-safe
2
finance
2
image
2
pyspark
2
optimization
2
chunking
2
text-splitting
2
rag
2
data-visualization
2
multilingual
2
computer vision
2
text processing
2
information retrieval
2
semantic search
2
document processing
2
data-preprocessing
2
download
2
ubiquitous computing
2
sensing
2
data conversion
2
feature-engineering
2
analysis
2
sql
2
ml
2