Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data-processing" keyword

Top 6.2% on pypi.org
cdp-backend 4.1.3
Data storage utilities and processing pipelines used by CDP instances.
108 versions - Latest release: 5 months ago - 3 dependent packages - 22 dependent repositories - 2.83 thousand downloads last month - 21 stars on GitHub - 1 maintainer
nvidia-dali-nightly-cuda120 1.38.0.dev20240501
NVIDIA DALI nightly for CUDA 12.0. Git SHA: 80b67f93fcbd57985b35db94e9788602334ea37f
118 versions - Latest release: about 8 hours ago - 639 downloads last month - 4,906 stars on GitHub - 2 maintainers
Top 5.1% on pypi.org
padasip 1.2.2
Python Adaptive Signal Processing
13 versions - Latest release: over 1 year ago - 1 dependent package - 10 dependent repositories - 1.52 thousand downloads last month - 284 stars on GitHub - 1 maintainer
omforme 2023.6.18
Reshape (Danish: omforme).
2 versions - Latest release: 11 months ago - 977 downloads last month - 2 maintainers
eyecu-bumblebee 0.5.7
Advanced pipelines for video datasets
46 versions - Latest release: about 3 years ago - 1 dependent repositories - 351 downloads last month - 8 stars on GitHub - 2 maintainers
sakura-py 0.9.6
Sakura platform installation files.
20 versions - Latest release: about 3 years ago - 1 dependent repositories - 50 downloads last month - 3 stars on GitHub - 2 maintainers
Top 3.8% on pypi.org
pysparkling 0.6.2
Pure Python implementation of the Spark RDD interface.
69 versions - Latest release: over 1 year ago - 1 dependent package - 34 dependent repositories - 11.8 thousand downloads last month - 261 stars on GitHub - 2 maintainers
nvidia-dali-tf-plugin-nightly-cuda110 1.38.0.dev20240430
NVIDIA DALI nightly TensorFlow plugin for CUDA 11.0. Git SHA: 82983535cd65dc1ba11018b4b35dbae6e2...
103 versions - Latest release: 1 day ago - 605 downloads last month - 4,906 stars on GitHub - 2 maintainers
nvidia-dali-nightly-cuda110 1.38.0.dev20240430
NVIDIA DALI nightly for CUDA 11.0. Git SHA: 82983535cd65dc1ba11018b4b35dbae6e2c305d5
104 versions - Latest release: 1 day ago - 1 dependent repositories - 514 downloads last month - 4,906 stars on GitHub - 2 maintainers
cratedb-toolkit 0.0.10
CrateDB Toolkit
9 versions - Latest release: 23 days ago - 2 dependent packages - 1 dependent repositories - 1.82 thousand downloads last month - 3 stars on GitHub - 4 maintainers
pystream-pipeline 0.2.0
Python package to create and manage fast parallelized data processing pipeline for real-time appl...
5 versions - Latest release: 6 months ago - 1 dependent repositories - 37 downloads last month - 2 stars on GitHub - 1 maintainer
lose 1.0.0
A helper package for handling data using hdf5 format
24 versions - Latest release: almost 4 years ago - 2 dependent repositories - 236 downloads last month - 0 stars on GitHub - 2 maintainers
Top 3.0% on pypi.org
raydp 1.6.0
RayDP: Distributed Data Processing on Ray
55 versions - Latest release: 9 months ago - 1 dependent package - 70 dependent repositories - 17 thousand downloads last month - 263 stars on GitHub - 4 maintainers
Top 5.8% on pypi.org
raydp-nightly 2024.5.1.dev0
RayDP: Distributed Data Processing on Ray
132 versions - Latest release: 1 day ago - 90 dependent repositories - 1.12 thousand downloads last month - 265 stars on GitHub - 1 maintainer
Top 3.4% on pypi.org
bytewax 0.19.1
Python Stream Processing
30 versions - Latest release: about 1 month ago - 3 dependent packages - 20 dependent repositories - 7.03 thousand downloads last month - 918 stars on GitHub - 2 maintainers
electiongraphs 0.3.4
Create graphs for displaying the result of a election based on a csv-inputfile.
4 versions - Latest release: 6 months ago - 21 downloads last month - 0 stars on GitHub - 2 maintainers
dolma 1.0.3
Data filters
17 versions - Latest release: 22 days ago - 9.79 thousand downloads last month - 775 stars on GitHub - 4 maintainers
amanogawa 0.0.1a1
Flexible graph construction and data pre-processing engine
2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 39 downloads last month - 5 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
libertem 0.13.1
Open pixelated STEM framework
25 versions - Latest release: 6 months ago - 2 dependent packages - 3 dependent repositories - 644 downloads last month - 105 stars on GitHub - 3 maintainers
dpyp 1.0.0
A pandas convenience wrapper for small-scale data pipelines
1 version - Latest release: 10 days ago - 190 downloads last month - 2 stars on GitHub - 2 maintainers
perke 0.4.4
A keyphrase extractor for Persian
13 versions - Latest release: 10 months ago - 1 dependent repositories - 110 downloads last month - 68 stars on GitHub - 1 maintainer
zenaton 0.4.2
Zenaton client library
17 versions - Latest release: over 4 years ago - 2 dependent repositories - 123 downloads last month - 24 stars on GitHub - 2 maintainers
redis-message-queue 0.8.0
Python message queuing with Redis and message deduplication
11 versions - Latest release: 3 months ago - 238 downloads last month - 2 stars on GitHub - 2 maintainers
roll-rate-analysis 0.1.7
Roll Rate Analysis python package. Both month over month and snapshot roll rate functionalities a...
6 versions - Latest release: about 2 months ago - 23 downloads last month - 3 stars on GitHub - 1 maintainer
nvidia-dali-tf-plugin-cuda120 1.37.0
NVIDIA DALI TensorFlow plugin for CUDA 12.0. Git SHA: 1bc7fc20b0ff373a3320eca0c7f4860feb4a3bd2
16 versions - Latest release: 3 days ago - 163 downloads last month - 4,906 stars on GitHub - 2 maintainers
Top 2.1% on pypi.org
nvidia-dali-cuda110 1.37.0
NVIDIA DALI for CUDA 11.0. Git SHA: 1bc7fc20b0ff373a3320eca0c7f4860feb4a3bd2
23 versions - Latest release: 3 days ago - 3 dependent packages - 95 dependent repositories - 9.46 thousand downloads last month - 4,906 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
nvidia-dali-tf-plugin-cuda110 1.37.0
NVIDIA DALI TensorFlow plugin for CUDA 11.0. Git SHA: 1bc7fc20b0ff373a3320eca0c7f4860feb4a3bd2
23 versions - Latest release: 3 days ago - 6 dependent repositories - 2.04 thousand downloads last month - 4,906 stars on GitHub - 1 maintainer
machine-learning-data-pipeline 1.0.3
Pipeline module for parallel real-time data processing for machine learning models development an...
2 versions - Latest release: over 5 years ago - 1 dependent repositories - 22 downloads last month - 22 stars on GitHub - 2 maintainers
trepr 0.2.0
Package for handling tr-EPR data.
3 versions - Latest release: over 2 years ago - 1 dependent repositories - 40 downloads last month - 0 stars on GitHub - 2 maintainers
data-science-kit 0.0.1
Data Science Basic Functions
1 version - Latest release: almost 3 years ago - 1 dependent repositories - 16 downloads last month - 1 stars on GitHub - 1 maintainer
Top 7.9% on pypi.org
texar 0.2.4
Toolkit for Machine Learning and Text Generation
5 versions - Latest release: over 4 years ago - 9 dependent repositories - 31 downloads last month - 2,383 stars on GitHub - 4 maintainers
gmeterpy 0.0.2
Processing gravity measurements with Python
2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 10 downloads last month - 10 stars on GitHub - 2 maintainers
dataform 1.0.0
DataForm: Data processing and transformation tool.
1 version - Latest release: 3 months ago - 26 downloads last month - 1 stars on GitHub - 2 maintainers
pyfgcz 3.0.1
PyFGCZ contains BioBeamer and FCC python code.
5 versions - Latest release: over 4 years ago - 1 dependent repositories - 23 downloads last month - 2 stars on GitHub - 2 maintainers
xl2times 0.1.0
An open source tool to convert TIMES models specified in Excel a format ready for processing by GAMS
2 versions - Latest release: about 1 month ago - 136 downloads last month - 8 stars on GitHub - 4 maintainers
Top 8.4% on pypi.org
forte 0.2.0
Forte is extensible framework for building composable and modularized NLP workflows.
13 versions - Latest release: about 2 years ago - 6 dependent repositories - 221 downloads last month - 236 stars on GitHub - 1 maintainer
ini2csv 1.0.0
A simple utility that converts and combines a folder of .ini files with identical keys into one c...
1 version - Latest release: 11 months ago - 12 downloads last month - 1 stars on GitHub - 4 maintainers
light-pipe 0.3.1
A high-level syntax for data pipelines, designed to make pipeline development quick and painless.
5 versions - Latest release: 11 months ago - 29 downloads last month - 3 stars on GitHub - 2 maintainers
abpytools 0.3.2
Python package for antibody analysis
11 versions - Latest release: over 5 years ago - 1 dependent repositories - 127 downloads last month - 23 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
pandera 0.18.3 💰
A light-weight and flexible data validation and testing tool for statistical data objects.
86 versions - Latest release: about 2 months ago - 60 dependent packages - 229 dependent repositories - 2.76 million downloads last month - 2,979 stars on GitHub - 3 maintainers
Top 5.5% on pypi.org
nonechucks 0.4.2
nonechucks is a library that provides wrappers for PyTorch's datasets, samplers, and transforms t...
18 versions - Latest release: almost 3 years ago - 26 dependent repositories - 1.14 thousand downloads last month - 372 stars on GitHub - 2 maintainers
smartpipeline 0.7.3
A framework for fast developing scalable data pipelines following a simple design pattern
11 versions - Latest release: 4 months ago - 1 dependent repositories - 68 downloads last month - 21 stars on GitHub - 2 maintainers
nhanes-pytool-api 0.1.1
A tool for programmatic access to NHANES downloadable datasets
2 versions - Latest release: 6 months ago - 47 downloads last month - 0 stars on GitHub - 1 maintainer
tasrif 0.1.0
Tasrif is a python library for processing of wearable data from fitness trackers and wearable hea...
7 versions - Latest release: about 2 years ago - 1 dependent repositories - 41 downloads last month - 15 stars on GitHub - 1 maintainer
generatorpipeline 1.0
Parallelize your data-processing pipelines with just a decorator.
1 version - Latest release: over 3 years ago - 1 dependent repositories - 9 downloads last month - 2 stars on GitHub - 1 maintainer
spifpy 1.0.5
Single Particle Image Format (SPIF) data converter and interface
4 versions - Latest release: over 1 year ago - 35 downloads last month - 0 stars on GitHub - 1 maintainer
datasetops 0.0.6
Fluent dataset operations, compatible with your favorite libraries
4 versions - Latest release: about 4 years ago - 4 dependent repositories - 57 downloads last month - 10 stars on GitHub - 2 maintainers
forte.health 0.1.0
NLP pipeline framework for biomedical and clinical domains
1 version - Latest release: almost 2 years ago - 11 downloads last month - 10 stars on GitHub - 6 maintainers
Top 6.9% on pypi.org
itertable 2.2.0
Iterable API for tabular datasets including CSV, XLSX, XML, & JSON.
4 versions - Latest release: 11 months ago - 1 dependent package - 8 dependent repositories - 4.76 thousand downloads last month - 51 stars on GitHub - 2 maintainers
cwepr 0.5.1
Package for handling cw-EPR data.
10 versions - Latest release: 2 months ago - 1 dependent repositories - 84 downloads last month - 1 stars on GitHub - 4 maintainers
ror 0.1.1
Simple pipelining framework in Python
3 versions - Latest release: 4 months ago - 13 downloads last month - 0 stars on GitHub - 2 maintainers
glide 0.4.1
Easy ETL
45 versions - Latest release: over 1 year ago - 1 dependent repositories - 270 downloads last month - 19 stars on GitHub - 2 maintainers
pandas-optimum 0.0.6
Optimised pandas, Best practices in-built
6 versions - Latest release: 10 months ago - 31 downloads last month - 0 stars on GitHub - 2 maintainers
pyvaspflow 0.0.3
Vasp Calculation
1 version - Latest release: about 4 years ago - 1 dependent repositories - 14 downloads last month - 17 stars on GitHub - 2 maintainers
Top 4.0% on pypi.org
lithops 3.3.0
Lithops lets you transparently run your Python applications in the Cloud
49 versions - Latest release: 7 days ago - 2 dependent packages - 8 dependent repositories - 2.56 thousand downloads last month - 289 stars on GitHub - 2 maintainers
sparklanes 0.2.4
A lightweight framework to build and execute data processing pipelines in pyspark (Apache Spark's...
5 versions - Latest release: over 5 years ago - 1 dependent repositories - 51 downloads last month - 16 stars on GitHub - 1 maintainer
polars-istr 0.0.1
Polars extension for general data science use cases
1 version - Latest release: about 1 month ago - 909 downloads last month - 212 stars on GitHub - 2 maintainers
fondant 1.0.0
Fondant - Large-scale data processing made easy and reusable
45 versions - Latest release: 3 months ago - 1 dependent repositories - 890 downloads last month - 316 stars on GitHub - 4 maintainers
amical 1.6.0
Extraction pipeline and analysis tools for Aperture Masking Interferometry mode of the last gener...
5 versions - Latest release: about 1 year ago - 1 dependent repositories - 44 downloads last month - 9 stars on GitHub - 2 maintainers
Top 9.8% on pypi.org
haupt 2.1.8
Lineage metadata API, artifacts streams, sandbox, ML-API, and spaces for Polyaxon.
125 versions - Latest release: 12 days ago - 1 dependent package - 947 downloads last month - 452 stars on GitHub - 2 maintainers
nvidia-dali-tf-plugin-nightly-cuda120 1.37.0.dev20240409
NVIDIA DALI nightly TensorFlow plugin for CUDA 12.0. Git SHA: fb5786c82c162af3f2120e8ab8cbb8d5d5...
114 versions - Latest release: 21 days ago - 518 downloads last month - 4,906 stars on GitHub - 2 maintainers
Top 4.0% on pypi.org
nvidia-dali-cuda120 1.36.0
NVIDIA DALI for CUDA 12.0. Git SHA: e2ae685702638e3f8fae8091344f0f7ea045a1f9
16 versions - Latest release: about 1 month ago - 1 dependent package - 11 dependent repositories - 2.18 thousand downloads last month - 4,906 stars on GitHub - 2 maintainers
sofine 0.2.4
Lightweight framework for creating data-collection plugins and chaining together calls to them, f...
10 versions - Latest release: over 9 years ago - 2 dependent repositories - 23 downloads last month - 7 stars on GitHub - 2 maintainers
hstreamdb-api 0.6.1
HStreamDB api for Python
11 versions - Latest release: 9 months ago - 2 dependent repositories - 26 downloads last month - 691 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
vip-hci 1.6.0
Package for astronomical high-contrast image processing.
43 versions - Latest release: about 1 month ago - 1 dependent package - 2 dependent repositories - 391 downloads last month - 69 stars on GitHub - 4 maintainers
text-dedup 0.4.0
All-in-one text de-duplication
24 versions - Latest release: 15 days ago - 2 dependent repositories - 389 downloads last month - 479 stars on GitHub - 1 maintainer
faster-os 0.0.11
Up to 6700% faster OS module.
1 version - Latest release: about 2 years ago - 1 dependent repositories - 10 downloads last month - 14 stars on GitHub - 1 maintainer
Top 7.2% on pypi.org
rapidtables 0.1.11
Format/create and print tables from lists of dicts
37 versions - Latest release: over 4 years ago - 9 dependent repositories - 188 downloads last month - 288 stars on GitHub - 2 maintainers
Top 7.7% on pypi.org
heat 1.3.1
A framework for high-performance data analytics and machine learning.
18 versions - Latest release: 5 months ago - 8 dependent repositories - 135 downloads last month - 178 stars on GitHub - 3 maintainers
libertem-blobfinder 0.5.0
LiberTEM correlation and refinement library
3 versions - Latest release: 12 months ago - 1 dependent package - 1 dependent repositories - 17 downloads last month - 5 stars on GitHub - 4 maintainers
glassflow 1.0.3
GlassFlow Python Client SDK
8 versions - Latest release: 16 days ago - 325 downloads last month - 7 stars on GitHub - 2 maintainers
lineagemd 0.0.0
Lineage metadata for ML/AI/Data.
1 version - Latest release: over 1 year ago - 12 downloads last month - 452 stars on GitHub - 2 maintainers
hauptai 0.0.0
Haupt ai.
1 version - Latest release: over 1 year ago - 5 downloads last month - 452 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
bonobo 0.6.4
Bonobo, a simple, modern and atomic extract-transform-load toolkit for python 3.5+.
37 versions - Latest release: almost 5 years ago - 35 dependent repositories - 28.2 thousand downloads last month - 1,573 stars on GitHub - 2 maintainers
cotk 0.1.0
Conversational Toolkits
3 versions - Latest release: almost 4 years ago - 2 dependent repositories - 98 downloads last month - 128 stars on GitHub - 2 maintainers
pipe21 1.23.0
simple functional pipes
46 versions - Latest release: 19 days ago - 4 dependent packages - 109 downloads last month - 13 stars on GitHub - 2 maintainers
prosto 0.6.0
Data processing toolkit radically changing the way data is processed
5 versions - Latest release: over 2 years ago - 1 dependent repositories - 6 downloads last month - 89 stars on GitHub - 2 maintainers
vaspy 0.8.12
A pure Python library designed to make it easy and quick to manipulate VASP files
19 versions - Latest release: about 5 years ago - 1 dependent repositories - 56 downloads last month - 261 stars on GitHub - 1 maintainer
mercury-dataschema 0.0.1
Mercury's DataSchema package allows the automatic recognition and validation of feature types.
1 version - Latest release: about 1 year ago - 2 dependent packages - 92 downloads last month - 11 stars on GitHub - 2 maintainers
unipipe 0.5.4
project_description
16 versions - Latest release: over 1 year ago - 38 downloads last month - 3 stars on GitHub - 1 maintainer
artifician 0.6.4
Artifician is an event driven framework developed to simplify the process of preparation of the d...
35 versions - Latest release: 3 months ago - 77 downloads last month - 10 stars on GitHub - 1 maintainer
thepipe 1.3.8
A lightweight, general purpose pipeline framework.
15 versions - Latest release: almost 2 years ago - 1 dependent package - 2 dependent repositories - 421 downloads last month - 13 stars on GitHub - 4 maintainers
kothon 0.3.1
A Python library that brings Kotlin's Sequence class functionalities and the power of functional ...
7 versions - Latest release: about 2 months ago - 41 downloads last month - 4 stars on GitHub - 2 maintainers
batch-dev 0.0.3
Generic python module for handling dictionary-based batch data
3 versions - Latest release: about 2 months ago - 71 downloads last month - 0 stars on GitHub - 2 maintainers
Top 9.1% on pypi.org
pywren-ibm-cloud 1.7.3
Run many jobs over IBM Cloud
33 versions - Latest release: over 3 years ago - 1 dependent package - 2 dependent repositories - 30 downloads last month - 279 stars on GitHub - 2 maintainers
pygaps 4.5.0
A framework for processing adsorption data for porous materials.
31 versions - Latest release: 11 months ago - 1 dependent repositories - 463 downloads last month - 50 stars on GitHub - 1 maintainer
Top 8.3% on pypi.org
texar-pytorch 0.1.4
Toolkit for Machine Learning and Text Generation
5 versions - Latest release: about 2 years ago - 1 dependent package - 14 dependent repositories - 69 downloads last month - 741 stars on GitHub - 2 maintainers
connectome 0.10.0
A library for datasets containing heterogeneous data
34 versions - Latest release: 24 days ago - 1 dependent repositories - 275 downloads last month - 12 stars on GitHub - 4 maintainers
meltano-target-cratedb 0.0.1
A Singer target for CrateDB, built with the Meltano SDK, and based on the Meltano PostgreSQL target.
1 version - Latest release: 5 months ago - 5 downloads last month - 0 stars on GitHub - 2 maintainers
reki 2024.4.0
A data preparation tool for CEMC/CMA.
2 versions - Latest release: 23 days ago - 1 dependent repositories - 24 downloads last month - 16 stars on GitHub - 2 maintainers
schemarrow 0.1.1a0
A library for switching pandas backend to pyarrow
2 versions - Latest release: about 2 months ago - 39 downloads last month - 2 stars on GitHub - 2 maintainers
uvvispy 0.1.1
Package for handling optical absorption data.
4 versions - Latest release: almost 3 years ago - 1 dependent repositories - 26 downloads last month - 2 stars on GitHub - 2 maintainers
torchglyph 0.3.2
Data Processor Combinators for Natural Language Processing
8 versions - Latest release: about 2 years ago - 2 dependent repositories - 57 downloads last month - 7 stars on GitHub - 2 maintainers
mathbox 0.0.8
A math toolbox.
6 versions - Latest release: over 1 year ago - 1 dependent repositories - 6 downloads last month - 5 stars on GitHub - 2 maintainers
postql 1.0.3
Python wrapper for Postgres
3 versions - Latest release: 2 months ago - 52 downloads last month - 0 stars on GitHub - 2 maintainers
daxpy 0.2
A pre-machine-learning model package
1 version - Latest release: over 2 years ago - 1 dependent repositories - 13 downloads last month - 0 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
wq.io 1.1.0
Consistent iterable API for reading and writing to external datasets
18 versions - Latest release: almost 6 years ago - 1 dependent package - 25 dependent repositories - 130 downloads last month - 51 stars on GitHub - 2 maintainers
Top 8.1% on pypi.org
bonobo-docker 0.6.0
Docker extension for Bonobo
18 versions - Latest release: over 6 years ago - 2 dependent packages - 3 dependent repositories - 99 downloads last month - 13 stars on GitHub - 4 maintainers
nvidia-nvimgcodec-cu12 0.2.0.7
NVIDIA nvimgcodec for CUDA 12. Git SHA:
1 version - Latest release: 3 months ago - 4.43 thousand downloads last month - 18 stars on GitHub - 2 maintainers
nvidia-nvimgcodec-cu11 0.2.0.7
NVIDIA nvimgcodec for CUDA 11. Git SHA:
1 version - Latest release: 3 months ago - 3.17 thousand downloads last month - 18 stars on GitHub - 2 maintainers