Ecosyste.ms: Packages

An open API service providing package, version, and dependency metadata for many open source software ecosystems and registries.

pypi.org "data-processing" keyword

Top 2.1% on pypi.org
nvidia-dali-cuda110 1.37.1
NVIDIA DALI for CUDA 11.0. Git SHA: d1685acebd5c41743cab0e15890660130e0276ce
24 versions - Latest release: 10 days ago - 5 dependent packages - 95 dependent repositories - 16.3 thousand downloads last month - 4,906 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
nvidia-dali-cuda120 1.37.1
NVIDIA DALI for CUDA 12.0. Git SHA: d1685acebd5c41743cab0e15890660130e0276ce
18 versions - Latest release: 10 days ago - 1 dependent package - 11 dependent repositories - 2.18 thousand downloads last month - 4,906 stars on GitHub - 1 maintainer
Top 6.1% on pypi.org
nvidia-dali-tf-plugin-cuda110 1.37.1
NVIDIA DALI TensorFlow plugin for CUDA 11.0. Git SHA: d1685acebd5c41743cab0e15890660130e0276ce
24 versions - Latest release: 10 days ago - 6 dependent repositories - 1.9 thousand downloads last month - 4,906 stars on GitHub - 1 maintainer
nvidia-dali-tf-plugin-nightly-cuda110 1.38.0.dev20240506
NVIDIA DALI nightly TensorFlow plugin for CUDA 11.0. Git SHA: 80b67f93fcbd57985b35db94e978860233...
108 versions - Latest release: 6 days ago - 807 downloads last month - 4,906 stars on GitHub - 1 maintainer
nvidia-dali-tf-plugin-nightly-cuda120 1.37.0.dev20240409
NVIDIA DALI nightly TensorFlow plugin for CUDA 12.0. Git SHA: fb5786c82c162af3f2120e8ab8cbb8d5d5...
114 versions - Latest release: about 1 month ago - 518 downloads last month - 4,906 stars on GitHub - 1 maintainer
nvidia-dali-tf-plugin-weekly-cuda120 1.38.0.dev20240505
NVIDIA DALI weekly TensorFlow plugin for CUDA 12.0. Git SHA: 80b67f93fcbd57985b35db94e9788602334...
21 versions - Latest release: 9 days ago - 118 downloads last month - 4,906 stars on GitHub - 1 maintainer
nvidia-dali-tf-plugin-cuda120 1.37.1
NVIDIA DALI TensorFlow plugin for CUDA 12.0. Git SHA: d1685acebd5c41743cab0e15890660130e0276ce
17 versions - Latest release: 10 days ago - 177 downloads last month - 4,906 stars on GitHub - 1 maintainer
nvidia-dali-weekly-cuda120 1.38.0.dev20240505
NVIDIA DALI weekly for CUDA 12.0. Git SHA: 80b67f93fcbd57985b35db94e9788602334ea37f
21 versions - Latest release: 9 days ago - 108 downloads last month - 4,906 stars on GitHub - 1 maintainer
nvidia-dali-nightly-cuda120 1.38.0.dev20240507
NVIDIA DALI nightly for CUDA 12.0. Git SHA: 80b67f93fcbd57985b35db94e9788602334ea37f
122 versions - Latest release: 7 days ago - 903 downloads last month - 4,906 stars on GitHub - 1 maintainer
nvidia-dali-nightly-cuda110 1.38.0.dev20240506
NVIDIA DALI nightly for CUDA 11.0. Git SHA: 80b67f93fcbd57985b35db94e9788602334ea37f
109 versions - Latest release: 6 days ago - 1 dependent repository - 799 downloads last month - 4,906 stars on GitHub - 1 maintainer
Top 1.4% on pypi.org
pandera 0.19.3 💰
A light-weight and flexible data validation and testing tool for statistical data objects.
91 versions - Latest release: 3 days ago - 97 dependent packages - 229 dependent repositories - 2.53 million downloads last month - 3,009 stars on GitHub - 3 maintainers
Top 7.9% on pypi.org
texar 0.2.4
Toolkit for Machine Learning and Text Generation
5 versions - Latest release: over 4 years ago - 9 dependent repositories - 31 downloads last month - 2,383 stars on GitHub - 2 maintainers
Top 5.2% on pypi.org
bonobo 0.6.4
Bonobo, a simple, modern and atomic extract-transform-load toolkit for python 3.5+.
37 versions - Latest release: about 5 years ago - 35 dependent repositories - 17.3 thousand downloads last month - 1,573 stars on GitHub - 2 maintainers
Top 3.4% on pypi.org
bytewax 0.19.1
Python Stream Processing
30 versions - Latest release: about 2 months ago - 4 dependent packages - 20 dependent repositories - 6.12 thousand downloads last month - 918 stars on GitHub - 1 maintainer
dolma 1.0.3
Data filters
17 versions - Latest release: about 1 month ago - 16.1 thousand downloads last month - 794 stars on GitHub - 2 maintainers
Top 8.3% on pypi.org
texar-pytorch 0.1.4
Toolkit for Machine Learning and Text Generation
5 versions - Latest release: about 2 years ago - 1 dependent package - 14 dependent repositories - 67 downloads last month - 741 stars on GitHub - 1 maintainer
hstreamdb-api 0.6.1
HStreamDB api for Python
11 versions - Latest release: 10 months ago - 2 dependent repositories - 26 downloads last month - 691 stars on GitHub - 1 maintainer
text-dedup 0.4.0
All-in-one text de-duplication
24 versions - Latest release: 30 days ago - 2 dependent repositories - 389 downloads last month - 479 stars on GitHub - 1 maintainer
lineagemd 0.0.0
Lineage metadata for ML/AI/Data.
1 version - Latest release: over 1 year ago - 10 downloads last month - 452 stars on GitHub - 1 maintainer
hauptai 0.0.0
Haupt ai.
1 version - Latest release: over 1 year ago - 14 downloads last month - 452 stars on GitHub - 1 maintainer
Top 9.8% on pypi.org
haupt 2.1.8
Lineage metadata API, artifacts streams, sandbox, ML-API, and spaces for Polyaxon.
125 versions - Latest release: 27 days ago - 1 dependent package - 947 downloads last month - 452 stars on GitHub - 1 maintainer
Top 5.5% on pypi.org
nonechucks 0.4.2
nonechucks is a library that provides wrappers for PyTorch's datasets, samplers, and transforms t...
18 versions - Latest release: almost 3 years ago - 26 dependent repositories - 1.14 thousand downloads last month - 372 stars on GitHub - 1 maintainer
fondant 1.0.0
Fondant - Large-scale data processing made easy and reusable
45 versions - Latest release: 4 months ago - 1 dependent repository - 890 downloads last month - 316 stars on GitHub - 2 maintainers
Top 4.0% on pypi.org
lithops 3.3.0
Lithops lets you transparently run your Python applications in the Cloud
49 versions - Latest release: 22 days ago - 2 dependent packages - 8 dependent repositories - 2.56 thousand downloads last month - 289 stars on GitHub - 2 maintainers
Top 7.2% on pypi.org
rapidtables 0.1.11
Format/create and print tables from lists of dicts
37 versions - Latest release: over 4 years ago - 9 dependent repositories - 188 downloads last month - 288 stars on GitHub - 1 maintainer
Top 5.1% on pypi.org
padasip 1.2.2
Python Adaptive Signal Processing
13 versions - Latest release: almost 2 years ago - 1 dependent package - 10 dependent repositories - 1.52 thousand downloads last month - 284 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
pywren-ibm-cloud 1.7.3
Run many jobs over IBM Cloud
33 versions - Latest release: over 3 years ago - 1 dependent package - 2 dependent repositories - 300 downloads last month - 279 stars on GitHub - 1 maintainer
Top 5.8% on pypi.org
raydp-nightly 2024.5.1.dev0
RayDP: Distributed Data Processing on Ray
132 versions - Latest release: 16 days ago - 90 dependent repositories - 1.12 thousand downloads last month - 265 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
raydp 1.6.0
RayDP: Distributed Data Processing on Ray
55 versions - Latest release: 9 months ago - 1 dependent package - 70 dependent repositories - 17 thousand downloads last month - 263 stars on GitHub - 2 maintainers
vaspy 0.8.12
A pure Python library designed to make it easy and quick to manipulate VASP files
19 versions - Latest release: about 5 years ago - 1 dependent repository - 261 downloads last month - 261 stars on GitHub - 1 maintainer
Top 3.8% on pypi.org
pysparkling 0.6.2
Pure Python implementation of the Spark RDD interface.
69 versions - Latest release: over 1 year ago - 1 dependent package - 34 dependent repositories - 11.8 thousand downloads last month - 261 stars on GitHub - 2 maintainers
Top 8.4% on pypi.org
forte 0.2.0
Forte is an extensible framework for building composable and modularized NLP workflows.
13 versions - Latest release: about 2 years ago - 6 dependent repositories - 221 downloads last month - 236 stars on GitHub - 1 maintainer
polars-istr 0.0.1
Polars extension for general data science use cases
1 version - Latest release: about 2 months ago - 909 downloads last month - 212 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
heat 1.4.1
A framework for high-performance data analytics and machine learning.
20 versions - Latest release: 4 days ago - 8 dependent repositories - 341 downloads last month - 193 stars on GitHub - 4 maintainers
cotk 0.1.0
Conversational Toolkits
3 versions - Latest release: almost 4 years ago - 2 dependent repositories - 80 downloads last month - 128 stars on GitHub - 1 maintainer
Top 6.6% on pypi.org
libertem 0.14.0
Open pixelated STEM framework
30 versions - Latest release: about 16 hours ago - 2 dependent packages - 3 dependent repositories - 992 downloads last month - 106 stars on GitHub - 3 maintainers
prosto 0.6.0
Data processing toolkit radically changing the way data is processed
5 versions - Latest release: over 2 years ago - 1 dependent repository - 64 downloads last month - 89 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
vip-hci 1.6.0
Package for astronomical high-contrast image processing.
43 versions - Latest release: about 2 months ago - 1 dependent package - 2 dependent repositories - 391 downloads last month - 69 stars on GitHub - 2 maintainers
perke 0.4.4
A keyphrase extractor for Persian
13 versions - Latest release: 11 months ago - 1 dependent repository - 110 downloads last month - 68 stars on GitHub - 1 maintainer
Top 6.9% on pypi.org
itertable 2.2.0
Iterable API for tabular datasets including CSV, XLSX, XML, & JSON.
4 versions - Latest release: 11 months ago - 1 dependent package - 8 dependent repositories - 4.76 thousand downloads last month - 51 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
wq.io 1.1.0
Consistent iterable API for reading and writing to external datasets
18 versions - Latest release: almost 6 years ago - 1 dependent package - 25 dependent repositories - 149 downloads last month - 51 stars on GitHub - 1 maintainer
pygaps 4.5.0
A framework for processing adsorption data for porous materials.
31 versions - Latest release: 11 months ago - 1 dependent repository - 547 downloads last month - 50 stars on GitHub - 1 maintainer
nvidia-nvimgcodec-cu11 0.2.0.7
NVIDIA nvimgcodec for CUDA 11. Git SHA:
1 version - Latest release: 4 months ago - 2 dependent packages - 9.89 thousand downloads last month - 25 stars on GitHub - 1 maintainer
Top 6.4% on pypi.org
bonobo-sqlalchemy 0.6.1
Bonobo SQLAlchemy Extension
14 versions - Latest release: almost 6 years ago - 2 dependent packages - 5 dependent repositories - 498 downloads last month - 25 stars on GitHub - 2 maintainers
nvidia-nvimgcodec-cu12 0.2.0.7
NVIDIA nvimgcodec for CUDA 12. Git SHA:
1 version - Latest release: 4 months ago - 3 dependent packages - 11.5 thousand downloads last month - 25 stars on GitHub - 1 maintainer
zenaton 0.4.2
Zenaton client library
17 versions - Latest release: over 4 years ago - 2 dependent repositories - 123 downloads last month - 24 stars on GitHub - 1 maintainer
abpytools 0.3.2
Python package for antibody analysis
11 versions - Latest release: over 5 years ago - 1 dependent repository - 127 downloads last month - 23 stars on GitHub - 1 maintainer
machine-learning-data-pipeline 1.0.3
Pipeline module for parallel real-time data processing for machine learning models development an...
2 versions - Latest release: over 5 years ago - 1 dependent repository - 22 downloads last month - 22 stars on GitHub - 1 maintainer
smartpipeline 0.7.3
A framework for quickly developing scalable data pipelines following a simple design pattern
11 versions - Latest release: 5 months ago - 1 dependent repository - 68 downloads last month - 21 stars on GitHub - 1 maintainer
Top 6.2% on pypi.org
cdp-backend 4.1.3
Data storage utilities and processing pipelines used by CDP instances.
108 versions - Latest release: 5 months ago - 3 dependent packages - 22 dependent repositories - 2.49 thousand downloads last month - 21 stars on GitHub - 1 maintainer
glide 0.4.1
Easy ETL
45 versions - Latest release: almost 2 years ago - 1 dependent repository - 270 downloads last month - 19 stars on GitHub - 1 maintainer
pyvaspflow 0.0.3
Vasp Calculation
1 version - Latest release: about 4 years ago - 1 dependent repository - 14 downloads last month - 17 stars on GitHub - 1 maintainer
reki 2024.4.0
A data preparation tool for CEMC/CMA.
2 versions - Latest release: about 1 month ago - 1 dependent repository - 132 downloads last month - 16 stars on GitHub - 1 maintainer
sparklanes 0.2.4
A lightweight framework to build and execute data processing pipelines in pyspark (Apache Spark's...
5 versions - Latest release: over 5 years ago - 1 dependent repository - 51 downloads last month - 16 stars on GitHub - 1 maintainer
tasrif 0.1.0
Tasrif is a python library for processing of wearable data from fitness trackers and wearable hea...
7 versions - Latest release: about 2 years ago - 1 dependent repository - 41 downloads last month - 15 stars on GitHub - 1 maintainer
qmm 0.14.0
Quadratic Majorize-Minimize Python toolbox
19 versions - Latest release: 12 months ago - 1 dependent repository - 141 downloads last month - 14 stars on GitHub - 1 maintainer
faster-os 0.0.11
Up to 6700% faster OS module.
1 version - Latest release: about 2 years ago - 1 dependent repository - 10 downloads last month - 14 stars on GitHub - 1 maintainer
thepipe 1.3.8
A lightweight, general purpose pipeline framework.
15 versions - Latest release: almost 2 years ago - 1 dependent package - 2 dependent repositories - 464 downloads last month - 13 stars on GitHub - 2 maintainers
pipe21 1.23.0
simple functional pipes
46 versions - Latest release: about 1 month ago - 4 dependent packages - 378 downloads last month - 13 stars on GitHub - 1 maintainer
Top 8.1% on pypi.org
bonobo-docker 0.6.0
Docker extension for Bonobo
18 versions - Latest release: over 6 years ago - 2 dependent packages - 3 dependent repositories - 332 downloads last month - 13 stars on GitHub - 2 maintainers
codraft 2.2.1 (removed)
Signal and image processing software
11 versions - Latest release: 12 months ago - 79 downloads last month - 12 stars on GitHub - 1 maintainer
connectome 0.10.0
A library for datasets containing heterogeneous data
34 versions - Latest release: about 1 month ago - 1 dependent repository - 246 downloads last month - 12 stars on GitHub - 2 maintainers
msdlib 1.1.13
msdlib is meant to make life easier for the common data scientist/data analyst/ML engineer.
47 versions - Latest release: 3 months ago - 1 dependent repository - 118 downloads last month - 12 stars on GitHub - 1 maintainer
mercury-dataschema 0.0.1
Mercury's DataSchema package allows the automatic recognition and validation of feature types.
1 version - Latest release: about 1 year ago - 2 dependent packages - 199 downloads last month - 11 stars on GitHub - 1 maintainer
datasetops 0.0.6
Fluent dataset operations, compatible with your favorite libraries
4 versions - Latest release: about 4 years ago - 4 dependent repositories - 57 downloads last month - 10 stars on GitHub - 1 maintainer
artifician 0.6.4
Artifician is an event driven framework developed to simplify the process of preparation of the d...
35 versions - Latest release: 4 months ago - 77 downloads last month - 10 stars on GitHub - 1 maintainer
forte.health 0.1.0
NLP pipeline framework for biomedical and clinical domains
1 version - Latest release: almost 2 years ago - 11 downloads last month - 10 stars on GitHub - 3 maintainers
gmeterpy 0.0.2
Processing gravity measurements with Python
2 versions - Latest release: about 5 years ago - 1 dependent repository - 10 downloads last month - 10 stars on GitHub - 1 maintainer
amical 1.6.0
Extraction pipeline and analysis tools for Aperture Masking Interferometry mode of the last gener...
5 versions - Latest release: about 1 year ago - 1 dependent repository - 44 downloads last month - 9 stars on GitHub - 1 maintainer
xl2times 0.1.0
An open source tool to convert TIMES models specified in Excel to a format ready for processing by GAMS
2 versions - Latest release: about 2 months ago - 136 downloads last month - 8 stars on GitHub - 2 maintainers
eyecu-bumblebee 0.5.7
Advanced pipelines for video datasets
46 versions - Latest release: over 3 years ago - 1 dependent repository - 351 downloads last month - 8 stars on GitHub - 1 maintainer
torchglyph 0.3.2
Data Processor Combinators for Natural Language Processing
8 versions - Latest release: about 2 years ago - 2 dependent repositories - 102 downloads last month - 7 stars on GitHub - 1 maintainer
glassflow 1.0.3
GlassFlow Python Client SDK
8 versions - Latest release: about 1 month ago - 153 downloads last month - 7 stars on GitHub - 1 maintainer
sofine 0.2.4
Lightweight framework for creating data-collection plugins and chaining together calls to them, f...
10 versions - Latest release: over 9 years ago - 2 dependent repositories - 23 downloads last month - 7 stars on GitHub - 1 maintainer
mathbox 0.0.8
A math toolbox.
6 versions - Latest release: over 1 year ago - 1 dependent repository - 6 downloads last month - 5 stars on GitHub - 1 maintainer
libertem-blobfinder 0.6.1
LiberTEM correlation and refinement library
5 versions - Latest release: 30 days ago - 1 dependent package - 1 dependent repository - 329 downloads last month - 5 stars on GitHub - 2 maintainers
amanogawa 0.0.1a1
Flexible graph construction and data pre-processing engine
2 versions - Latest release: almost 5 years ago - 1 dependent repository - 39 downloads last month - 5 stars on GitHub - 1 maintainer
bonobo-selenium 0.1.1
Bonobo Selenium Extension
2 versions - Latest release: over 6 years ago - 37 downloads last month - 4 stars on GitHub - 2 maintainers
kothon 0.3.1
A Python library that brings Kotlin's Sequence class functionalities and the power of functional ...
7 versions - Latest release: 2 months ago - 46 downloads last month - 4 stars on GitHub - 1 maintainer
cratedb-toolkit 0.0.10
CrateDB Toolkit
9 versions - Latest release: about 1 month ago - 3 dependent packages - 1 dependent repository - 1.81 thousand downloads last month - 3 stars on GitHub - 3 maintainers
light-pipe 0.3.1
A high-level syntax for data pipelines, designed to make pipeline development quick and painless.
5 versions - Latest release: 12 months ago - 29 downloads last month - 3 stars on GitHub - 1 maintainer
roll-rate-analysis 0.1.7
Roll Rate Analysis python package. Both month over month and snapshot roll rate functionalities a...
6 versions - Latest release: 2 months ago - 23 downloads last month - 3 stars on GitHub - 1 maintainer
unipipe 0.5.4
project_description
16 versions - Latest release: over 1 year ago - 155 downloads last month - 3 stars on GitHub - 1 maintainer
sakura-py 0.9.6
Sakura platform installation files.
20 versions - Latest release: over 3 years ago - 1 dependent repository - 50 downloads last month - 3 stars on GitHub - 1 maintainer
schemarrow 0.1.1a0 💰
A library for switching pandas backend to pyarrow
2 versions - Latest release: 2 months ago - 19 downloads last month - 2 stars on GitHub - 1 maintainer
pystream-pipeline 0.2.0
Python package to create and manage fast parallelized data processing pipeline for real-time appl...
5 versions - Latest release: 6 months ago - 1 dependent repository - 37 downloads last month - 2 stars on GitHub - 1 maintainer
pyfgcz 3.0.1
PyFGCZ contains BioBeamer and FCC python code.
5 versions - Latest release: over 4 years ago - 1 dependent repository - 23 downloads last month - 2 stars on GitHub - 1 maintainer
uvvispy 0.1.1
Package for handling optical absorption data.
4 versions - Latest release: almost 3 years ago - 1 dependent repository - 43 downloads last month - 2 stars on GitHub - 1 maintainer
generatorpipeline 1.0
Parallelize your data-processing pipelines with just a decorator.
1 version - Latest release: over 3 years ago - 1 dependent repository - 9 downloads last month - 2 stars on GitHub - 1 maintainer
dpyp 1.0.0
A pandas convenience wrapper for small-scale data pipelines
1 version - Latest release: 25 days ago - 202 downloads last month - 2 stars on GitHub - 1 maintainer
redis-message-queue 0.8.0
Python message queuing with Redis and message deduplication
11 versions - Latest release: 4 months ago - 238 downloads last month - 2 stars on GitHub - 1 maintainer
dataform 1.0.0
DataForm: Data processing and transformation tool.
1 version - Latest release: 4 months ago - 26 downloads last month - 1 star on GitHub - 1 maintainer
data-science-kit 0.0.1
Data Science Basic Functions
1 version - Latest release: almost 3 years ago - 1 dependent repository - 16 downloads last month - 1 star on GitHub - 1 maintainer
ini2csv 1.0.0
A simple utility that converts and combines a folder of .ini files with identical keys into one c...
1 version - Latest release: 11 months ago - 12 downloads last month - 1 star on GitHub - 2 maintainers
batch-dev 0.0.3
Generic python module for handling dictionary-based batch data
3 versions - Latest release: 2 months ago - 78 downloads last month - 1 star on GitHub - 1 maintainer
cwepr 0.5.1
Package for handling cw-EPR data.
10 versions - Latest release: 3 months ago - 1 dependent repository - 84 downloads last month - 1 star on GitHub - 2 maintainers
ror 0.1.1
Simple pipelining framework in Python
3 versions - Latest release: 4 months ago - 13 downloads last month - 0 stars on GitHub - 1 maintainer
postql 1.0.3
Python wrapper for Postgres
3 versions - Latest release: 3 months ago - 52 downloads last month - 0 stars on GitHub - 1 maintainer
spifpy 1.0.5
Single Particle Image Format (SPIF) data converter and interface
4 versions - Latest release: over 1 year ago - 35 downloads last month - 0 stars on GitHub - 1 maintainer
nhanes-pytool-api 0.1.1
A tool for programmatic access to NHANES downloadable datasets
2 versions - Latest release: 7 months ago - 47 downloads last month - 0 stars on GitHub - 1 maintainer
Related Keywords
python (52), machine-learning (37), deep-learning (25), data-science (22), pytorch (21), image-processing (17), data (13), gpu (13), fast-data-pipeline (12), data-analysis (11), paddle (10), neural-network (10), mxnet (10), image-augmentation (10), audio-processing (10), data-augmentation (10), gpu-tensorflow (10), pipeline (9), data-visualization (9), pipelines (8), natural-language-processing (8), pandas (7), data-cleaning (7), python3 (6), data-preprocessing (6), tensorflow (5), distributed (5), kubernetes (5), workflow (5), spark (4), etl (4), data-engineering (4), mlops (4), analytics (4), extract-transform-load (4), bonobo (4), text-data (4), data-pipeline (4), numpy (3), ui (3), real-time (3), tracking (3), csv (3), serving (3), polyaxon (3), plotly (3), models (3), bokeh (3), data-profiling (3), matplotlib (3), lineage (3), jupyter (3), data science (3), data-analytics (3), reproducible-science (3), reproducible-research (3), data-preparation (3), information-retrieval (3), recipe-driven data analysis (3), good scientific practice (3), reproducible research (3), parallel (3), big-data (3), reproducible science (3), data processing and analysis (3), preprocessing (3), optimization (3), data-mining (3), visualization (3), stream-processing (3), computer-vision (3), automation (3), nlp (3), sqlalchemy (3), json (3), cuda (2), hacktoberfest (2), processing (2), serverless-functions (2), serverless-computing (2), serverless (2), data-processing-pipelines (2), object-storage (2), multiprocessing (2), multicloud (2), cpp (2), cloud-computing (2), big-data-analytics (2), bert (2), casl-project (2), dialog-systems (2), gpt-2 (2), machine learning (2), machine-translation (2), torch (2), data-pipelines (2), electron (2), microscopy (2), electron-microscopy (2), task-queue (2)