An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "data-loading" keyword

celeres-dl 0.1.0
Celeres Data Loader — A Parallel Data Loading System with Constant-Memory Shuffling for Scalable ...
1 version - Latest release: about 1 month ago - 1 maintainer
imdloader 0.1.0
Download and process English Indices of Deprivation data files
1 version - Latest release: 5 months ago - 1 maintainer
bulkflow 0.1.4
A high-performance CSV to PostgreSQL data loader with chunked processing and error handling
3 versions - Latest release: over 1 year ago - 27 downloads last month
zenith-ai 0.3.0
High-Performance Data Infrastructure for Machine Learning - Native Rust Performance
2 versions - Latest release: 3 months ago - 68 downloads last month - 1 maintainer
m2data 0.0.14
A package for reading .m2 files commonly used in GEC tasks
5 versions - Latest release: almost 5 years ago - 1 dependent repositories - 33 downloads last month - 0 stars on GitHub - 1 maintainer
Top 3.6% on pypi.org
dlt 1.23.0
dlt is an open-source python-first scalable data loading library that does not require any backen...
164 versions - Latest release: 18 days ago - 5 dependent packages - 23 dependent repositories - 8.07 million downloads last month - 2,522 stars on GitHub - 2 maintainers
iceberg-loader 0.1.2
A convenience wrapper around PyIceberg for simplified data loading into Apache Iceberg tables
7 versions - Latest release: 3 months ago - 54 downloads last month - 0 stars on GitHub - 1 maintainer
meltano-tap-cratedb 0.0.1
A Singer tap / Meltano extractor for CrateDB, built with the Meltano SDK, and based on the Meltan...
2 versions - Latest release: about 2 months ago - 256 downloads last month - 2 maintainers
hyperload 0.1.0
A high-performance distributed data loader powered by Rust. Load files from S3 and local disk wit...
1 version - Latest release: about 2 months ago - 1 maintainer
dlt-dataops 0.5.4a0
dlt is an open-source python-first scalable data loading library that does not require any backen...
1 version - Latest release: over 1 year ago - 14 downloads last month - 4,742 stars on GitHub - 1 maintainer
bp-storage 1.0.1
Helper Library for various Dataset formats
2 versions - Latest release: over 6 years ago - 1 dependent repositories - 20 downloads last month - 1 stars on GitHub - 1 maintainer
mongo2pq 0.1.0
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
1 version - Latest release: about 2 years ago - 19 downloads last month - 3,558 stars on GitHub - 1 maintainer
Top 9.5% on pypi.org
datarax 0.1.1
Datarax: A high-performance, NNX-based data pipeline framework for JAX
3 versions - Latest release: about 1 month ago - 163 downloads last month - 1 maintainer
elecphys 0.0.57
Electrophysiology data processing
13 versions - Latest release: almost 2 years ago - 81 downloads last month - 1 stars on GitHub - 1 maintainer
pytest-data-loader 0.7.2
Pytest plugin for loading test data for data-driven testing (DDT)
12 versions - Latest release: about 1 month ago - 336 downloads last month - 0 stars on GitHub - 1 maintainer
turboloader 2.25.0
Production-ready ML data loading library with distributed training support, SIMD-accelerated tran...
79 versions - Latest release: about 1 month ago - 822 downloads last month - 2 stars on GitHub - 1 maintainer
hyper-aidev 0.1.1
A Python library to simplify model learning, training, and creation for powerful AI models across...
2 versions - Latest release: 9 months ago - 13 downloads last month - 1 maintainer
datavolt 0.0.1
A reusable workflow for data engineering pipelines
1 version - Latest release: about 1 year ago - 19 downloads last month - 18 stars on GitHub - 1 maintainer
dlt-source-personio 0.0.4
A DLT source for personio
4 versions - Latest release: 4 months ago - 41 downloads last month - 1 stars on GitHub - 1 maintainer
iodloader 0.1.0
Download and process English Indices of Deprivation data
1 version - Latest release: 5 months ago - 17 downloads last month - 0 stars on GitHub - 1 maintainer
podium-nlp 0.1.1
Podium: a framework agnostic Python NLP library for data loading and preprocessing
2 versions - Latest release: almost 5 years ago - 1 dependent repositories - 18 downloads last month - 60 stars on GitHub - 1 maintainer
spltr 0.3.2
A simple PyTorch-based data loader and splitter
3 versions - Latest release: over 6 years ago - 1 dependent repositories - 19 downloads last month - 1 stars on GitHub - 1 maintainer
Related Keywords
python 6 deep-learning 5 machine-learning 5 data-engineering 5 pytorch 4 data-preprocessing 4 etl 4 transform 3 load 3 extract 3 elt 3 data-warehouse 3 data-lake 3 data 3 natural-language-processing 2 preprocessing 2 nlp 2 ai 2 training 2 rust 2 high-performance 2 datasets 2 data-processing 2 learning 2 dataset 2 performance 2 uk-statistics 2 indices-of-deprivation 2 duckdb 2 performance-estimation 1 model-compilation 1 code-generation 1 ml-workflow-automation 1 boilerplate-generation 1 data-profiling 1 api-stub-generation 1 ai-reasoning 1 causal-inference 1 constraint-solving 1 forward-chaining 1 nuance-understanding 1 contextual-sentiment 1 anomaly-detection 1 text-vectorization 1 semantic-similarity 1 intent-recognition 1 named-entity-recognition 1 data-pipelines 1 dl-data-prep 1 data-quality 1 model-monitoring 1 resource-aware-training 1 production-ml 1 human-in-the-loop 1 active-learning 1 feedback-systems 1 iterative-ai 1 mlops 1 edge-ai 1 onnx 1 auto-ml-dl 1 autopilot-ai 1 learning-rate-scheduling 1 adaptive-prediction 1 rule-based-ai 1 experiment-management 1 model-card 1 callbacks 1 tensorboard 1 tflite 1 openvino 1 tensorrt 1 coreml 1 Data loader 1 Data splitter 1 DataLoader 1 random_split 1 train test 1 train test validation split 1 data preprocessing 1 pytorch dataset split 1 data-loader 1 data-preparation 1 data-preprocess 1 data-split 1 data-split-pytorch 1 easy-data-split 1 easy-split 1 easy-to-use 1 neural-networks 1 pytorch-dataloader-objects 1 pytorch-dataset-split 1 splitter 1 train-split-pytorch 1 train-test-split 1 train-test-validation 1 pytorch-dataloaders 1 tensorflow-tf.data 1 reinforcement-learning 1 continual-learning 1