Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

pypi.org "dataset" keyword

Top 0.3% on pypi.org
faker 25.2.0 💰
Faker is a Python package that generates fake data for you.
361 versions - Latest release: 14 days ago - 382 dependent packages - 15,807 dependent repositories - 15.2 million downloads last month - 16,716 stars on GitHub - 2 maintainers
Top 1.3% on pypi.org
tensorflow-io-gcs-filesystem 0.37.0
TensorFlow IO
22 versions - Latest release: 26 days ago - 105 dependent packages - 5,470 dependent repositories - 14.4 million downloads last month - 690 stars on GitHub - 3 maintainers
Top 0.6% on pypi.org
tensorflow-datasets 4.9.4
tensorflow/datasets is a library of datasets ready to use with TensorFlow.
33 versions - Latest release: 5 months ago - 116 dependent packages - 3,946 dependent repositories - 4.36 million downloads last month - 4,085 stars on GitHub - 8 maintainers
Top 1.4% on pypi.org
tensorflow-io 0.37.0
TensorFlow IO
44 versions - Latest release: 26 days ago - 19 dependent packages - 293 dependent repositories - 3.92 million downloads last month - 690 stars on GitHub - 6 maintainers
Top 1.0% on pypi.org
tfds-nightly 4.9.4.dev202401070044
tensorflow/datasets is a library of datasets ready to use with TensorFlow.
1,859 versions - Latest release: 5 months ago - 13 dependent packages - 296 dependent repositories - 1.35 million downloads last month - 4,085 stars on GitHub - 8 maintainers
Top 0.7% on pypi.org
torchtext 0.18.0
Text utilities, models, transforms, and datasets for PyTorch.
33 versions - Latest release: about 1 month ago - 92 dependent packages - 2,976 dependent repositories - 722 thousand downloads last month - 3,452 stars on GitHub - 4 maintainers
Top 1.2% on pypi.org
whylogs 1.4.0
Profile and monitor your ML data pipeline end-to-end
313 versions - Latest release: 13 days ago - 6 dependent packages - 413 dependent repositories - 474 thousand downloads last month - 2,482 stars on GitHub - 4 maintainers
Top 0.9% on pypi.org
pandas-datareader 0.10.0
Data readers extracted from the pandas codebase, should be compatible with recent pandas versions
22 versions - Latest release: almost 3 years ago - 73 dependent packages - 3,913 dependent repositories - 467 thousand downloads last month - 2,811 stars on GitHub - 2 maintainers
Top 2.2% on pypi.org
pytest-cases 3.8.5
Separate test code from test cases in pytest.
119 versions - Latest release: about 2 months ago - 61 dependent packages - 227 dependent repositories - 318 thousand downloads last month - 326 stars on GitHub - 1 maintainer
Top 3.9% on pypi.org
fastdownload 0.0.7
A general purpose data downloading library.
7 versions - Latest release: almost 2 years ago - 10 dependent packages - 316 dependent repositories - 218 thousand downloads last month - 44 stars on GitHub - 1 maintainer
Top 9.0% on pypi.org
opendatalab 0.0.10
OpenDataLab Python SDK
83 versions - Latest release: 10 months ago - 1 dependent package - 1 dependent repository - 188 thousand downloads last month - 52 stars on GitHub - 2 maintainers
Top 3.6% on pypi.org
musdb 0.4.2
Python parser for the SIGSEP MUSDB18 dataset
10 versions - Latest release: 6 months ago - 5 dependent packages - 130 dependent repositories - 143 thousand downloads last month - 146 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
quandl 3.7.0
Package for quandl API access
62 versions - Latest release: over 2 years ago - 11 dependent packages - 303 dependent repositories - 125 thousand downloads last month - 1,357 stars on GitHub - 6 maintainers
Top 2.2% on pypi.org
colour-science 0.4.4 💰
Colour Science for Python
23 versions - Latest release: 5 months ago - 23 dependent packages - 94 dependent repositories - 101 thousand downloads last month - 1,969 stars on GitHub - 1 maintainer
Top 2.5% on pypi.org
beir 2.0.0
A Heterogeneous Benchmark for Information Retrieval
29 versions - Latest release: 10 months ago - 9 dependent packages - 30 dependent repositories - 70.3 thousand downloads last month - 1,370 stars on GitHub - 1 maintainer
iden 0.0.3
simple library to manage a dataset of shards to train machine learning models
9 versions - Latest release: 2 months ago - 2 dependent packages - 63.7 thousand downloads last month - 0 stars on GitHub - 1 maintainer
Top 2.6% on pypi.org
mosaicml-streaming 0.7.6
Streaming lets users create PyTorch compatible datasets that can be streamed from cloud-based obj...
27 versions - Latest release: 16 days ago - 4 dependent packages - 61 dependent repositories - 61.5 thousand downloads last month - 703 stars on GitHub - 5 maintainers
Top 0.8% on pypi.org
fake-factory 9999.9.9 💰
The `fake-factory` package was deprecated on December 15th, 2016. Use the `Faker` package instead.
22 versions - Latest release: over 7 years ago - 7 dependent packages - 806 dependent repositories - 58.4 thousand downloads last month - 16,716 stars on GitHub - 2 maintainers
Top 2.5% on pypi.org
rdata 0.11.2
Read R datasets from Python.
17 versions - Latest release: 3 months ago - 8 dependent packages - 17 dependent repositories - 52.4 thousand downloads last month - 34 stars on GitHub - 1 maintainer
Top 5.8% on pypi.org
datadotworld 2.0.0
Python library for data.world
25 versions - Latest release: about 1 month ago - 26 dependent repositories - 42.6 thousand downloads last month - 100 stars on GitHub - 1 maintainer
Top 1.6% on pypi.org
label-studio 1.12.0
Label Studio annotation tool
185 versions - Latest release: about 1 month ago - 1 dependent package - 39 dependent repositories - 41.8 thousand downloads last month - 15,269 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
names-dataset 3.1.0 💰
The python library to handle names
18 versions - Latest release: about 2 years ago - 2 dependent packages - 8 dependent repositories - 41.3 thousand downloads last month - 778 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
img2dataset 1.45.0
Easily turn a set of image urls to an image dataset
87 versions - Latest release: 4 months ago - 2 dependent packages - 10 dependent repositories - 39.1 thousand downloads last month - 3,256 stars on GitHub - 1 maintainer
Top 3.3% on pypi.org
ir-datasets 0.5.7
provides a common interface to many IR ad-hoc ranking benchmarks, training datasets, etc.
18 versions - Latest release: 28 days ago - 10 dependent packages - 21 dependent repositories - 31.5 thousand downloads last month - 296 stars on GitHub - 1 maintainer
Top 7.1% on pypi.org
waymo-open-dataset-tf-2-11-0 1.6.1
Waymo Open Dataset
5 versions - Latest release: 5 months ago - 1 dependent package - 1 dependent repository - 31 thousand downloads last month - 2,546 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
tensorflow-io-nightly 0.31.0.dev20230309180344
TensorFlow IO
902 versions - Latest release: about 1 year ago - 3 dependent repositories - 29.9 thousand downloads last month - 690 stars on GitHub - 5 maintainers
Top 2.8% on pypi.org
split-folders 0.5.1
Split folders with files (e.g. images) into training, validation and test (dataset) folders.
12 versions - Latest release: over 2 years ago - 4 dependent packages - 91 dependent repositories - 26.4 thousand downloads last month - 406 stars on GitHub - 1 maintainer
stringzilla 3.8.3
SIMD-accelerated string search, sort, hashes, fingerprints, & edit distances
37 versions - Latest release: about 1 month ago - 1 dependent package - 23.8 thousand downloads last month - 1,749 stars on GitHub - 1 maintainer
Top 5.4% on pypi.org
cpi 1.1.5 💰
Quickly adjust U.S. dollars for inflation using the Consumer Price Index (CPI)
52 versions - Latest release: 12 days ago - 1 dependent package - 21 dependent repositories - 21.5 thousand downloads last month - 127 stars on GitHub - 1 maintainer
Top 3.0% on pypi.org
tfrecord 1.14.4
TFRecord reader
19 versions - Latest release: 9 months ago - 1 dependent package - 33 dependent repositories - 19.6 thousand downloads last month - 837 stars on GitHub - 1 maintainer
wiz-craft 1.1.1
A CLI-based dataset preprocessing tool for machine learning tasks. Features include data explorat...
6 versions - Latest release: 7 months ago - 18.3 thousand downloads last month - 18 stars on GitHub - 1 maintainer
Top 2.1% on pypi.org
datalad 1.0.2
data distribution geared toward scientific datasets
114 versions - Latest release: about 1 month ago - 43 dependent packages - 78 dependent repositories - 18 thousand downloads last month - 492 stars on GitHub - 5 maintainers
Top 2.9% on pypi.org
dataprofiler 0.10.9
What is in your data? Detect schema, statistics and entities in almost any file.
54 versions - Latest release: 3 months ago - 2 dependent packages - 28 dependent repositories - 17 thousand downloads last month - 1,369 stars on GitHub - 2 maintainers
Top 1.8% on pypi.org
cvat-sdk 2.13.0
CVAT REST API
36 versions - Latest release: 18 days ago - 4 dependent packages - 54 dependent repositories - 14.6 thousand downloads last month - 11,417 stars on GitHub - 3 maintainers
Top 7.2% on pypi.org
xarray-dataclasses 1.7.0
xarray data creation made easy by dataclass
25 versions - Latest release: 7 months ago - 3 dependent packages - 8 dependent repositories - 14.1 thousand downloads last month - 62 stars on GitHub - 1 maintainer
Top 3.5% on pypi.org
torchxrayvision 1.2.3 💰
TorchXRayVision: A library of chest X-ray datasets and models
43 versions - Latest release: 13 days ago - 1 dependent package - 31 dependent repositories - 13.6 thousand downloads last month - 775 stars on GitHub - 2 maintainers
Top 10.0% on pypi.org
mathematics-dataset 1.0.1
A synthetic dataset of school-level mathematics questions
2 versions - Latest release: about 5 years ago - 1 dependent repository - 13 thousand downloads last month - 1,720 stars on GitHub - 2 maintainers
Top 3.1% on pypi.org
datumaro 1.6.1
Dataset Management Framework (Datumaro)
57 versions - Latest release: 24 days ago - 7 dependent packages - 30 dependent repositories - 11.9 thousand downloads last month - 481 stars on GitHub - 3 maintainers
smashed 0.21.5
SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields ext...
64 versions - Latest release: 8 months ago - 2 dependent packages - 1 dependent repository - 11.4 thousand downloads last month - 30 stars on GitHub - 2 maintainers
Top 6.2% on pypi.org
randfacts 0.21.0 💰
Package to generate random facts
56 versions - Latest release: 6 months ago - 5 dependent packages - 24 dependent repositories - 10.3 thousand downloads last month - 18 stars on GitHub - 1 maintainer
Top 4.0% on pypi.org
doccano 1.8.4 💰
doccano, text annotation tool for machine learning practitioners
31 versions - Latest release: 10 months ago - 7 dependent repositories - 10.1 thousand downloads last month - 8,436 stars on GitHub - 1 maintainer
Top 7.7% on pypi.org
neuspell 1.0.0
NeuSpell: A Neural Spelling Correction Toolkit
2 versions - Latest release: about 3 years ago - 1 dependent repository - 10 thousand downloads last month - 634 stars on GitHub - 1 maintainer
Top 4.4% on pypi.org
medmnist 3.0.1
MedMNIST: 18 MNIST-like Datasets for 2D and 3D Biomedical Image Classification
10 versions - Latest release: 4 months ago - 3 dependent packages - 18 dependent repositories - 9.97 thousand downloads last month - 977 stars on GitHub - 1 maintainer
pytest-dataset 0.3.2
Plugin for loading different datasets for pytest by prefix from json or yaml files
4 versions - Latest release: 9 months ago - 9.63 thousand downloads last month - 3 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
tensorflow-io-gcs-filesystem-nightly 0.31.0.dev20230309180344
TensorFlow IO
160 versions - Latest release: about 1 year ago - 1 dependent package - 3 dependent repositories - 9.33 thousand downloads last month - 690 stars on GitHub - 3 maintainers
Top 6.6% on pypi.org
scipp 24.2.0
Multi-dimensional data arrays with labeled dimensions
43 versions - Latest release: 3 months ago - 12 dependent packages - 3 dependent repositories - 8.51 thousand downloads last month - 107 stars on GitHub - 3 maintainers
arctix 0.0.5
A library to get a text summary of nested objects
12 versions - Latest release: 13 days ago - 6.38 thousand downloads last month - 0 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
clip-retrieval 2.44.0
Easily computing clip embeddings and building a clip retrieval system with them
86 versions - Latest release: 4 months ago - 3 dependent repositories - 5.68 thousand downloads last month - 2,163 stars on GitHub - 1 maintainer
Top 9.4% on pypi.org
torchdatasets-nightly 1711929801
PyTorch based library focused on data processing and input pipelines in general.
850 versions - Latest release: about 2 months ago - 1 dependent repository - 5.53 thousand downloads last month - 328 stars on GitHub - 1 maintainer
Top 4.9% on pypi.org
retriever 3.1.0
Data Retriever
12 versions - Latest release: about 2 years ago - 3 dependent repositories - 5.51 thousand downloads last month - 302 stars on GitHub - 4 maintainers
Top 9.8% on pypi.org
synergy-dataset 1.0.3
Python package for the SYNERGY dataset
12 versions - Latest release: 11 months ago - 1 dependent package - 1 dependent repository - 5.35 thousand downloads last month - 54 stars on GitHub - 1 maintainer
zengin-code 1.1.0.20240415 💰
bank codes and branch codes for Japanese.
161 versions - Latest release: about 1 month ago - 1 dependent repository - 5.34 thousand downloads last month - 13 stars on GitHub - 2 maintainers
Top 3.5% on pypi.org
pix2tex 0.1.2
pix2tex: Using a ViT to convert images of equations into LaTeX code.
31 versions - Latest release: about 1 year ago - 2 dependent packages - 5 dependent repositories - 4.58 thousand downloads last month - 10,900 stars on GitHub - 1 maintainer
Top 4.5% on pypi.org
sapien 2.2.2
SAPIEN: A SimulAted Part-based Interactive ENvironment
27 versions - Latest release: 10 months ago - 4 dependent packages - 8 dependent repositories - 4.44 thousand downloads last month - 1 maintainer
Top 6.6% on pypi.org
flwr-datasets 0.1.0
Flower Datasets
3 versions - Latest release: 2 months ago - 7 dependent repositories - 4.28 thousand downloads last month - 3,924 stars on GitHub - 2 maintainers
Top 8.5% on pypi.org
segments-ai 1.8.1
Segments.ai Python SDK
131 versions - Latest release: 13 days ago - 1 dependent package - 3 dependent repositories - 4.21 thousand downloads last month - 20 stars on GitHub - 1 maintainer
Top 8.6% on pypi.org
fastdup 1.123
Fast tool for gaining insights from large image repositories.
332 versions - Latest release: 21 days ago - 1 dependent repository - 3.97 thousand downloads last month - 1,410 stars on GitHub - 4 maintainers
recognizer 1.4
🦉Gracefully face reCAPTCHA challenge with ultralytics YOLOv8-seg, CLIPs VIT-B/16 and CLIP-Seg/RD6...
4 versions - Latest release: about 2 months ago - 1 dependent package - 3.36 thousand downloads last month - 61 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
path-dict 4.0.0
Extends Python's dict with useful extras
20 versions - Latest release: about 1 year ago - 1 dependent package - 3 dependent repositories - 3.04 thousand downloads last month - 22 stars on GitHub - 1 maintainer
lakeapi 0.14.0
API for accessing Lake crypto market data
28 versions - Latest release: 25 days ago - 1 dependent repository - 2.97 thousand downloads last month - 19 stars on GitHub - 1 maintainer
Top 5.0% on pypi.org
convokit 3.0.0
ConvoKit
64 versions - Latest release: 10 months ago - 15 dependent repositories - 2.93 thousand downloads last month - 512 stars on GitHub - 2 maintainers
Top 6.3% on pypi.org
doccano-client 1.2.8
A simple client for doccano API.
15 versions - Latest release: 12 months ago - 1 dependent package - 3 dependent repositories - 2.93 thousand downloads last month - 77 stars on GitHub - 1 maintainer
Top 5.8% on pypi.org
pylabel 0.1.55 💰
Transform, analyze, and visualize computer vision annotations.
57 versions - Latest release: 6 months ago - 1 dependent package - 3 dependent repositories - 2.71 thousand downloads last month - 297 stars on GitHub - 1 maintainer
sgs 2.1.1
Python wrapper for the SGS web service - Time Series Management System of the Banco Centra...
30 versions - Latest release: over 2 years ago - 1 dependent repository - 2.67 thousand downloads last month - 72 stars on GitHub - 1 maintainer
chrome-fingerprints 1.1
A Collection of 10,000 self-collected Chrome Fingerprints. Wrapped in an easy-to-use API, availabl...
2 versions - Latest release: 6 months ago - 1 dependent package - 2.64 thousand downloads last month - 69 stars on GitHub - 1 maintainer
Top 7.4% on pypi.org
mtdata 0.4.1
mtdata is a tool to download datasets for machine translation
23 versions - Latest release: about 1 month ago - 1 dependent package - 3 dependent repositories - 2.59 thousand downloads last month - 139 stars on GitHub - 1 maintainer
cc-net 1.0.0
Tools to download and clean Common Crawl
2 versions - Latest release: over 3 years ago - 1 dependent repository - 2.53 thousand downloads last month - 866 stars on GitHub - 1 maintainer
Top 8.9% on pypi.org
audb 1.7.2
Load and publish databases in audformat
38 versions - Latest release: 11 days ago - 1 dependent package - 4 dependent repositories - 2.39 thousand downloads last month - 20 stars on GitHub - 1 maintainer
ocf-datapipes 3.3.24 💰
Pytorch Datapipes built for use in Open Climate Fix's forecasting work
245 versions - Latest release: 17 days ago - 3 dependent packages - 2.39 thousand downloads last month - 10 stars on GitHub - 2 maintainers
Top 9.9% on pypi.org
cfpq-data 4.0.3
Python package containing Graphs and Grammars for experimental analysis of Context-Free Path Quer...
10 versions - Latest release: 3 months ago - 15 dependent repositories - 1.89 thousand downloads last month - 9 stars on GitHub - 2 maintainers
Top 3.8% on pypi.org
cryptocmd 0.6.4 💰
Cryptocurrency historical market price data scraper.
27 versions - Latest release: 8 months ago - 3 dependent packages - 21 dependent repositories - 1.74 thousand downloads last month - 523 stars on GitHub - 1 maintainer
emnist 0.0
Extended MNIST - Python Package
1 version - Latest release: about 5 years ago - 22 dependent repositories - 1.7 thousand downloads last month - 6 stars on GitHub - 1 maintainer
Top 4.2% on pypi.org
tape-proteins 0.5
Repository of Protein Benchmarking and Modeling
5 versions - Latest release: over 2 years ago - 1 dependent package - 15 dependent repositories - 1.67 thousand downloads last month - 626 stars on GitHub - 1 maintainer
Top 4.7% on pypi.org
mirdata 0.3.8
Common loaders for MIR datasets.
46 versions - Latest release: 7 months ago - 3 dependent packages - 5 dependent repositories - 1.67 thousand downloads last month - 344 stars on GitHub - 3 maintainers
Top 3.5% on pypi.org
trdg 1.8.0 💰
TextRecognitionDataGenerator: A synthetic data generator for text recognition
10 versions - Latest release: almost 2 years ago - 1 dependent package - 17 dependent repositories - 1.47 thousand downloads last month - 3,063 stars on GitHub - 1 maintainer
Top 6.8% on pypi.org
cvat-cli 2.13.0
Command-line client for CVAT
35 versions - Latest release: 18 days ago - 1 dependent repository - 1.46 thousand downloads last month - 11,417 stars on GitHub - 3 maintainers
Top 8.8% on pypi.org
wooldridge 0.4.4
Data sets from Introductory Econometrics: A Modern Approach (6th ed, J.M. Wooldridge)
9 versions - Latest release: almost 3 years ago - 3 dependent repositories - 1.44 thousand downloads last month - 39 stars on GitHub - 1 maintainer
Top 9.6% on pypi.org
frictionless-ckan-mapper 1.0.9
A library for mapping CKAN metadata <=> Frictionless metadata.
9 versions - Latest release: over 1 year ago - 3 dependent packages - 7 dependent repositories - 1.3 thousand downloads last month - 9 stars on GitHub - 4 maintainers
open-mastr 0.14.3
A package that provides an interface for downloading and processing the data of the Marktstammdat...
14 versions - Latest release: about 1 month ago - 1.27 thousand downloads last month - 65 stars on GitHub - 1 maintainer
Top 10.0% on pypi.org
tensorflow-io-plugin-gs-nightly 0.18.0.dev20210513213318
TensorFlow IO
29 versions - Latest release: about 3 years ago - 1 dependent package - 1 dependent repository - 1.23 thousand downloads last month - 690 stars on GitHub - 4 maintainers
moviechat 0.6.3
Long video understanding
10 versions - Latest release: about 1 month ago - 1.13 thousand downloads last month - 408 stars on GitHub - 1 maintainer
globox 2.4.5
Globox is a package and command line interface to read and convert object detection databases (CO...
20 versions - Latest release: about 1 month ago - 1 dependent repository - 1.12 thousand downloads last month - 149 stars on GitHub - 1 maintainer
video2dataset 1.3.0
Easily create large video dataset from video urls
4 versions - Latest release: 4 months ago - 1.04 thousand downloads last month - 449 stars on GitHub - 1 maintainer
Top 9.1% on pypi.org
soundata 0.1.3
Python library for loading and working with sound datasets.
16 versions - Latest release: 4 months ago - 2 dependent repositories - 1.02 thousand downloads last month - 270 stars on GitHub - 2 maintainers
waymo-open-dataset-tf-2-12-0 1.6.4
Waymo Open Dataset
3 versions - Latest release: about 2 months ago - 1.01 thousand downloads last month - 2,551 stars on GitHub - 1 maintainer
genomic-benchmarks 0.0.9
Genomic Benchmarks
8 versions - Latest release: almost 2 years ago - 1 dependent repository - 949 downloads last month - 87 stars on GitHub - 3 maintainers
Top 9.6% on pypi.org
crowsetta 5.0.2
A Python tool to work with any format for annotating animal vocalizations and bioacoustics data
32 versions - Latest release: 4 months ago - 4 dependent packages - 5 dependent repositories - 946 downloads last month - 48 stars on GitHub - 1 maintainer
cc2dataset 1.5.0
Easily convert common crawl to image caption set using pyspark
3 versions - Latest release: 11 months ago - 884 downloads last month - 292 stars on GitHub - 1 maintainer
datasetrising 1.0.4
Toolchain for creating and training Stable Diffusion models with custom datasets
86 versions - Latest release: 6 months ago - 802 downloads last month - 11 stars on GitHub - 1 maintainer
torchvideo 0.0.1
PyTorch video dataset library
2 versions - Latest release: about 3 years ago - 1 dependent repository - 795 downloads last month - 82 stars on GitHub - 1 maintainer
aroma 0.0.0a7
A library to prepare asynchronous time series datasets
3 versions - Latest release: about 1 year ago - 1 dependent repository - 753 downloads last month - 1 star on GitHub - 1 maintainer
Top 9.6% on pypi.org
tensorbay 1.24.2
Graviti TensorBay Python SDK
74 versions - Latest release: over 1 year ago - 1 dependent package - 2 dependent repositories - 742 downloads last month - 75 stars on GitHub - 1 maintainer
extra-keras-datasets 1.2.0 💰
Extending the Keras Datasets module with extra ones.
13 versions - Latest release: over 3 years ago - 3 dependent repositories - 737 downloads last month - 31 stars on GitHub - 1 maintainer
Top 6.7% on pypi.org
pytreebank 0.2.7
Python package for loading Stanford Sentiment Treebank corpus
21 versions - Latest release: over 4 years ago - 2 dependent packages - 17 dependent repositories - 715 downloads last month - 98 stars on GitHub - 1 maintainer
Top 5.2% on pypi.org
fuel 0.2.0
Data pipeline framework for machine learning
2 versions - Latest release: over 7 years ago - 32 dependent repositories - 708 downloads last month - 867 stars on GitHub - 1 maintainer
cross 1.0.3
Tool to cross CSV/TSV datasets
4 versions - Latest release: over 7 years ago - 5 dependent repositories - 700 downloads last month - 1 star on GitHub - 1 maintainer
Top 6.3% on pypi.org
continuum 1.2.7
A clean and simple library for Continual Learning in PyTorch.
51 versions - Latest release: over 1 year ago - 11 dependent repositories - 693 downloads last month - 400 stars on GitHub - 2 maintainers
datamaestro-text 2024.3.10
Datamaestro module for text-related datasets
65 versions - Latest release: 3 months ago - 1 dependent package - 1 dependent repository - 684 downloads last month - 3 stars on GitHub - 1 maintainer
starwhale-bootstrap 0.2.2b6
MLOps Platform
65 versions - Latest release: almost 2 years ago - 1 dependent repository - 672 downloads last month - 187 stars on GitHub - 1 maintainer
nlprep 0.2.1
Download and pre-processing data for nlp tasks
70 versions - Latest release: almost 3 years ago - 1 dependent repository - 635 downloads last month - 28 stars on GitHub - 1 maintainer
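
Each entry in this listing ends with a summary line of " - "-separated fields (versions, latest release, dependents, downloads, stars, maintainers), and some fields are omitted when empty. The following is a minimal Python sketch for turning one such line into a dictionary. The function names `parse_count` and `parse_summary` and the output keys are hypothetical choices made for this illustration and are not part of the Ecosyste.ms service. Note that the listing rounds large numbers ("15.2 million"), so the parsed values are approximations of the displayed figures.

```python
import re

# Word-scaled counts as they appear in the listing ("722 thousand", "15.2 million").
SCALE = {"thousand": 1_000, "million": 1_000_000}

# Label prefixes mapped to normalized keys (hypothetical names chosen for this sketch).
# Prefixes cover both singular and plural forms, e.g. "maintainer" / "maintainers".
LABELS = [
    ("version", "versions"),
    ("dependent package", "dependent_packages"),
    ("dependent repositor", "dependent_repositories"),
    ("downloads last month", "downloads_last_month"),
    ("star", "stars"),
    ("maintainer", "maintainers"),
]

# A numeric field: a count (optionally word-scaled) followed by its label.
FIELD = re.compile(r"([\d.,]+(?: (?:thousand|million))?) (.+)")


def parse_count(text):
    """Turn '15,807', '722 thousand', or '15.2 million' into an int."""
    parts = text.replace(",", "").split()
    value = float(parts[0])
    if len(parts) > 1:
        value *= SCALE[parts[1]]
    return round(value)


def parse_summary(line):
    """Parse one summary line into a dict; fields absent from the line are skipped."""
    stats = {}
    for field in line.split(" - "):
        field = field.strip()
        if field.startswith("Latest release:"):
            # Keep the relative date as text ("14 days ago"); it has no absolute anchor.
            stats["latest_release"] = field.split(":", 1)[1].strip()
            continue
        match = FIELD.fullmatch(field)
        if not match:
            continue
        count, label = match.groups()
        for prefix, key in LABELS:
            if label.startswith(prefix):
                stats[key] = parse_count(count)
                break
    return stats


example = (
    "361 versions - Latest release: 14 days ago - 382 dependent packages - "
    "15,807 dependent repositories - 15.2 million downloads last month - "
    "16,716 stars on GitHub - 2 maintainers"
)
stats = parse_summary(example)
```

Because missing fields are simply left out of the result, callers may want `stats.get("stars", 0)` rather than direct indexing when comparing entries.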