Ecosyste.ms: Packages

An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.

Top 7.3% forks on anaconda.org

anaconda.org : datasets

Datasets is a lightweight library providing two main features: - one-line dataloaders for many public datasets: one-liners to download and pre-process any of the number of datasets major public datasets (text datasets in 467 languages and dialects, image datasets, audio datasets, etc.) provided on the HuggingFace Datasets Hub. With a simple command like squad_dataset = load_dataset("squad"), get any of these datasets ready to use in a dataloader for training/evaluating a ML model (Numpy/Pandas/PyTorch/TensorFlow/JAX), - efficient data pre-processing: simple, fast and reproducible data pre-processing for the above public datasets as well as your own local datasets in CSV/JSON/text/PNG/JPEG/etc. With simple commands like `processed_dataset = dataset.map(process_example)`, efficiently prepare the dataset for inspection and ML model evaluation and training.

Registry - Source - JSON
purl: pkg:conda/datasets
Keywords: computer-vision, datasets, deep-learning, hacktoberfest, machine-learning, natural-language-processing, nlp, numpy, pandas, pytorch, speech, tensorflow
License: Apache-2.0
Latest release: 12 months ago
First release: over 2 years ago
Dependent packages: 4
Dependent repositories: 29
Stars: 15,553 on GitHub
Forks: 2,068 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: about 11 hours ago

transformers 4.37.2
Transformers provides thousands of pretrained models to perform tasks on different modalities suc...
8 versions - Latest release: 2 months ago - 4 dependent packages - 101 dependent repositories - 86,717 stars on GitHub
optimum 1.12.0
🤗 Optimum is an extension of 🤗 Transformers, providing a set of performance optimization tools en...
2 versions - Latest release: 8 months ago - 989 stars on GitHub
evaluate 0.4.0
Evaluate is a library that makes evaluating and comparing models and reporting their performance ...
2 versions - Latest release: about 1 year ago - 3 dependent repositories - 1,219 stars on GitHub
anaconda 5.3.1
Anaconda is a Python distribution which includes over 400 Python packages for science, math, engi...
25 versions - Latest release: over 5 years ago - 206 dependent repositories