pypi.org : llm-datasets
A collection of datasets for language model training including scripts for downloading, preprocesssing, and sampling.
Registry
-
Source
- Documentation
- JSON
purl: pkg:pypi/llm-datasets
Keywords:
datasets
, language-models
, llm
License: Apache-2.0
Latest release: about 1 year ago
First release: about 1 year ago
Downloads: 71 last month
Stars: 56 on GitHub
Forks: 5 on GitHub
See more repository details: repos.ecosyste.ms
Last synced: 20 days ago