proxy.golang.org : github.com/allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
purl: pkg:golang/github.com/allenai/dolma
Keywords: data-processing, large-language-models, llm, machine-learning, nlp
License: Apache-2.0
Latest release: 3 months ago
First release: about 2 years ago
Stars: 1,314 on GitHub
Forks: 151 on GitHub
Total Commits: 283
Committers: 22
Average commits per author: 12.864
Development Distribution Score (DDS): 0.689
More commit stats: commits.ecosyste.ms
See more repository details: repos.ecosyste.ms
Last synced: 5 days ago