proxy.golang.org "data-prep" keyword
Top 5.8% on proxy.golang.org
14 versions - Latest release: 4 months ago - 613 stars on GitHub
github.com/IBM/data-prep-kit v1.1.6
Open source project for data preparation of LLM application builders14 versions - Latest release: 4 months ago - 613 stars on GitHub
Top 5.8% on proxy.golang.org
15 versions - Latest release: 22 days ago - 613 stars on GitHub
github.com/ibm/data-prep-kit v1.1.7
Open source project for data preparation of LLM application builders15 versions - Latest release: 22 days ago - 613 stars on GitHub
Top 5.6% on proxy.golang.org
11 versions - Latest release: 5 months ago - 918 stars on GitHub
github.com/NVIDIA/NeMo-Curator v1.0.0
Scalable data pre processing and curation toolkit for LLMs11 versions - Latest release: 5 months ago - 918 stars on GitHub
Related Keywords
data
3
data-preparation
3
python
3
datacuration
3
datarecipes
3
deduplication
3
large-language-models
3
large-scale-data-processing
3
llm
3
llmapps
3
spark
2
ray
2
malware
2
finetuning
2
data-preprocessing-pipelines
2
data-preprocessing
2
code-quality
2
data-curation
1
data-processing
1
data-processing-pipelines
1
data-quality
1
fast-data-processing
1
fine-tuning
1
llm-data-quality
1
semantic-deduplication
1