Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "RLHF" keyword
shtec-rlhf 0.0.2.dev0
shtec-rlhf: Safe Reinforcement Learning from Human Feedback
3 versions - Latest release: about 1 month ago - 30 downloads last month - 1 maintainer
nemo-aligner 0.2.0
NeMo-Aligner - a toolkit for model alignment
2 versions - Latest release: 2 months ago - 77 downloads last month - 179 stars on GitHub - 1 maintainer
Related Keywords
Reinforcement Learning (1)
Safe Reinforcement Learning (1)
Reinforcement Learning from Human Feedback (1)
Safe Reinforcement Learning from Human Feedback (1)
Large Language Model (1)
Language Model (1)
Safe RLHF (1)
LLM (1)
deep learning (1)
machine learning (1)
gpu (1)
NLP (1)
NeMo (1)
nvidia (1)
pytorch (1)
torch (1)
language (1)
reinforcement learning (1)
preference modeling (1)
SteerLM (1)
DPO (1)