pypi.org "proximal-policy-optimization" keyword
View the packages on the pypi.org package registry that are tagged with the "proximal-policy-optimization" keyword.
wacky-rl 0.0.9
Create custom reinforcement learning agents with wacky-rl.
9 versions - Latest release: over 3 years ago - 1 dependent repository - 12 downloads last month - 3 stars on GitHub - 1 maintainer
autonomous-learning-library 0.9.1
A library for building reinforcement learning agents in Pytorch.
18 versions - Latest release: over 1 year ago - 3 dependent repositories - 123 downloads last month - 649 stars on GitHub - 1 maintainer
Top 7.8% on pypi.org
cleanrl 1.2.0 💰
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-f...
13 versions - Latest release: over 2 years ago - 1 dependent repository - 449 downloads last month - 7,766 stars on GitHub - 1 maintainer
openrlhf 0.8.10
A Ray-based High-performance RLHF framework.
71 versions - Latest release: 12 days ago - 2.7 thousand downloads last month - 6,674 stars on GitHub - 1 maintainer
ac_solver 0.1.0 💰
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-f...
1 version - Latest release: about 1 year ago - 34 downloads last month - 6,944 stars on GitHub - 1 maintainer
cleanrl-test 1.1.2 💰
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-f...
5 versions - Latest release: over 2 years ago - 19 downloads last month - 6,944 stars on GitHub - 1 maintainer
wacky-envs 0.0.1
Create custom reinforcement learning environments with wacky-rl.
1 version - Latest release: about 3 years ago - 4 downloads last month - 2 stars on GitHub - 1 maintainer
Related Keywords
reinforcement-learning (7)
ppo (6)
a2c (6)
deep-reinforcement-learning (6)
python (5)
actor-critic (5)
deep-learning (5)
gym (5)
advantage-actor-critic (4)
ale (3)
soft-actor-critic (3)
atari (3)
machine-learning (3)
phasic-policy-gradient (3)
dqn (3)
sac (3)
pytorch (3)
wandb (3)
research (2)
learning (2)
machine (2)
reinforcement (2)
rl-algorithms (2)
rl-agents (2)
policy-gradient (2)
rl (2)
raylib (1)
reinforcement-learning-from-human-feedback (1)
transformers (1)
vllm (1)
environments (1)
envs (1)
openai-o1 (1)
large-language-models (1)
reinforcement-learning-algorithms (1)
dqn-pytorch (1)
deep-q-learning (1)
deep-deterministic-policy-gradient (1)
ddpg (1)
actor_critic (1)