Ecosyste.ms: Packages
An open API service providing package, version and dependency metadata of many open source software ecosystems and registries.
pypi.org "self-play" keyword
lightzero 0.0.5
A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit.
3 versions - Latest release: about 1 month ago - 353 downloads last month - 858 stars on GitHub - 1 maintainer
Top 5.3% on pypi.org
di-engine 0.5.1
Decision AI Engine
20 versions - Latest release: 4 months ago - 5 dependent repositories - 1.23 thousand downloads last month - 2,554 stars on GitHub - 2 maintainers
tartrl-nightly 0.0.4
reinforcement learning framework
3 versions - Latest release: about 1 year ago - 27 downloads last month - 3 stars on GitHub - 1 maintainer
tartrl 0.0.3
reinforcement learning framework
3 versions - Latest release: about 1 year ago - 24 downloads last month - 3 stars on GitHub - 1 maintainer
convogym 0.1.2
A gym environment to train conversational agents for custom tasks through active learning an...
3 versions - Latest release: over 2 years ago - 1 dependent repository - 25 downloads last month - 20 stars on GitHub - 1 maintainer
Related Keywords
reinforcement-learning (5)
pytorch (4)
reinforcement-learning-algorithms (3)
machine-learning (3)
gym (3)
python (3)
multi-agent (2)
baselines (2)
toolbox (2)
data-science (2)
gymnasium (2)
distributed-training (2)
game-ai (2)
multi-agent-reinforcement-learning (2)
ppo (2)
robotics (2)
atari (2)
offline-rl (1)
multiagent-reinforcement-learning (1)
mujoco (1)
pytorch-rl (1)
model-based-reinforcement-learning (1)
r2d2 (1)
minigrid (1)
smac (1)
active-learning (1)
chatbot-platform (1)
convogym (1)
dialog-systems (1)
natural-language-generation (1)
natural-language-processing (1)
nlp (1)
Reinforcement Learning (1)
MCTS (1)
MuZero (1)
alpha-beta-pruning (1)
alphazero (1)
board-game (1)
board-games (1)
continuous-control (1)
efficientzero (1)
gomoku (1)
gumbel-muzero (1)
mcts (1)
mcts-algorithm (1)
monte-carlo-tree-search (1)
muzero (1)
sampled-muzero (1)
stochastic-muzero (1)
tictactoe (1)
Decision (1)
AI (1)
Engine (1)
distributed-reinforcement-learning (1)
distributed-system (1)
drl (1)
exploration-exploitation (1)
imitation-learning (1)
impala (1)
inverse-reinforcement-learning (1)