Implementation of Trust Region Policy Optimization and Proximal Policy Optimization algorithms on the objective of Robot Walk.
reinforcement-learning
robotics
motion
deep-reinforcement-learning
openai-gym
pytorch
reinforcement-learning-algorithms
trpo
robotics-simulation
pybullet
reinforcement-learning-analysis
gym-environment
ppo
reinforcement-learning-agent
gym-environments
reinforcement-learning-environments
robot-walking
pybullet-environments
pybullet-physics
-
Updated
Mar 9, 2021 - Python