OpenAI's Mar 2018 request for research highlighted combining Hindsight Experience Replay (HER) with other advances in reinforcement learning (RL) as an open research direction. The goal of HER Variations is to explore these possibilities.
Requires OpenAI Baselines, which can be installed from https://github.com/openai/baselines
- Combine PPO with experience replay
- Add Interpolated Policy Gradient (IPG)
- Build HER roadmap based on paper
- Extend to hindsight experience replay
- Great intro article
- More papers than there is time to read
- I finally stopped and read this helpful Reinforcement Learning (Sutton & Barto) textbook
- Steve Brunton's awesome Control Bootcamp
- Brian Douglas' control playlist
- Emma Brunskill's talks at Simons Institute
- Deep RL Bootcamp
- HER Variation with PPO
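
As background for the roadmap items above, here is a minimal sketch of the hindsight relabeling step that HER adds to an ordinary replay buffer, using the "final" goal-selection strategy. It is not taken from Baselines: `Transition`, `sparse_reward`, and `relabel_final` are hypothetical names chosen for illustration, and the 1-D toy episode is made up. It also only covers the off-policy relabeling itself; an on-policy method such as PPO would likely need some additional correction before it could learn from relabeled data.

```python
from typing import Callable, List, NamedTuple, Tuple

Vector = Tuple[float, ...]


class Transition(NamedTuple):
    """One goal-conditioned step (hypothetical container, not a Baselines type)."""
    state: Vector
    action: int
    next_state: Vector
    achieved_goal: Vector  # goal actually reached in next_state
    goal: Vector           # goal the agent was pursuing
    reward: float


def sparse_reward(achieved: Vector, goal: Vector, tol: float = 1e-6) -> float:
    """Return 0.0 if the goal was reached (within tol), otherwise -1.0."""
    dist_sq = sum((a - g) ** 2 for a, g in zip(achieved, goal))
    return 0.0 if dist_sq <= tol ** 2 else -1.0


def relabel_final(
    episode: List[Transition],
    reward_fn: Callable[[Vector, Vector], float] = sparse_reward,
) -> List[Transition]:
    """'final' strategy: pretend the last achieved goal was the target all along."""
    if not episode:
        return []
    hindsight_goal = episode[-1].achieved_goal
    return [
        t._replace(
            goal=hindsight_goal,
            reward=reward_fn(t.achieved_goal, hindsight_goal),
        )
        for t in episode
    ]


# Toy episode: the agent walks from 0 to 2 while the real goal was 5,
# so every original transition carries the failure reward -1.0.
episode = [
    Transition((0.0,), 1, (1.0,), (1.0,), (5.0,), -1.0),
    Transition((1.0,), 1, (2.0,), (2.0,), (5.0,), -1.0),
]

relabeled = relabel_final(episode)

# The replay buffer stores both the original and the relabeled transitions.
replay_buffer = episode + relabeled
```

In this sketch the relabeled copy of the last transition receives the success reward, which gives the learner a useful signal even though the original episode failed. The HER paper also describes other goal-selection strategies; the "final" strategy is shown here only because it is the simplest.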