DDPG + HER

Implementation of Deep Deterministic Policy Gradient (DDPG) with the Hindsight Experience Replay (HER) extension on MuJoCo's robotic FetchPickAndPlace environment.

See the vanilla_DDPG branch for the implementation without the HER extension.
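For readers new to HER, the core idea is to replay each transition with goals the agent actually reached later in the same episode, so that some replayed transitions carry a success reward even when the original goal was missed. The sketch below illustrates the commonly used "future" relabeling strategy in plain Python. It is not taken from this repository: the function names, the transition dictionary keys, the `k=4` default, and the sparse 0 / -1 reward with a 0.05 tolerance are all assumptions made for illustration.

```python
import random


def sparse_reward(achieved, goal, tol=0.05):
    """Assumed sparse reward: 0.0 if the achieved goal is within tol of the goal, else -1.0."""
    dist = sum((a - g) ** 2 for a, g in zip(achieved, goal)) ** 0.5
    return 0.0 if dist <= tol else -1.0


def relabel_with_future_goals(episode, reward_fn=sparse_reward, k=4, rng=random):
    """Return the original transitions plus k hindsight copies of each one.

    episode: list of dicts with (hypothetical) keys 'obs', 'action',
    'next_obs', 'achieved_goal' (goal reached after the step) and 'goal'.
    """
    relabeled = []
    for t, step in enumerate(episode):
        # Keep the transition with its original goal.
        original = dict(step)
        original["reward"] = reward_fn(step["achieved_goal"], step["goal"])
        relabeled.append(original)

        # Add k copies whose goal is something achieved at time t or later.
        for _ in range(k):
            future_t = rng.randrange(t, len(episode))
            new_goal = episode[future_t]["achieved_goal"]
            hindsight = dict(step)
            hindsight["goal"] = new_goal
            hindsight["reward"] = reward_fn(step["achieved_goal"], new_goal)
            relabeled.append(hindsight)
    return relabeled
```

Sampling only from the current or later time steps keeps the substituted goal consistent with what the trajectory went on to achieve. A real replay buffer would typically do this relabeling at sampling time on batched arrays rather than on Python dictionaries.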

Dependencies

gym == 0.17.2
matplotlib == 3.1.2
mpi4py == 3.0.3
mujoco-py == 2.0.2.13
numpy == 1.19.1
opencv_contrib_python == 3.4.0.12
psutil == 5.4.2
torch == 1.4.0
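Because the versions above are pinned exactly, it can help to confirm what is actually installed before training. The following is a minimal sketch using only the standard library (`importlib.metadata`, Python 3.8+). The helper name `find_mismatches` is hypothetical, and whether a distribution name such as `opencv_contrib_python` is found under that exact spelling may depend on the Python version.

```python
from importlib.metadata import PackageNotFoundError, version

# Pins copied from the dependency list above.
PINNED = {
    "gym": "0.17.2",
    "matplotlib": "3.1.2",
    "mpi4py": "3.0.3",
    "mujoco-py": "2.0.2.13",
    "numpy": "1.19.1",
    "opencv_contrib_python": "3.4.0.12",
    "psutil": "5.4.2",
    "torch": "1.4.0",
}


def find_mismatches(pinned, get_version=version):
    """Map each package whose installed version differs to (wanted, found).

    found is None when the package is not installed. get_version is
    injectable so the function can be exercised without the real packages.
    """
    mismatches = {}
    for name, wanted in pinned.items():
        try:
            found = get_version(name)
        except PackageNotFoundError:
            found = None
        if found != wanted:
            mismatches[name] = (wanted, found)
    return mismatches
```

Calling `find_mismatches(PINNED)` returns an empty dictionary when every pin is satisfied; otherwise each entry shows the wanted and the installed version.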

Installation

pip3 install -r requirements.txt

Usage

mpirun -np $(nproc) python3 -u main.py

Demo

Result

Reference

Acknowledgement

All credit goes to @TianhongDai for his simplified implementation of OpenAI's original code.
