Imitation learning is supervised learning where data comes as expert demonstration. The expert can be a human or any other agent. Input data is referred to as "state" and output data as "action." In discrete action spaces, it resembles classification; in continuous action spaces, it is regression.
Policy
Behavioral Cloning (BC) is offline imitation learning that use only the collected demonstrations and doesn't use simulator during learning.
- This tutorial is educational purpose, so code isn't optimized for production but easy to understand.
- Each policy training is done in a single jupyter notebook.
- Each directory contain a readme file.
pip install gym==0.26.2
pip install readchar
pip install imageio
pip install -U scikit-learn
- Install PyTorch https://pytorch.org/get-started/locally/
- We will use hdf5 file for robomimic (see the 'readme.md' in robomimic directory to understand the data format) and real robot.
- For rest of the environment we will store as *.pkl file with the following structure.
*.pkl structure we are going to use.