Imitation_Learning

Imitation learning on the Lunar Lander game from OpenAI Gym, using behavioral cloning, DAgger, and POMDPs (Partially Observable Markov Decision Processes).

Behavioral Cloning (BC):

Behavioral cloning is the most direct approach to imitation learning: it runs supervised learning on a provided expert dataset of state-action pairs. For discrete actions, we typically maximize the log likelihood of the expert actions, which is equivalent to minimizing cross entropy. For continuous control, the usual choice is to minimize the mean-squared error between predicted and expert actions, though maximum likelihood estimation (MLE) is also an option.
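The two objectives can be sketched in plain Python as follows. This is a minimal illustration, not the notebook's code: the function names `cross_entropy_loss` and `mse_loss` are hypothetical, and the policy outputs are assumed to arrive as lists of logits (discrete case) or action vectors (continuous case).

```python
import math

def cross_entropy_loss(logits, expert_actions):
    """Mean negative log-likelihood of the expert's discrete actions
    under a softmax policy. `logits` holds one list of scores per state."""
    total = 0.0
    for row, action in zip(logits, expert_actions):
        # Log-sum-exp with the max subtracted for numerical stability.
        m = max(row)
        log_z = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_z - row[action]  # equals -log softmax(row)[action]
    return total / len(expert_actions)

def mse_loss(predicted, expert):
    """Mean-squared error between predicted and expert continuous actions,
    averaged over every action dimension of every sample."""
    total, count = 0.0, 0
    for p_row, e_row in zip(predicted, expert):
        for p, e in zip(p_row, e_row):
            total += (p - e) ** 2
            count += 1
    return total / count

# A policy with uniform scores over 4 actions has loss log(4) per sample.
uniform = cross_entropy_loss([[0.0, 0.0, 0.0, 0.0]], [2])
# One two-dimensional action that is off by 2.0 in its second component.
err = mse_loss([[1.0, 2.0]], [[1.0, 0.0]])
```

In practice a deep learning framework would supply these losses and their gradients; the sketch only shows what is being minimized in each case.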

Dataset Aggregation (DAgger):

DAgger is an interactive learning algorithm: the learner can query the expert whenever necessary. In each round, the states visited by the learner's own policy are labeled with the expert's actions and added to the training set. This gives the learner more flexibility, since it is not bound to an initial dataset; it can keep requesting expert labels throughout training.

For DAgger, just as with BC, you'll need to implement the accompanying learn() function.
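The DAgger loop can be sketched on a toy problem as follows. Everything here is an assumption made for illustration and not the notebook's implementation: the environment is a one-dimensional line of integer states rather than Lunar Lander, `expert_policy`, `step`, `rollout`, and `dagger` are hypothetical helpers, and this `learn()` is a stand-in that fits a lookup table by majority vote instead of training a neural network. The sketch also omits the optional mixing of expert and learner actions during rollouts.

```python
from collections import Counter, defaultdict

def expert_policy(state):
    """Toy expert: move right (1) when left of the origin, else left (0)."""
    return 1 if state < 0 else 0

def step(state, action):
    """Toy dynamics on the integers, clipped to [-5, 5]."""
    return max(-5, min(5, state + (1 if action == 1 else -1)))

def learn(dataset):
    """Stand-in for learn(): pick the majority expert action per state.
    States never seen in the data fall back to action 1."""
    votes = defaultdict(Counter)
    for state, action in dataset:
        votes[state][action] += 1
    table = {s: c.most_common(1)[0][0] for s, c in votes.items()}
    return lambda s: table.get(s, 1)

def rollout(policy, start, horizon):
    """Return the states visited when following `policy` from `start`."""
    states, s = [], start
    for _ in range(horizon):
        states.append(s)
        s = step(s, policy(s))
    return states

def dagger(expert_start=-5, learner_start=5, n_iters=5, horizon=10):
    # Round 0 is plain behavioral cloning on expert demonstrations.
    dataset = [(s, expert_policy(s))
               for s in rollout(expert_policy, expert_start, horizon)]
    policy = learn(dataset)
    for _ in range(n_iters):
        # Roll out the learner, then ask the expert to label those states.
        visited = rollout(policy, learner_start, horizon)
        dataset += [(s, expert_policy(s)) for s in visited]
        policy = learn(dataset)  # retrain on the aggregated dataset
    return policy, dataset

bc_only = learn([(s, expert_policy(s))
                 for s in rollout(expert_policy, -5, 10)])
policy, dataset = dagger()
```

In this toy setup the expert demonstrations never reach the positive states, so the cloned policy `bc_only` falls back to its default action there. Each DAgger round exposes the learner to states it reaches on its own and collects the expert's correction for them, which is the point of aggregating data.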

Hilton-AH/Imitation_Learning-Behavioral_Cloning-for-Robot-Learning