CS420: Reinforcement Learning

This repository contains our solutions to the assignment problems of the course "CS420/414 : Reinforcement Learning" offered by Dr. Prabuchandran K J at IIT Dharwad

Assignment 2 : Bandit Algorithms
- Implemented epsilon-greedy, variable epsilon-greedy, Softmax, Upper Confidence Bound (UCB) and Thompson sampling algorithms for Bernoulli and Normal reward setting.
Assignment 3 : Value Based Methods
- A classical maze problem was considered and policy iteration and value iteration were used to solve the problem.
Assignment 4 : Sample Based Monte-Carlo and Temporal Difference Methods
- Implemented Every Visit Monte-Carlo, Q-learning and SARSA agents for classical maze and Mountain Car environment.
Assignment 5 : Temporal Difference methods with function approximation and Reinforce algorithm.
- Implemented Q-learning, SARSA with Tile Coding and Radial basis function approximation methods, and Reinforce with and without baseline for Cart Pole and Mountain Car environment.
Mini Project : Policy Gradient Algorithms for Atari games
- Trained Ray rllib A2C, A3C and PPO agents for Pong, Breakout and Space Invaders atari environments and compared their results along with expalination of each algorithm in the report.

Name		Name	Last commit message	Last commit date
Latest commit History 134 Commits
Assignment3		Assignment3
Assignment_1		Assignment_1
Assignment_2		Assignment_2
Assignment_4		Assignment_4
Assignment_5		Assignment_5
Mini_project		Mini_project
README.md		README.md
Reinforcement Learning Notes.pdf		Reinforcement Learning Notes.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CS420: Reinforcement Learning

About

Releases

Packages

Languages

JS2498/CS420-Reinforcement-Learning

Folders and files

Latest commit

History

Repository files navigation

CS420: Reinforcement Learning

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages