Pull requests: dennybritz/reinforcement-learning

Pull requests list

Update README.md
#248 opened Mar 17, 2023 by pajjaecat
Modify "v (list) : state value function" to "V"
#242 opened Oct 29, 2021 by hslyu
Hello
#241 opened Oct 21, 2021 by simplephi
Update README.md
#240 opened Oct 1, 2021 by hardlyhuman
Minor fixes
#234 opened Dec 22, 2020 by rafardenas
update slides
#233 opened Oct 10, 2020 by harsh306
Exercise notebooks with no outputs.
#207 opened Aug 3, 2019 by avullo
Add Links to Deepnote
#206 opened Aug 1, 2019 by jirkalhotka
Test the policy in "Value Iteration" exercise
#205 opened Jun 23, 2019 by link2xt
Proposal of Expected SARSA algorithm
#197 opened Mar 25, 2019 by AntonioSerrano
Adding k-bandit implementation
#178 opened Oct 1, 2018 by rae83
Create MDP_David_class_first_example.py
#169 opened Jul 11, 2018 by olmerg
Update dqn.py
#165 opened Jun 7, 2018 by zmonoid
updated DQN model for tf 1.0
#115 opened Oct 18, 2017 by Airconaaron
Workaround for environment max step limit of 200.
#107 opened Sep 12, 2017 by sedand
fix the probabilities for each action bug
#86 opened May 26, 2017 by fstonezst