Blackjack Agent

Treating the game of Blackjack as a Markov Decision Process, this research notebook attempts to train an agent to play the game using Deep Q-Learning.
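As a rough illustration of the idea behind Q-learning, the sketch below computes the one-step target that a Q-network is typically regressed toward for a single transition. The function name `td_target` and the default `gamma` are hypothetical choices for this example and are not taken from the notebook.

```python
def td_target(reward, next_q_values, done, gamma=1.0):
    """Return the one-step Q-learning target for a single transition.

    reward        -- reward received after taking the action
    next_q_values -- estimated Q-values for every action in the next state
    done          -- True if the episode ended on this transition
    gamma         -- discount factor (assumed value, not from the notebook)
    """
    if done:
        # Terminal transition: there is no future value to bootstrap from.
        return reward
    # Non-terminal: bootstrap from the best estimated action in the next state.
    return reward + gamma * max(next_q_values)


# A hit that does not end the hand (reward 0) versus a final winning stick.
print(td_target(0.0, [0.2, 0.5], done=False, gamma=0.9))
print(td_target(1.0, [0.0, 0.0], done=True))
```

Because a Blackjack hand only pays out when it ends, most transitions carry a reward of 0 and the target comes entirely from the bootstrapped estimate.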

Packages used

time
collections
gym
numpy
PIL
tensorflow
pyvirtualdisplay
copy

Blackjack Environment

We will use OpenAI's Gym library to load and attempt to solve the Blackjack environment.

The goal of the Blackjack environment is to train an agent to beat the dealer by obtaining cards that sum as close to 21 as possible without going over, while still ending with a higher total than the dealer's hand.

Blackjack-v1 Environment

Action Space

The action space consists of two actions represented by discrete values.

0: Stick
1: Hit
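The two discrete actions can be given readable names in code. The sketch below is illustrative only: the `Action` enum and the `greedy_action` helper are hypothetical names, and the Q-values passed in are made-up numbers rather than outputs of the trained agent.

```python
from enum import IntEnum


class Action(IntEnum):
    """Readable names for the environment's two discrete actions."""
    STICK = 0
    HIT = 1


def greedy_action(q_values):
    """Pick the action with the highest estimated Q-value.

    q_values is a sequence of two numbers, indexed by action id.
    Ties resolve to the lower index (STICK), because max() returns
    the first maximal element it encounters.
    """
    best_index = max(range(len(q_values)), key=lambda i: q_values[i])
    return Action(best_index)


print(greedy_action([0.1, 0.4]).name)
```

Since `Action` is an `IntEnum`, its members compare equal to the plain integers 0 and 1 that the environment expects.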

Observation Space

The agent's observation space is a state vector containing 3 variables:

Player's current sum [int]
Dealer's one showing card (1-10, where 1 is an ace) [int]
Whether a player holds a usable ace [bool]
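To make the three state variables concrete, here is a minimal sketch of how such an observation could be built from a hand. It assumes cards are integers from 1 to 10 (ace as 1, face cards as 10) and that an ace is "usable" when counting it as 11 does not bust the hand; the function names are hypothetical and this is not the environment's own code.

```python
def usable_ace(hand):
    """True if the hand holds an ace that can count as 11 without busting."""
    return 1 in hand and sum(hand) + 10 <= 21


def hand_sum(hand):
    """Total of the hand, counting one ace as 11 when that is safe."""
    if usable_ace(hand):
        return sum(hand) + 10
    return sum(hand)


def observation(player_hand, dealer_showing):
    """Build the (player sum, dealer card, usable ace) state tuple."""
    return (hand_sum(player_hand), dealer_showing, usable_ace(player_hand))


# Ace + 6 is a "soft" 17; drawing a 9 forces the ace back to 1.
print(observation([1, 6], 10))
print(observation([1, 6, 9], 10))
```

The usable-ace flag matters because a soft 17 and a hard 17 call for different decisions: hitting on the soft hand cannot bust it.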

Rewards

Win game: +1
Lose game: -1
Draw: 0
Win game with natural Blackjack: +1.5 if the environment is created with natural=True, else +1
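The reward scheme above can be summarised as a small function of the final totals. This is a sketch under stated assumptions, not the environment's implementation: `blackjack_reward` and its parameter names are hypothetical, and the totals are assumed to be final (after the dealer has finished drawing).

```python
def blackjack_reward(player_sum, dealer_sum, player_has_natural=False, natural=False):
    """Return the end-of-game reward from the player's point of view.

    player_has_natural -- True if the player's first two cards total 21
    natural            -- mirrors the environment option that pays 1.5 for such a win
    """
    if player_sum > 21:
        # The player busts and loses regardless of the dealer's hand.
        return -1.0
    if dealer_sum > 21 or player_sum > dealer_sum:
        # Win: pay the bonus only when the option is on and the hand is a natural.
        return 1.5 if (natural and player_has_natural) else 1.0
    if player_sum < dealer_sum:
        return -1.0
    # Equal totals are a draw.
    return 0.0


print(blackjack_reward(20, 18))
print(blackjack_reward(21, 20, player_has_natural=True, natural=True))
```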
