Reinforcement Learning: Vision Based Agent trained with Actor Critic playing Slither.IO

Training a vision-based agent with the Actor Critic model in an online environment, implementation in Tensorflow.
Click on the image below to see the DEMO VIDEO:

Requirements:

Tensorflow
Universe
Python

Introduction

In this report, we present the result of training a vision-based agent for Slither.io, an online massively multiplayer browser game that is partially supported by Universe (OpenAi), using Reinforcement Learning (RL) algorithms.
The framework we used is based on the Actor-Critic models, combining with convolutional neural networks (CNN).
During training, we apply several techniques to encourage exploration and keeping our agent at a high entropy state, successfully avoiding the dilemma of having a highly-peaked policy function (𝜋(𝑎|𝑠)) towards a few actions, a known problem with on-policy models.
The agent we trained requires only raw frames from the screen and game states from the AI side, without using opponents’ information. Therefore, the technique applied is general and suitable for training computer agents in other environments which uses raw frames directly.
Our agent is capable of playing against other human players online and survive in this massively multiplayer game, and is proficient at performing tricky moves upon the encounter of enemies, including intensive sharp turns, high speed twist, and circulations.

Environment

Model

Architecture

Screen Preprocess

Training Pipeline

Exploration Tuning Techniques

Usage

python3 ./src/train_AC.py
python3 ./src/play_AC.py

Detail

See the following files for more detail:

- description.pdf
- presentation.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data		data
presentation.key		presentation.key
src		src
LICENSE		LICENSE
Readme.md		Readme.md
description.pdf		description.pdf
presentation.pdf		presentation.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning: Vision Based Agent trained with Actor Critic playing Slither.IO

Requirements:

Introduction

Environment

Model

Architecture

Screen Preprocess

Training Pipeline

Exploration Tuning Techniques

Usage

Detail

About

Releases

Packages

Languages

License

andi611/AC-Actor-Critic-SlitherIO-Tensorflow

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning: Vision Based Agent trained with Actor Critic playing Slither.IO

Requirements:

Introduction

Environment

Model

Architecture

Screen Preprocess

Training Pipeline

Exploration Tuning Techniques

Usage

Detail

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages