Q-LearningGame

The original version is from vmayoral: basic_reinforcement_learning:tutorial1.

About the game

Hunter always chases the escaper in the shortest path using BFS, and the escaper always learns to escape using Q-Learning algorithm from Reinforcement Learning.

About the Q-Learning

The algorithm of Q-Learning is:

Q(s, a) += alpha * (reward(s,a) + gamma * max(Q(s', a') - Q(s,a))

alpha is the learning rate. gamma is the value of the future reward. It use the best next choice of utility in later state to update the former state.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
__pycache__		__pycache__
resources		resources
venv		venv
.DS_Store		.DS_Store
README.md		README.md
config.py		config.py
config.pyc		config.pyc
dqn.py		dqn.py
greedyMouse.py		greedyMouse.py
qlearn.py		qlearn.py
qlearn.pyc		qlearn.pyc
record.txt		record.txt
setup.py		setup.py
setup.pyc		setup.pyc
temp.ppm		temp.ppm

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Q-LearningGame

About the game

About the Q-Learning

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Q-LearningGame

About the game

About the Q-Learning

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages