N-Dimensional Prisoner's Dilemma

Multiple agents were trained with reinforcement learning by playing an N-player prisoner's dilemma game against each other. Each agent looks at the previous 15 actions of every player and decides its current move; the state is defined as those previous 15 moves of all players combined. Each episode consists of T rounds, during which the agent learns by updating its weights according to the reward it receives after each round. M episodes are played, and the state is reset after each episode.
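The training setup described above can be sketched roughly as follows. This is a minimal illustration, not the code from this repository: the payoff function, the all-cooperate initial history, and the `RandomAgent` placeholder are all assumptions made for the example, since the text does not specify them. Only the 15-move history window and the N, M, T structure come from the description.

```python
import random
from collections import deque

HISTORY = 15                 # past rounds each agent observes (from the text)
COOPERATE, CHEAT = 0, 1      # hypothetical action encoding


def initial_history(n_players, history=HISTORY):
    """Fresh history; starting from all-cooperate is an assumed default."""
    return deque([[COOPERATE] * n_players for _ in range(history)],
                 maxlen=history)


def encode_state(history):
    """Flatten the last HISTORY joint moves into one vector of length HISTORY * N."""
    return [move for joint in history for move in joint]


def payoffs(actions, benefit=2.0, cost=1.0):
    """Hypothetical public-goods payoff, NOT taken from the source.

    Each cooperator pays `cost`; every player receives `benefit` times
    the fraction of players who cooperated.
    """
    share = benefit * actions.count(COOPERATE) / len(actions)
    return [share - (cost if a == COOPERATE else 0.0) for a in actions]


class RandomAgent:
    """Placeholder policy; a learning agent would replace act() and learn()."""

    def __init__(self, rng):
        self.rng = rng

    def act(self, state):
        return self.rng.choice((COOPERATE, CHEAT))

    def learn(self, state, action, reward, next_state):
        pass  # a learning agent would update its weights here, after each round


def run(n_players=5, episodes=3, rounds=50, seed=0):
    """Play M=`episodes` episodes of T=`rounds` rounds; return mean score per episode."""
    rng = random.Random(seed)
    agents = [RandomAgent(rng) for _ in range(n_players)]
    mean_scores = []
    for _ in range(episodes):
        history = initial_history(n_players)      # state is reset every episode
        total = 0.0
        for _ in range(rounds):
            state = encode_state(history)
            actions = [agent.act(state) for agent in agents]
            rewards = payoffs(actions)
            history.append(actions)                # oldest round drops out
            next_state = encode_state(history)
            for agent, action, reward in zip(agents, actions, rewards):
                agent.learn(state, action, reward, next_state)
            total += sum(rewards)
        mean_scores.append(total / (rounds * n_players))
    return mean_scores
```

With N=5 the state vector has 15 × 5 = 75 entries. Under the assumed payoff, cheating is individually better in any single round, yet all players cooperating scores higher than all players cheating, which is the tension the agents have to learn their way through.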

Figure 1: N=5, M=200, T=50

Figure 2: N=5, M=2000, T=50

Figure 3: N=5, M=10000, T=50

Initially the models compete with each other, which gradually reduces the score. Later, the models experiment with cooperating, looking for a possible uptick. When run even longer (see Figure 3), they converge to a score better than the one obtained when all players cheat.


prathyoom/Prisoner-s-Dilemma
