Skip to content

Commit 2ccde3d

Browse files
Pascal BrokmeierPascal Brokmeier
authored andcommitted
kicked the animals out... not using anymore :-( DeepMind took it
1 parent 976c870 commit 2ccde3d

File tree

2 files changed

+3
-12
lines changed

2 files changed

+3
-12
lines changed

src/chaps/body.tex

Lines changed: 0 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -3,9 +3,6 @@ \chapter{Introduction}
33
\input{chaps/methodology.tex}
44

55
\chapter{Background}
6-
The thesis relies on work in three fields of research: Artificial Intelligence, Reinforcement Learning and (Animal)
7-
Cognition. The application of these fields happens in the field of competitive simulations, specifically an energy
8-
market simulation.
96

107
\input{chaps/artificialintelligence.tex}
118
\subsection{Learning}
@@ -30,10 +27,6 @@ \section{Reinforcement Learning}
3027
\input{chaps/reinforcement.tex}
3128

3229
%TODO still needed after paper by DeepMind? --> showed that learning from teacher helps
33-
%\section{Animal Cognition}
34-
%\subsection{Recognition}
35-
%\subsection{Memory}
36-
%\subsection{Social Cognition}
3730

3831
%\section{Competitive Simulations}%as a tool of experimental research into AI
3932

src/chaps/methodology.tex

Lines changed: 3 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,14 +1,12 @@
11
\section{Methodology}
2-
First, I will perform a literature research into the fields of \ac{AI}, \ac{RL} and Animal Cognition. In the field of AI
2+
First, I will perform a literature research into the fields of \ac{AI}, \ac{RL} and competitive simulations in energy
3+
markets. In the field of AI
34
it's sub fields of \ac {SL} and \ac {UL} will be introduced. Here I will focus on the area of \ac{NN} and a form of
45
\emph{learning} called Backpropagation. In the field of \ac{RL} I will focus on the \ac{MDP} framework as well as the
56
\ac{POMDP} subclass. Next follows an introduction of the recent research in using \ac{NN} in \ac{RL} settings to allow
67
for what is now called Deep Reinforcement Learning. This field has seen tremendous success in recent research, allowing
78
for agents that successfully play Atari games and the game Go on superhuman levels of performance
89
\citep{proximalpolicyopt, silver2016mastering}.
9-
%TODO cite
10-
%TODO .
11-
%some explanation about the field of Animal Cognition and animal learning
1210

1311
%After having introduced the basic research of \ac{AI} and \ac{RL}, I will summarize the state of research of Animal
1412
%Cognition, which focuses on how animals and humans learn, act and remember in their environment. Since humans and
@@ -17,7 +15,7 @@ \section{Methodology}
1715
%thesis, as many animals show forms of social learning, concepts of teaching and learning through observation.
1816
%TODO ref
1917

20-
Following the theoretical foundations, I will introduce the concept of competitive simulations in research
18+
Following the theoretical foundations of \ac{AI}, I will introduce the concept of competitive simulations in research
2119
and summarize the \ac{PowerTAC} competition, it's parts and how agents (called brokers in the context of \ac{PowerTAC})
2220
make decisions. This includes an analysis of previous agents solution approaches.
2321

0 commit comments

Comments
 (0)