Skip to content

Commit 301739e

Browse files
committed
added analysis past performance
1 parent 31ab69c commit 301739e

File tree

3 files changed

+43
-17
lines changed

3 files changed

+43
-17
lines changed

src/body.tex

Lines changed: 43 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -428,9 +428,9 @@ \subsection{Recurrent Neural Networks}%
428428
A \ac{RNN}, therefore computes its output based on the weights $w_i$, commonly noted as $\theta$, it's current input
429429
$x^t$ and it's previous hidden units internal states $h^{t-1}$.
430430

431-
\[
431+
\begin{equation}
432432
h^t = f(h^{t-1}, x^t, \theta)
433-
\]
433+
\end{equation}
434434

435435
The network generally learns to use $h^t$ to encode previously seen aspects relevant to the current task, although this
436436
is inherently lossy as the previous number of inputs (i.e.\ $\mid t-1\mid$) is arbitrary. Figure~\ref{fig:rnn_concept}
@@ -910,18 +910,18 @@ \subsection{Applying observation learning to PowerTAC}%
910910
applied to boost its learning performance prior to having it interact with a live environment. These techniques are
911911
described below.
912912

913-
\subsubsection{Offline record based wholesale environment approximation}%
913+
\subsubsection{Offline wholesale environment approximation}%
914914
\label{ssub:offline_record_based_wholesale_environment_approximation}
915915

916916
\ac{PowerTAC} allows developers to download large amounts of historical game records. Several hundred games are
917-
available for 2017 alone, all with different broker participants and broker counts. The
918-
\texttt{powertac-tools} repository makes it convenient to download all of them and analyze them for specific data,
919-
providing csv files for further analysis. I created records using the powertac-tools project for all games downloadable for 2017 to let the broker train on
920-
the datasets. The customer usage analysis\footnote{\texttt{CustomerProductionConsumption.java}}
921-
provides a historical dataset to create a
922-
hypothetical portfolio for the learning \ac{RL} agent. To design a \ac{RL} environment, the broker needs a realistic
923-
portfolio of required energy. Therefore, a subset of the customers may be chosen to pose as the brokers portfolio. While in a real simulation setting, the customers constantly join and leave brokers tariffs, this offline environment
924-
approximation would assume a static portfolio. Furthermore, the market prices
917+
available for 2017 alone, all with different broker participants and broker counts. The \texttt{powertac-tools}
918+
repository makes it convenient to download all of them and analyze them for specific data, providing csv files for
919+
further analysis. I created records using the powertac-tools project for all games downloadable for 2017 to let the
920+
broker train on the datasets. The customer usage analysis\footnote{\texttt{CustomerProductionConsumption.java}}
921+
provides a historical dataset to create a hypothetical portfolio for the learning \ac{RL} agent. To design a \ac{RL}
922+
environment, the broker needs a realistic portfolio of required energy. Therefore, a subset of the customers may be
923+
chosen to pose as the brokers portfolio. While in a real simulation setting, the customers constantly join and leave
924+
brokers tariffs, this offline environment approximation would assume a static portfolio. Furthermore, the market prices
925925
analysis\footnote{\texttt{MktPriceStats.java}} gives a historical record of all market closings for each game. In a
926926
historical data based environment approximation, the market prices don't get influenced by the brokers placement of ask
927927
or bid orders. This is unrealistic if the broker represents any significant percentage of the overall market but may be
@@ -931,7 +931,7 @@ \subsubsection{Offline record based wholesale environment approximation}%
931931
improvement is due to the agent not having to wait for the server to inform it about a new open time slot. Instead, the
932932
timeslot gets artificially stepped whenever the wholesale trader has completed its trades.
933933

934-
\subsubsection{Learning from historical actions of teacher agents}%
934+
\subsubsection{Learning from recorded teacher agent actions}%
935935
\label{ssub:learning_from_historical_actions_of_teacher_agents}
936936

937937
The \ac{RL} agent may in addition to a fixed portfolio be taught to imitate the recorded behavior of a teacher broker
@@ -1045,12 +1045,38 @@ \subsubsection{Wholesale market strategies}%
10451045

10461046
\subsubsection{Past performances}%
10471047
\label{ssub:past_performances}
1048-
%TODO STOP :
1049-
% some summary on how they performed in comparison to each other in past competitions.
1050-
% summary pages 2017 http://powertac.org/log_archive/PowerTAC_2017_finals.html
1051-
% 2016 http://powertac.org/log_archive/PowerTAC_2016_finals.html
1052-
% also in-depth graphs of the balances from data, polished up with some python
10531048

1049+
To analyze the performance, all cash statistics of the final rounds of year 2017 were analyzed. TacTex did not
1050+
participate in the 2017 competition and is therefore excluded in this analysis. Their last participation was in 2015
1051+
where they ended up in second place. The improvements made to the previously mentioned agents between their latest
1052+
publications and their current performances are Unfortunately not determinable.
1053+
1054+
When looking at the overall performance profiles (see Figure~\ref{fig:cash_vals_across_games}) of the top 6 brokers of
1055+
the 2017 finals, it becomes obvious that most brokers are performing rather bad most of the time. Only SPOT, fimtac and
1056+
AgentUDE managed to consistently stay close to zero or in the case of AgentUDE even above 0 cash balance. When
1057+
inspecting the tariff transactions closer (see Figure~\ref{fig:allttxucline}, it becomes clear that only AgentUDE
1058+
achieves this through actually being successful in the market. SPOT only acts in the market initially and then quickly
1059+
looses many of its customers. Fimtac keeps a small continuous customer base throughout most games. AgentUDE on the other
1060+
hand trades actively in the market, having a solid number of customers subscribed to it. COLDPower also trades actively
1061+
but its financial results are not as satisfying, loosing significant amounts of money each week and also not being able
1062+
to sustain its continous income towards the end of the games.
1063+
1064+
Generally, AgentUDE can be seen as the peer with the most consistent and stable performance. Their broker acts in all
1065+
parts of the simulation and makes use of various strategies, including tariff optimisation and balancing capacity.
1066+
1067+
\begin{figure}[t]
1068+
\centering
1069+
\includegraphics[width=1.0\linewidth]{img/all-ttx-uc-line.png}
1070+
\caption{Tariff TX credit values across all games in the 2017 finals (rolling average)}
1071+
\label{fig:allttxucline}
1072+
\end{figure}
1073+
1074+
\begin{figure}[t]
1075+
\centering
1076+
\includegraphics[width=1.0\linewidth]{img/cash_vals_across_games.png}
1077+
\caption{Cash values across all games in the 2017 finals (median, 0.25 percentile, 0.75 percentile)}
1078+
\label{fig:cash_vals_across_games}
1079+
\end{figure}
10541080

10551081
\chapter{Implementation}
10561082
\label{cha:implementation}

src/img/all-ttx-uc-line.png

293 KB
Loading

src/img/cash_vals_across_games.png

81.3 KB
Loading

0 commit comments

Comments
 (0)