You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
\ac{PowerTAC} allows developers to download large amounts of historical game records. Several hundred games are
917
-
available for 2017 alone, all with different broker participants and broker counts. The
918
-
\texttt{powertac-tools} repository makes it convenient to download all of them and analyze them for specific data,
919
-
providing csv files for further analysis. I created records using the powertac-tools project for all games downloadable for 2017 to let the broker train on
920
-
the datasets. The customer usage analysis\footnote{\texttt{CustomerProductionConsumption.java}}
921
-
provides a historical dataset to create a
922
-
hypothetical portfolio for the learning \ac{RL} agent. To design a \ac{RL} environment, the broker needs a realistic
923
-
portfolio of required energy. Therefore, a subset of the customers may be chosen to pose as the brokers portfolio. While in a real simulation setting, the customers constantly join and leave brokers tariffs, this offline environment
924
-
approximation would assume a static portfolio. Furthermore, the market prices
917
+
available for 2017 alone, all with different broker participants and broker counts. The\texttt{powertac-tools}
918
+
repository makes it convenient to download all of them and analyze them for specific data, providing csv files for
919
+
further analysis. I created records using the powertac-tools project for all games downloadable for 2017 to let the
920
+
broker train on the datasets. The customer usage analysis\footnote{\texttt{CustomerProductionConsumption.java}}
921
+
provides a historical dataset to create a hypothetical portfolio for the learning \ac{RL} agent. To design a \ac{RL}
922
+
environment, the broker needs a realistic portfolio of required energy. Therefore, a subset of the customers may be
923
+
chosen to pose as the brokers portfolio. While in a real simulation setting, the customers constantly join and leave
924
+
brokers tariffs, this offline environment approximation would assume a static portfolio. Furthermore, the market prices
925
925
analysis\footnote{\texttt{MktPriceStats.java}} gives a historical record of all market closings for each game. In a
926
926
historical data based environment approximation, the market prices don't get influenced by the brokers placement of ask
927
927
or bid orders. This is unrealistic if the broker represents any significant percentage of the overall market but may be
@@ -931,7 +931,7 @@ \subsubsection{Offline record based wholesale environment approximation}%
931
931
improvement is due to the agent not having to wait for the server to inform it about a new open time slot. Instead, the
932
932
timeslot gets artificially stepped whenever the wholesale trader has completed its trades.
933
933
934
-
\subsubsection{Learning from historical actions of teacher agents}%
934
+
\subsubsection{Learning from recorded teacher agent actions}%
0 commit comments