|
1 | | -\section*{Preface} |
| 1 | +\section*{Acknowledgements} |
2 | 2 |
|
3 | | -This thesis was planned and discussed in the winter of 17/18. On February 1st, the work phase of six months started. |
4 | | -Within these six months, I discovered many previously unknown or unforeseen complexities. These include the |
5 | | -communication technologies developed to permit a complete python based broker and a large variety of API approaches |
6 | | -within the RL agent libraries currently available. While I invested a significant amount of effort into the |
7 | | -development of the required components, I always intended to build something that may be reused in the future instead of |
8 | | -being discarded after my thesis was graded. This lead me to the decision of implementing a best practice based |
9 | | -communication instead of a minimal approach and lead me to try to write my python code in a way that will let |
10 | | -future broker developers reuse it as a framework for their broker implementations. |
| 3 | +I would like to express my gratitude to my supervisor, Prof. Wolfgang Ketter, who invited me to join the research |
| 4 | +community around PowerTAC and, in doing so, showed me an exciting new field of research. My gratitude also |
| 5 | +extends to John Collins and Nastaran Naseri, who answered many of my questions and guided me during this work. |
11 | 6 |
|
12 | | -Why not just write another broker in Java? I believe PowerTAC answers an important question of our time. But I also |
13 | | -believe there are not enough people working on this field and it doesn't receive the attention it should. Thousands of |
14 | | -researchers and those who want to become one are working on getting AI agents to become better at Atari games or playing |
15 | | -Doom. While the underlying technology advancements are fantastic, the application area is of no use to humanity. I |
16 | | -wanted to apply these new technologies to a problem that matters and do so in a way that will create artifacts that |
17 | | -others can build upon to outperform my solutions quickly. I wanted to create a bridge between the researchers of RL |
18 | | -implementations of recent years and their large community and the exciting field of energy markets. PowerTAC offers |
19 | | -another "game" to play with, another environment to let agents compete in. But it is an environment which actually |
20 | | -generates value when explored and improved. |
| 7 | +Thanks to all contributors to the numerous open source projects who share their work free of charge; it allowed me to |
| 8 | +bring together PowerTAC and current RL research. |
21 | 9 |
|
22 | | -As of July, I was not able to complete my research question and reach the intended target of evaluating a variety of |
23 | | -neural network architectures that let a RL learn from other agents in its environment. Because of university |
24 | | -regulations, changing a thesis title is not permitted. And while my research question was not answered, I believe I |
25 | | -still contributed something valuable to the PowerTAC community. With my implementation, current state-of-the-art neural |
26 | | -network algorithms and especially reinforcement agent implementations can be used to act in the PowerTAC competition. |
27 | | -Any interested researcher with python skills can easily join the competition. And while I was not able to create a well |
28 | | -performing broker in time and compete with the current participants of the competition, it is nonetheless now possible |
29 | | -for others to work on a broker that deploys neural network technologies and to focus on the core problems of RL learning |
30 | | -problems: Environment observation filtering, NN input preprocessing, reward function definition, neural network |
31 | | -architecture experimentation etc. Using the created Docker images, developers are quickly able to start a competition |
32 | | -with multiple brokers and future participants may be encouraged to adopt the Docker based distribution of their agents |
33 | | -to include more advanced technologies in their broker implementations without placing a burden on others to manage these |
34 | | -dependencies. The new communication layer may be adopted by the competition maintainers to improve performance and to |
35 | | -enable other platforms to be used for writing brokers. |
| 10 | +Thanks to my parents, Gudrun and Heiko, for sparking and fostering my curiosity and for allowing me to pursue my education for years while |
| 11 | +always supporting me and believing in me. |
36 | 12 |
|
37 | | -When reading the thesis, please be aware that the title does not match the contents as one would expect. Adding a simple |
38 | | -"Towards" at the beginning of the title would make it a perfect fit again. Unfortunately, I fell into the same trap that |
39 | | -many software engineers and entire project teams fall into: Underestimating the complexity of the project which leads to |
40 | | -either loss in quality, time overruns or budget overruns. I chose quality of the work I completed over making it work |
41 | | -once but being useless for anyone else afterwards. I hope the thesis is still valuable to anyone who reads it and maybe |
42 | | -upcoming graduate theses will continue where I left off. |
| 13 | +Thanks to my partner Giorgia for supporting me mentally throughout these months and helping me survive Naples, my home |
| 14 | +for the duration of this work and a city with many interesting and confusing customs. I must also thank her for |
| 15 | +introducing me to so many fantastic pizzaioli of Naples, whose creative and nourishing pizze fed me |
| 16 | +week after week. |
| 17 | + |
| 18 | +And now that I am back in Germany, I should probably also thank the creative mechanics who repaired my overloaded car |
| 19 | +after it broke down somewhere in the foothills of the Alps, four days before the deadline, on my way back to Cologne. It |
| 20 | +takes a special kind of character to fix a 22-year-old Toyota on a Saturday morning without any spare parts for some |
| 21 | +young academic trying to make his way up north to complete his graduate studies. |
| 22 | + |
| 23 | +%This thesis was planned and discussed in the winter of 17/18. On February 1st, the work phase of six months started. |
| 24 | +%Within these six months, I discovered many previously unknown or unforeseen complexities. These include the |
| 25 | +%communication technologies developed to permit a complete python based broker and a large variety of API approaches |
| 26 | +%within the RL agent libraries currently available. I invested a significant amount of effort into the |
| 27 | +%development of the required components, and I always intended to build something that may be reused in the future instead of |
| 28 | +%being discarded after my thesis was graded. This led me to the decision of implementing a best practice based |
| 29 | +%communication instead of a minimal approach and led me to try to write my Python code in a way that will let |
| 30 | +%future broker developers reuse it as a framework for their broker implementations. |
| 31 | +% |
| 32 | +%Why not just write another broker in Java? I believe PowerTAC answers an important question of our time. But I also |
| 33 | +%believe there are not enough people working on this field and it doesn't receive the attention it should. Thousands of |
| 34 | +%researchers and those who want to become one are working on getting AI agents to become better at Atari games or playing |
| 35 | +%Doom. While the underlying technology advancements are fantastic, the application area is of no use to humanity. I |
| 36 | +%wanted to apply these new technologies to a problem that matters and do so in a way that will create artifacts that |
| 37 | +%others can build upon to outperform my solutions quickly. I wanted to create a bridge between the researchers of RL |
| 38 | +%implementations of recent years and their large community and the exciting field of energy markets. PowerTAC offers |
| 39 | +%another "game" to play with, another environment to let agents compete in. But it is an environment which actually |
| 40 | +%generates value when explored and improved. |
| 41 | +% |
| 42 | +%As of July, I was not able to complete my research question and reach the intended target of evaluating a variety of |
| 43 | +%neural network architectures that let an RL agent learn from other agents in its environment. Because of university |
| 44 | +%regulations, changing a thesis title is not permitted. And while my research question was not answered, I believe I |
| 45 | +%still contributed something valuable to the PowerTAC community. With my implementation, current state-of-the-art neural |
| 46 | +%network algorithms and especially reinforcement agent implementations can be used to act in the PowerTAC competition. |
| 47 | +%Any interested researcher with python skills can easily join the competition. And while I was not able to create a well |
| 48 | +%performing broker in time and compete with the current participants of the competition, it is nonetheless now possible |
| 49 | +%for others to work on a broker that deploys neural network technologies and to focus on the core problems of RL learning |
| 50 | +%problems: Environment observation filtering, NN input preprocessing, reward function definition, neural network |
| 51 | +%architecture experimentation etc. Using the created Docker images, developers are quickly able to start a competition |
| 52 | +%with multiple brokers and future participants may be encouraged to adopt the Docker based distribution of their agents |
| 53 | +%to include more advanced technologies in their broker implementations without placing a burden on others to manage these |
| 54 | +%dependencies. The new communication layer may be adopted by the competition maintainers to improve performance and to |
| 55 | +%enable other platforms to be used for writing brokers. |
| 56 | +% |
| 57 | +%When reading the thesis, please be aware that the title does not match the contents as one would expect. Adding a simple |
| 58 | +%"Towards" at the beginning of the title would make it a perfect fit again. Unfortunately, I fell into the same trap that |
| 59 | +%many software engineers and entire project teams fall into: Underestimating the complexity of the project which leads to |
| 60 | +%either loss in quality, time overruns or budget overruns. I chose quality of the work I completed over making it work |
| 61 | +%once but being useless for anyone else afterwards. I hope the thesis is still valuable to anyone who reads it and maybe |
| 62 | +%upcoming graduate theses will continue where I left off. |