You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<p>Hi, I'm Evan. I am an Electrical Engineering and Computer Science (EECS) master's student at UC Berkeley, advised by Professor <ahref="https://people.eecs.berkeley.edu/~jiantao/">Jiantao Jiao</a>. I also completed my bachelor's degree in computer science at Berkeley.</p>
44
+
<p>Hi, I'm Evan. I am a member of technical staff at <ahref="https://lmarena.ai/">LMArena</a>. I finished my master's in Electrical Engineering and Computer Science (EECS) at UC Berkeley in May 2025, advised by Professor <ahref="https://people.eecs.berkeley.edu/~jiantao/">Jiantao Jiao</a>. I also completed my bachelor's degree in computer science at Berkeley. In addition, I've been lucky to mentored by <ahref="https://people.eecs.berkeley.edu/~istoica/">Ion Stoica</a>, <ahref="https://banghua.me/">Banghua Zhu</a>, <ahref="https://people.eecs.berkeley.edu/~angelopoulos/">Anastasios Angelopoulos</a>, and <ahref="https://infwinston.github.io/">Wei-Lin Chiang</a>.</p>
45
45
<br>
46
-
<p>My research focuses on Reinforcement Learning with Human Feedback (RLHF) for fine-tuning LLMs. Currently, much of my efforts revolve around reward model training and benchmarking.</p>
46
+
<p>Previously, my research has focused on Reinforcement Learning with Human Feedback (RLHF) for fine-tuning LLMs–– much my efforts revolved around reward model training and benchmarking.</p>
47
47
<br>
48
-
<p>I am also a Research Engineer at <ahref="https://nexusflow.ai/">Nexusflow</a>, where I work on training LLMs like <ahref="https://huggingface.co/Nexusflow/Athene-70B">Athene-70B</a>. I also work with <ahref="https://blog.lmarena.ai/about/">Chatbot Arena</a>, mainly on modeling human preferences and building LLM/RM benchmarks.</p>
48
+
<p>During my master's I was also a Research Engineer at <ahref="https://nexusflow.ai/">Nexusflow</a>, where I trained open-weights LLMs like <ahref="https://huggingface.co/Nexusflow/Athene-70B">Athene-70B</a> and <ahref="https://huggingface.co/Nexusflow/Athene-V2-Chat">Athene-V2-Chat</a>. These days at <ahref="https://lmarena.ai/">LMArena</a>, I'm focused on modeling human preference and shaping data quality.</p>
<p>Evan Frick*, Connor Chen*, Joseph Tennyson*, Tianle Li*, Wei-Lin Chiang*, Anastasios N. Angelopoulos*, and Ion Stoica. (2025).</p>
59
59
<br>
60
60
<b><ahref="https://arxiv.org/abs/2410.14872">How to Evaluate Reward Models for RLHF [ICLR 2025]</a></b>
61
61
<p>Evan Frick, Tianle Li, Connor Chen, Wei-Lin Chiang, Anastasios N. Angelopoulos, Jiantao Jiao, Banghua Zhu, Joseph E. Gonzalez, and Ion Stoica. (2024).</p>
0 commit comments