File tree Expand file tree Collapse file tree 1 file changed +48
-0
lines changed
Expand file tree Collapse file tree 1 file changed +48
-0
lines changed Original file line number Diff line number Diff line change @@ -253,6 +253,54 @@ <h2 class="title is-3" style="text-align: center;">
253253 </ div >
254254</ section >
255255
256+ < head >
257+ < meta charset ="UTF-8 " />
258+ < meta name ="viewport " content ="width=device-width, initial-scale=1.0 "/>
259+ < title > Note Box</ title >
260+ < style >
261+ .note-box {
262+ border : 3px solid # a57bb3 ;
263+ border-radius : 10px ;
264+ padding : 1.2em ;
265+ background-color : # fefefe ;
266+ position : relative;
267+ font-family : "Georgia" , serif;
268+ max-width : 800px ;
269+ margin : 2em auto;
270+ line-height : 1.6 ;
271+ }
272+
273+ .note-box ::before {
274+ content : "Note" ;
275+ position : absolute;
276+ top : -1.4em ;
277+ left : 0 ;
278+ background-color : # a57bb3 ;
279+ color : white;
280+ padding : 0.4em 1em ;
281+ border-top-left-radius : 10px ;
282+ border-top-right-radius : 10px ;
283+ font-weight : bold;
284+ font-size : 1.2em ;
285+ }
286+
287+ em {
288+ font-style : italic;
289+ }
290+ </ style >
291+ </ head >
292+ < body >
293+
294+ < div class ="note-box ">
295+ Advanced test-time strategies shift evaluation from token-centric metrics (e.g., perplexity, latency) to
296+ < em > task-level throughput</ em > —the number of tasks completed per unit time. This shift is especially relevant for
297+ reasoning tasks, where intermediate steps may vary widely depending on the strategy, yet the ultimate utility hinges
298+ almost entirely on the correctness of the final output. In contrast, traditional tasks like chat completions focus
299+ on token-level quality and throughput.
300+ </ div >
301+
302+ </ body >
303+
256304<!-- Section: Motivation -->
257305< section class ="section hero is-light ">
258306 < div class ="container is-fluid ">
You can’t perform that action at this time.
0 commit comments