Commit 9917403

Bulk example: Compute immediate output tokens/second
1 parent 547135c commit 9917403

1 file changed (+1, -1)

examples/bulk_inference.py

Lines changed: 1 addition & 1 deletion
@@ -93,6 +93,7 @@

 # We'll always get at least one result for each active job, even if the result contains no output text
 bsz = len(set([r["identifier"] for r in results]))
+num_tokens += bsz

 for result in results:
     if not result["eos"]: continue
@@ -104,7 +105,6 @@

     # Measure performance
     num_completions += 1
-    num_tokens += result["new_tokens"]
     elapsed_time = time.time() - time_begin
     rpm = num_completions / (elapsed_time / 60)
     tps = num_tokens / elapsed_time
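
The effect of moving the token count can be illustrated with a small standalone simulation. This is only a sketch: `simulate` and `count_tokens` are hypothetical helpers, not part of `examples/bulk_inference.py`. It assumes every active job yields exactly one token per iteration and that `new_tokens` on a finished job holds that job's total generated tokens.

```python
def simulate(job_lengths):
    """Hypothetical stand-in for the generator loop: yields one list of
    results per iteration, with one result for every still-active job."""
    remaining = dict(enumerate(job_lengths))
    produced = {ident: 0 for ident in remaining}
    while remaining:
        results = []
        for ident in list(remaining):
            # Assumption: each active job emits exactly one token per iteration
            produced[ident] += 1
            remaining[ident] -= 1
            eos = remaining[ident] <= 0
            results.append({
                "identifier": ident,
                "eos": eos,
                "new_tokens": produced[ident],  # assumed: total tokens for the job
            })
            if eos:
                del remaining[ident]
        yield results


def count_tokens(job_lengths):
    """Return (per_iteration, on_completion) running totals after each iteration."""
    per_iteration = 0   # counting as in the new code: num_tokens += bsz
    on_completion = 0   # counting as in the old code: num_tokens += result["new_tokens"]
    history = []
    for results in simulate(job_lengths):
        bsz = len({r["identifier"] for r in results})
        per_iteration += bsz
        for result in results:
            if not result["eos"]:
                continue
            on_completion += result["new_tokens"]
        history.append((per_iteration, on_completion))
    return history


if __name__ == "__main__":
    for step, (new, old) in enumerate(count_tokens([2, 4]), start=1):
        print(f"iteration {step}: per-iteration count={new}, on-completion count={old}")
```

Under these assumptions both counters agree once every job has finished, but the per-iteration counter already reflects work done while jobs are still running. A `tps` value derived from it is therefore meaningful from the first completion onward instead of lagging until long jobs end.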
