1 parent f3e42f7 commit 6544212
thunder/benchmarks/benchmark_inference.py
@@ -466,11 +466,6 @@ def parse_args() -> argparse.Namespace:
 description="Thunder Inference Benchmark",
 formatter_class=CustomFormatter,
 epilog="""
-Standard Benchmark Scenarios:
- summarization - Prefill-Heavy: 4,000 input → 1,000 output tokens
- chat - Balanced: 1,000 input → 1,000 output tokens
- reasoning - Decode-Heavy: 1,000 input → 4,000 output tokens
-
 Examples:
 python benchmark_inference.py --input-length 2048 --output-length 512 --model-name meta-llama/Llama-4-Maverick-17B-128E --mode eager
 """,
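For reference, the following is a minimal sketch of how the parser's help output might look once the scenario list is removed from the epilog. It is not the code in `benchmark_inference.py`: the definition of `CustomFormatter`, the helper name `build_parser`, and every argument default and help string are assumptions made for illustration. Only the description, the epilog text, and the four flag names come from the diff above.

```python
import argparse


class CustomFormatter(
    argparse.ArgumentDefaultsHelpFormatter,
    argparse.RawDescriptionHelpFormatter,
):
    """Assumed definition: print defaults and keep the epilog's line breaks."""


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical helper; the real script builds its parser inside parse_args().
    parser = argparse.ArgumentParser(
        description="Thunder Inference Benchmark",
        formatter_class=CustomFormatter,
        epilog="""
Examples:
  python benchmark_inference.py --input-length 2048 --output-length 512 --model-name meta-llama/Llama-4-Maverick-17B-128E --mode eager
""",
    )
    # Flag names are taken from the example command; the defaults and help
    # strings below are placeholders, not the script's actual values.
    parser.add_argument("--input-length", type=int, default=2048, help="prompt length in tokens")
    parser.add_argument("--output-length", type=int, default=512, help="number of tokens to generate")
    parser.add_argument("--model-name", type=str, default=None, help="model identifier")
    parser.add_argument("--mode", type=str, default="eager", help="execution mode")
    return parser


if __name__ == "__main__":
    # Print the help text, which ends with the raw epilog shown above.
    print(build_parser().format_help())
```

Because the assumed formatter inherits from `argparse.RawDescriptionHelpFormatter`, the epilog is printed as written rather than re-wrapped, so the long example command stays on one line.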