|
3 | 3 | Benchmarks Metadata: |
4 | 4 | Run id:93e36b31-b454-471d-ba62-6b2671585485 |
5 | 5 | Duration:30.2 seconds |
6 | | - Profile:type=sweep, strategies=['synchronous', 'throughput', 'constant', |
7 | | - 'constant', 'constant', 'constant', 'constant', 'constant', 'constant', |
8 | | - 'constant'], max_concurrency=None |
9 | | - Args:max_number=None, max_duration=30.0, warmup_number=None, |
10 | | - warmup_duration=None, cooldown_number=None, cooldown_duration=None |
11 | | - Worker:type_='generative_requests_worker' backend_type='openai_http' |
12 | | - backend_target='example_target' backend_model='example_model' |
13 | | - backend_info={'max_output_tokens': 16384, 'timeout': 300, 'http2': True, |
14 | | - 'authorization': False, 'organization': None, 'project': None, |
15 | | - 'text_completions_path': '/v1/completions', 'chat_completions_path': |
16 | | - '/v1/chat/completions'} |
17 | | - Request Loader:type_='generative_request_loader' |
18 | | - data='prompt_tokens=256,output_tokens=128' data_args=None |
19 | | - processor='example_processor' processor_args=None |
| 6 | + Profile:type=sweep, strategies=['synchronous', 'throughput', 'constant', 'constant', 'constant', 'constant', |
| 7 | + 'constant', 'constant', 'constant', 'constant'], max_concurrency=None |
| 8 | + Args:max_number=None, max_duration=30.0, warmup_number=None, warmup_duration=None, cooldown_number=None, |
| 9 | + cooldown_duration=None |
| 10 | + Worker:type_='generative_requests_worker' backend_type='openai_http' backend_target='example_target' |
| 11 | + backend_model='example_model' backend_info={'max_output_tokens': 16384, 'timeout': 300, 'http2': True, |
| 12 | + 'authorization': False, 'organization': None, 'project': None, 'text_completions_path': '/v1/completions', |
| 13 | + 'chat_completions_path': '/v1/chat/completions'} |
| 14 | + Request Loader:type_='generative_request_loader' data='prompt_tokens=256,output_tokens=128' data_args=None |
| 15 | + processor='example_processor' processor_args=None |
20 | 16 | Extras:None |
21 | 17 |
|
22 | 18 |
|
23 | 19 | Benchmarks Info: |
24 | | -================================================================================ |
25 | | -=================================================================== |
26 | | -Metadata |||| Requests Made ||| Prompt |
27 | | -Tok/Req ||| Output Tok/Req ||| Prompt Tok Total||| Output Tok Total || |
28 | | - Benchmark| Start Time| End Time| Duration (s)| Comp| Inc| Err| Comp| |
29 | | -Inc| Err| Comp| Inc| Err| Comp| Inc| Err| Comp| Inc| Err |
30 | | ------------|-----------|---------|-------------|------|-----|-----|------|------ |
31 | | -|----|-------|-----|-----|-------|-----|-----|-------|------|------ |
32 | | -synchronous| 16:59:28| 16:59:58| 30.0| 46| 1| 0| 257.1| |
33 | | -256.0| 0.0| 128.0| 0.0| 0.0| 11827| 256| 0| 5888| 0| 0 |
34 | | -================================================================================ |
35 | | -=================================================================== |
| 20 | +======================================================================================================================== |
| 21 | +=========================== |
| 22 | +Metadata |||| Requests Made ||| Prompt Tok/Req ||| Output Tok/Req ||| Prompt Tok |
| 23 | +Total||| Output Tok Total || |
| 24 | + Benchmark| Start Time| End Time| Duration (s)| Comp| Inc| Err| Comp| Inc| Err| Comp| Inc| Err| Comp| Inc| |
| 25 | +Err| Comp| Inc| Err |
| 26 | +-----------|-----------|---------|-------------|------|-----|-----|------|------|----|-------|-----|-----|-------|-----| |
| 27 | +-----|-------|------|------ |
| 28 | +synchronous| 16:59:28| 16:59:58| 30.0| 46| 1| 0| 257.1| 256.0| 0.0| 128.0| 0.0| 0.0| 11827| 256| |
| 29 | +0| 5888| 0| 0 |
| 30 | +======================================================================================================================== |
| 31 | +=========================== |
36 | 32 |
|
37 | 33 |
|
38 | 34 | Benchmarks Stats: |
39 | | -================================================================================ |
40 | | -=============================================================== |
41 | | -Metadata | Request Stats || Out Tok/sec| Tot Tok/sec| Req Latency |
42 | | -(sec) ||| TTFT (ms) ||| ITL (ms) ||| TPOT (ms) || |
43 | | - Benchmark| Per Second| Concurrency| mean| mean| mean| median| |
44 | | -p99| mean| median| p99| mean| median| p99| mean| median| p99 |
45 | | ------------|-----------|------------|------------|------------|------|--------|- |
46 | | ------|-----|-------|-----|-----|-------|----|-----|-------|---- |
47 | | -synchronous| 1.55| 1.00| 198.1| 992.7| 0.64| 0.64| |
48 | | -0.69| 16.8| 16.4| 21.3| 4.9| 4.9| 5.3| 4.9| 4.9| 5.2 |
49 | | -================================================================================ |
50 | | -=============================================================== |
| 35 | +======================================================================================================================== |
| 36 | +======================= |
| 37 | +Metadata | Request Stats || Out Tok/sec| Tot Tok/sec| Req Latency (sec) ||| TTFT (ms) ||| ITL (ms) |
| 38 | +||| TPOT (ms) || |
| 39 | + Benchmark| Per Second| Concurrency| mean| mean| mean| median| p99| mean| median| p99| mean| median| |
| 40 | +p99| mean| median| p99 |
| 41 | +-----------|-----------|------------|------------|------------|------|--------|------|-----|-------|-----|-----|-------| |
| 42 | +----|-----|-------|---- |
| 43 | +synchronous| 1.55| 1.00| 198.1| 992.7| 0.64| 0.64| 0.69| 16.8| 16.4| 21.3| 4.9| 4.9| |
| 44 | +5.3| 4.9| 4.9| 5.2 |
| 45 | +======================================================================================================================== |
| 46 | +======================= |
0 commit comments