Commit f4a2cca
[VLM] Fixing request timeout error, and enable VllmDeployer to fail fast if the underying
* enable VllmDeployer to fail fast if the underying vllm process failed.
* example slurm script for submitting jobs
* fix slurm scripts
* small fix
* [Automated Commit] Format Codebase
* Update the readme about the example slurm scripts.
* Change the default endpoint startup timeout to 1 hour in case someone need to download the model for the fisrt time.
* change servr expected qps and target latency
* Change the default dataset repo_id to the new name of the public dataset
* [Automated Commit] Format Codebase
* evaluate the json file with multiprocess
* [Automated Commit] Format Codebase
* change default server_target_latency to 12
* revert evaluation changeS
* [Automated Commit] Format Codebase
* update slurm script
* update slurm script
* revert evaluation.py changes after analysing the discrepancy in is_secondhand f1 score
* [Automated Commit] Format Codebase
* linting
* [Automated Commit] Format Codebase
* lock in model and dataset SHA
* [Automated Commit] Format Codebase
* Specify model quality target and server target latency in the README
* Update loadgen/mlperf.conf
* aligning TestSettings'C++ code with its python binding
* [Automated Commit] Format Codebase
* remove ttft and tpot from mlperf.conf
* Enable CLI to take in user.conf
* [Automated Commit] Format Codebase
* readme
* readme
* rename vl2l -> q3vl
* [Automated Commit] Format Codebase
* empty
* rerun ci
* rerun ci
* Introduce sampling parameters
* [Automated Commit] Format Codebase
* [Automated Commit] Format Codebase
* empty
* move CFLAGS="-std=c++14 -O3" into extra_compile_args of Pybind11Extension
* [Automated Commit] Format Codebase
* enable specifying loadgen source in the Dockerfile
* update slurm scripts
* Maintain None as the default value for the sampling params
* [Automated Commit] Format Codebase
* update readme
* [Automated Commit] Format Codebase
* empty
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: John Calderon <jcalderon@nvidia.com>vllm serve process already failed (#2409)1 parent e93f59d commit f4a2cca
File tree
28 files changed
+1919
-972
lines changed- loadgen
- bindings
- multimodal
- qwen3-vl
- docker
- notebooks
- scripts
- slurm
- src/mlperf_inf_mm_q3vl
- vl2l
- src/mlperf_inference_multimodal_vl2l
- text_to_video/wan2.2-t2v-14b
28 files changed
+1919
-972
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
15 | | - | |
| 15 | + | |
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
312 | 312 | | |
313 | 313 | | |
314 | 314 | | |
| 315 | + | |
| 316 | + | |
315 | 317 | | |
316 | 318 | | |
317 | 319 | | |
| |||
324 | 326 | | |
325 | 327 | | |
326 | 328 | | |
| 329 | + | |
| 330 | + | |
| 331 | + | |
| 332 | + | |
| 333 | + | |
| 334 | + | |
| 335 | + | |
| 336 | + | |
327 | 337 | | |
328 | 338 | | |
329 | 339 | | |
| |||
333 | 343 | | |
334 | 344 | | |
335 | 345 | | |
336 | | - | |
337 | | - | |
338 | | - | |
339 | | - | |
340 | | - | |
341 | | - | |
342 | 346 | | |
343 | 347 | | |
344 | 348 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
| 30 | + | |
30 | 31 | | |
31 | 32 | | |
32 | 33 | | |
| |||
69 | 70 | | |
70 | 71 | | |
71 | 72 | | |
72 | | - | |
| 73 | + | |
73 | 74 | | |
74 | 75 | | |
75 | 76 | | |
| |||
94 | 95 | | |
95 | 96 | | |
96 | 97 | | |
97 | | - | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
98 | 101 | | |
99 | 102 | | |
100 | 103 | | |
| |||
140 | 143 | | |
141 | 144 | | |
142 | 145 | | |
| 146 | + | |
| 147 | + | |
143 | 148 | | |
144 | 149 | | |
145 | 150 | | |
| |||
164 | 169 | | |
165 | 170 | | |
166 | 171 | | |
| 172 | + | |
167 | 173 | | |
168 | 174 | | |
169 | 175 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
24 | 24 | | |
25 | 25 | | |
26 | 26 | | |
27 | | - | |
28 | 27 | | |
| 28 | + | |
29 | 29 | | |
30 | 30 | | |
| 31 | + | |
31 | 32 | | |
32 | | - | |
33 | 33 | | |
34 | 34 | | |
35 | 35 | | |
| |||
42 | 42 | | |
43 | 43 | | |
44 | 44 | | |
45 | | - | |
| 45 | + | |
46 | 46 | | |
47 | 47 | | |
48 | 48 | | |
| |||
54 | 54 | | |
55 | 55 | | |
56 | 56 | | |
57 | | - | |
| 57 | + | |
58 | 58 | | |
59 | 59 | | |
60 | 60 | | |
| |||
93 | 93 | | |
94 | 94 | | |
95 | 95 | | |
96 | | - | |
| 96 | + | |
97 | 97 | | |
98 | 98 | | |
99 | 99 | | |
100 | | - | |
| 100 | + | |
101 | 101 | | |
102 | 102 | | |
103 | | - | |
| 103 | + | |
104 | 104 | | |
105 | 105 | | |
106 | 106 | | |
107 | | - | |
| 107 | + | |
108 | 108 | | |
109 | 109 | | |
110 | 110 | | |
| |||
113 | 113 | | |
114 | 114 | | |
115 | 115 | | |
116 | | - | |
117 | | - | |
118 | | - | |
119 | | - | |
| 116 | + | |
| 117 | + | |
120 | 118 | | |
121 | 119 | | |
122 | 120 | | |
123 | 121 | | |
| 122 | + | |
124 | 123 | | |
125 | 124 | | |
126 | | - | |
127 | | - | |
128 | | - | |
129 | | - | |
130 | | - | |
131 | | - | |
132 | | - | |
133 | | - | |
134 | | - | |
135 | | - | |
136 | | - | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
| 137 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
234 | 234 | | |
235 | 235 | | |
236 | 236 | | |
237 | | - | |
238 | | - | |
239 | | - | |
240 | | - | |
241 | 237 | | |
242 | 238 | | |
243 | 239 | | |
| |||
272 | 268 | | |
273 | 269 | | |
274 | 270 | | |
| 271 | + | |
| 272 | + | |
| 273 | + | |
| 274 | + | |
275 | 275 | | |
276 | 276 | | |
277 | 277 | | |
| |||
File renamed without changes.
0 commit comments