Commit c7e9143
authored
fix preempted prompts (#928)
### Motivation
The preempted prompts might failed to mitch the `num_computed_tokens <
num_prompt_tokens` test and be treated as decoding then cause runtime
error.
### Changes
- add `_is_prompt()` to check if a request is prompt or not.
- consider the `num_scheduled_tokens` to handle the preempted prompts.
- add test for preemption handling to the CI.
---------
Signed-off-by: Youlei Yang <youlei.yang@intel.com>1 parent 00767f3 commit c7e9143
File tree
3 files changed
+69
-18
lines changed- tests/full_tests
- vllm_gaudi/v1/worker
3 files changed
+69
-18
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
273 | 273 | | |
274 | 274 | | |
275 | 275 | | |
| 276 | + | |
| 277 | + | |
| 278 | + | |
| 279 | + | |
| 280 | + | |
| 281 | + | |
| 282 | + | |
276 | 283 | | |
277 | 284 | | |
278 | 285 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
| 11 | + | |
| 12 | + | |
| 13 | + | |
| 14 | + | |
| 15 | + | |
| 16 | + | |
| 17 | + | |
| 18 | + | |
| 19 | + | |
| 20 | + | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
| 27 | + | |
| 28 | + | |
| 29 | + | |
| 30 | + | |
| 31 | + | |
| 32 | + | |
| 33 | + | |
| 34 | + | |
| 35 | + | |
| 36 | + | |
| 37 | + | |
| 38 | + | |
| 39 | + | |
| 40 | + | |
| 41 | + | |
| 42 | + | |
| 43 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1528 | 1528 | | |
1529 | 1529 | | |
1530 | 1530 | | |
| 1531 | + | |
| 1532 | + | |
| 1533 | + | |
| 1534 | + | |
| 1535 | + | |
| 1536 | + | |
| 1537 | + | |
| 1538 | + | |
| 1539 | + | |
| 1540 | + | |
| 1541 | + | |
| 1542 | + | |
| 1543 | + | |
| 1544 | + | |
1531 | 1545 | | |
1532 | 1546 | | |
1533 | 1547 | | |
| |||
1551 | 1565 | | |
1552 | 1566 | | |
1553 | 1567 | | |
1554 | | - | |
1555 | 1568 | | |
1556 | 1569 | | |
1557 | 1570 | | |
| |||
1565 | 1578 | | |
1566 | 1579 | | |
1567 | 1580 | | |
1568 | | - | |
1569 | | - | |
1570 | | - | |
1571 | | - | |
1572 | | - | |
1573 | | - | |
| 1581 | + | |
1574 | 1582 | | |
1575 | 1583 | | |
1576 | | - | |
1577 | | - | |
1578 | | - | |
1579 | | - | |
1580 | 1584 | | |
| 1585 | + | |
1581 | 1586 | | |
1582 | 1587 | | |
1583 | 1588 | | |
| |||
1590 | 1595 | | |
1591 | 1596 | | |
1592 | 1597 | | |
1593 | | - | |
1594 | | - | |
1595 | | - | |
1596 | | - | |
1597 | 1598 | | |
1598 | | - | |
1599 | | - | |
| 1599 | + | |
1600 | 1600 | | |
1601 | 1601 | | |
1602 | | - | |
| 1602 | + | |
| 1603 | + | |
1603 | 1604 | | |
1604 | 1605 | | |
1605 | 1606 | | |
| |||
0 commit comments