Commit 9cee32a
authored
[https://nvbugs/5625990][fix] Respect VSWA scheme when doing block store for reuse and load block for reuse in KV cache manager (#10183)
Signed-off-by: eopXD <[email protected]>1 parent 2f8d6d2 commit 9cee32a
File tree
4 files changed
+122
-79
lines changed- cpp
- include/tensorrt_llm/batch_manager
- tensorrt_llm/batch_manager
- tests/unit_tests/batch_manager
- tests/integration/defs/accuracy
4 files changed
+122
-79
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
380 | 380 | | |
381 | 381 | | |
382 | 382 | | |
| 383 | + | |
383 | 384 | | |
384 | 385 | | |
385 | 386 | | |
| |||
500 | 501 | | |
501 | 502 | | |
502 | 503 | | |
| 504 | + | |
| 505 | + | |
| 506 | + | |
| 507 | + | |
| 508 | + | |
| 509 | + | |
| 510 | + | |
| 511 | + | |
| 512 | + | |
| 513 | + | |
| 514 | + | |
| 515 | + | |
| 516 | + | |
| 517 | + | |
503 | 518 | | |
504 | 519 | | |
505 | 520 | | |
| |||
517 | 532 | | |
518 | 533 | | |
519 | 534 | | |
| 535 | + | |
| 536 | + | |
520 | 537 | | |
521 | 538 | | |
522 | 539 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1224 | 1224 | | |
1225 | 1225 | | |
1226 | 1226 | | |
1227 | | - | |
| 1227 | + | |
1228 | 1228 | | |
1229 | 1229 | | |
1230 | 1230 | | |
| |||
1338 | 1338 | | |
1339 | 1339 | | |
1340 | 1340 | | |
| 1341 | + | |
1341 | 1342 | | |
1342 | 1343 | | |
1343 | 1344 | | |
| |||
1731 | 1732 | | |
1732 | 1733 | | |
1733 | 1734 | | |
| 1735 | + | |
| 1736 | + | |
| 1737 | + | |
| 1738 | + | |
| 1739 | + | |
| 1740 | + | |
| 1741 | + | |
| 1742 | + | |
| 1743 | + | |
| 1744 | + | |
| 1745 | + | |
| 1746 | + | |
1734 | 1747 | | |
1735 | 1748 | | |
1736 | | - | |
| 1749 | + | |
| 1750 | + | |
1737 | 1751 | | |
1738 | 1752 | | |
1739 | 1753 | | |
| |||
0 commit comments