Commit 83dbc6c
authored
[TRTLLM-5532][feat] store the block of context request into kv cache (NVIDIA#6683)
Signed-off-by: bhsueh <[email protected]>1 parent 9a8195e commit 83dbc6c
File tree
2 files changed
+10
-3
lines changed- cpp/tensorrt_llm/batch_manager
- tensorrt_llm/_torch/pyexecutor
2 files changed
+10
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2043 | 2043 | | |
2044 | 2044 | | |
2045 | 2045 | | |
2046 | | - | |
2047 | | - | |
| 2046 | + | |
2048 | 2047 | | |
2049 | | - | |
| 2048 | + | |
| 2049 | + | |
| 2050 | + | |
| 2051 | + | |
| 2052 | + | |
2050 | 2053 | | |
2051 | 2054 | | |
2052 | 2055 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
492 | 492 | | |
493 | 493 | | |
494 | 494 | | |
| 495 | + | |
| 496 | + | |
| 497 | + | |
| 498 | + | |
495 | 499 | | |
496 | 500 | | |
497 | 501 | | |
| |||
0 commit comments