Commit 4f4d427
Disable chunked prefill and/or prefix caching when MLA is enabled (#12642)
From @mgoin in #12638
I cannot push to that branch, so this is a new PR to unblock the release.
---------
Signed-off-by: mgoin <[email protected]>
Signed-off-by: simon-mo <[email protected]>
Co-authored-by: mgoin <[email protected]>

1 parent 1e36983 commit 4f4d427
1 file changed: 10 additions, 0 deletions

@@ -3252,6 +3252,16 @@
Context: original lines 3252-3254 (unchanged, new lines 3252-3254)
Added: new lines 3255-3264 (10 lines; their content is not shown in this view)
Context: original lines 3255-3257 (unchanged, now new lines 3265-3267)
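The commit title describes a configuration-time guard: when MLA is enabled, chunked prefill and prefix caching are switched off. Since the added lines are not visible here, the following is only a minimal sketch of how such a guard could look. Every class and field name in it (`ModelConfig`, `SchedulerConfig`, `CacheConfig`, `EngineConfig`, `use_mla`, `enable_chunked_prefill`, `enable_prefix_caching`) is a hypothetical stand-in chosen for illustration, not the code from this commit.

```python
import logging
from dataclasses import dataclass, field

logger = logging.getLogger(__name__)


# All classes below are hypothetical stand-ins, not the project's real config types.
@dataclass
class ModelConfig:
    use_mla: bool = False  # assumed flag: model uses MLA attention


@dataclass
class SchedulerConfig:
    enable_chunked_prefill: bool = True


@dataclass
class CacheConfig:
    enable_prefix_caching: bool = True


@dataclass
class EngineConfig:
    model: ModelConfig = field(default_factory=ModelConfig)
    scheduler: SchedulerConfig = field(default_factory=SchedulerConfig)
    cache: CacheConfig = field(default_factory=CacheConfig)

    def __post_init__(self) -> None:
        # Guard: with MLA enabled, force both features off and tell the user.
        if self.model.use_mla:
            if (self.scheduler.enable_chunked_prefill
                    or self.cache.enable_prefix_caching):
                logger.warning(
                    "MLA is enabled; disabling chunked prefill and "
                    "prefix caching.")
            self.scheduler.enable_chunked_prefill = False
            self.cache.enable_prefix_caching = False


if __name__ == "__main__":
    cfg = EngineConfig(model=ModelConfig(use_mla=True))
    print(cfg.scheduler.enable_chunked_prefill, cfg.cache.enable_prefix_caching)
```

Placing the guard in `__post_init__` means the override applies however the config object is constructed, and a configuration without MLA keeps its original settings.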