Commit a10b36c
authored
llama : refactor kv cache guard (ggml-org#12695)
* llama : refactor kv cache guard
ggml-ci
* cont : fix comment [no ci]
* llama : fix kv_cache restore logic
ggml-ci
* context : simplify kv cache updates
ggml-ci
* cont : better name [no ci]
* llama : fix llama_decode return code when could not find KV slot
ggml-ci
* context : change log err -> warn [no ci]
* kv-cache : add comment + warning1 parent 83a88bd commit a10b36c
File tree
4 files changed
+107
-127
lines changed- examples/parallel
- src
4 files changed
+107
-127
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
106 | 106 | | |
107 | 107 | | |
108 | 108 | | |
| 109 | + | |
| 110 | + | |
109 | 111 | | |
110 | 112 | | |
111 | 113 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1201 | 1201 | | |
1202 | 1202 | | |
1203 | 1203 | | |
1204 | | - | |
1205 | | - | |
1206 | | - | |
1207 | | - | |
1208 | | - | |
1209 | | - | |
1210 | | - | |
1211 | | - | |
1212 | | - | |
1213 | | - | |
1214 | | - | |
1215 | | - | |
1216 | | - | |
1217 | | - | |
1218 | | - | |
1219 | | - | |
1220 | | - | |
1221 | | - | |
1222 | | - | |
1223 | | - | |
1224 | | - | |
1225 | | - | |
1226 | | - | |
1227 | | - | |
1228 | | - | |
1229 | | - | |
1230 | | - | |
| 1204 | + | |
1231 | 1205 | | |
1232 | 1206 | | |
1233 | 1207 | | |
| |||
1280 | 1254 | | |
1281 | 1255 | | |
1282 | 1256 | | |
| 1257 | + | |
| 1258 | + | |
| 1259 | + | |
1283 | 1260 | | |
1284 | 1261 | | |
1285 | 1262 | | |
| |||
1319 | 1296 | | |
1320 | 1297 | | |
1321 | 1298 | | |
1322 | | - | |
| 1299 | + | |
| 1300 | + | |
1323 | 1301 | | |
1324 | | - | |
1325 | | - | |
1326 | | - | |
1327 | | - | |
| 1302 | + | |
1328 | 1303 | | |
1329 | 1304 | | |
1330 | | - | |
1331 | | - | |
1332 | | - | |
1333 | | - | |
1334 | | - | |
1335 | | - | |
1336 | | - | |
1337 | | - | |
1338 | 1305 | | |
1339 | 1306 | | |
1340 | 1307 | | |
| |||
1371 | 1338 | | |
1372 | 1339 | | |
1373 | 1340 | | |
1374 | | - | |
1375 | | - | |
1376 | | - | |
1377 | | - | |
1378 | | - | |
1379 | | - | |
1380 | | - | |
1381 | | - | |
1382 | | - | |
1383 | | - | |
1384 | 1341 | | |
1385 | 1342 | | |
1386 | 1343 | | |
| |||
1467 | 1424 | | |
1468 | 1425 | | |
1469 | 1426 | | |
1470 | | - | |
| 1427 | + | |
1471 | 1428 | | |
1472 | 1429 | | |
1473 | 1430 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
11 | 11 | | |
12 | 12 | | |
13 | 13 | | |
14 | | - | |
15 | | - | |
16 | 14 | | |
17 | 15 | | |
18 | 16 | | |
| |||
206 | 204 | | |
207 | 205 | | |
208 | 206 | | |
| 207 | + | |
| 208 | + | |
209 | 209 | | |
210 | 210 | | |
211 | 211 | | |
| |||
446 | 446 | | |
447 | 447 | | |
448 | 448 | | |
| 449 | + | |
| 450 | + | |
| 451 | + | |
| 452 | + | |
| 453 | + | |
| 454 | + | |
| 455 | + | |
| 456 | + | |
| 457 | + | |
| 458 | + | |
| 459 | + | |
| 460 | + | |
| 461 | + | |
| 462 | + | |
| 463 | + | |
| 464 | + | |
| 465 | + | |
| 466 | + | |
| 467 | + | |
| 468 | + | |
| 469 | + | |
| 470 | + | |
| 471 | + | |
| 472 | + | |
| 473 | + | |
| 474 | + | |
| 475 | + | |
| 476 | + | |
| 477 | + | |
| 478 | + | |
| 479 | + | |
| 480 | + | |
| 481 | + | |
| 482 | + | |
| 483 | + | |
| 484 | + | |
| 485 | + | |
| 486 | + | |
| 487 | + | |
| 488 | + | |
| 489 | + | |
| 490 | + | |
| 491 | + | |
| 492 | + | |
449 | 493 | | |
450 | 494 | | |
451 | 495 | | |
452 | 496 | | |
453 | | - | |
| 497 | + | |
454 | 498 | | |
455 | 499 | | |
456 | 500 | | |
457 | 501 | | |
458 | 502 | | |
| 503 | + | |
| 504 | + | |
| 505 | + | |
| 506 | + | |
| 507 | + | |
| 508 | + | |
459 | 509 | | |
460 | 510 | | |
461 | 511 | | |
| |||
477 | 527 | | |
478 | 528 | | |
479 | 529 | | |
480 | | - | |
| 530 | + | |
481 | 531 | | |
482 | 532 | | |
483 | 533 | | |
| |||
616 | 666 | | |
617 | 667 | | |
618 | 668 | | |
619 | | - | |
| 669 | + | |
620 | 670 | | |
621 | 671 | | |
622 | 672 | | |
623 | 673 | | |
624 | 674 | | |
625 | 675 | | |
626 | | - | |
| 676 | + | |
627 | 677 | | |
628 | 678 | | |
629 | 679 | | |
| |||
651 | 701 | | |
652 | 702 | | |
653 | 703 | | |
654 | | - | |
| 704 | + | |
655 | 705 | | |
656 | 706 | | |
657 | 707 | | |
| |||
668 | 718 | | |
669 | 719 | | |
670 | 720 | | |
671 | | - | |
| 721 | + | |
| 722 | + | |
| 723 | + | |
672 | 724 | | |
673 | 725 | | |
674 | 726 | | |
| |||
1033 | 1085 | | |
1034 | 1086 | | |
1035 | 1087 | | |
| 1088 | + | |
1036 | 1089 | | |
1037 | 1090 | | |
1038 | 1091 | | |
| |||
0 commit comments