Commit f27a7cb
convert : Add support for Microsoft Phi-4 model (ggml-org#10817)
* convert : use GPT2 vocab for Phi-4 model
* convert : use null value of sliding_window to distinguish Phi-4 from other PHI3-based models
* llama : do not use sliding window attention mask for Phi-4 model
---------
Co-authored-by: Stanisław Szymczyk <[email protected]>
1 parent b26f071 commit f27a7cb
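The second and third bullets of the commit message describe a two-step idea: treat a null `sliding_window` in the model config as the marker for Phi-4, and skip the sliding-window attention mask in that case. The sketch below illustrates only that idea and is not the code from this commit. The helper names `read_sliding_window` and `attention_mask` are hypothetical, the config is passed as a JSON string for the sake of a self-contained example, and the window convention (a query sees the last `window` positions, itself included) is an assumption.

```python
import json
from typing import List, Optional


def read_sliding_window(config_text: str) -> Optional[int]:
    """Return the configured window size, or None when it is null or absent.

    Hypothetical helper: a None result is what this sketch treats as
    "Phi-4 style", i.e. no sliding window.
    """
    return json.loads(config_text).get("sliding_window")


def attention_mask(n_tokens: int, window: Optional[int]) -> List[List[bool]]:
    """Build a mask where mask[i][j] is True if query i may attend to key j.

    With window=None the mask is plain causal attention. Otherwise each
    query is limited to the most recent `window` positions (assumed
    convention for this sketch).
    """
    mask = []
    for i in range(n_tokens):
        row = []
        for j in range(n_tokens):
            causal = j <= i
            in_window = window is None or (i - j) < window
            row.append(causal and in_window)
        mask.append(row)
    return mask


# Illustrative configs: one with a window size, one with a null value.
windowed_cfg = '{"sliding_window": 2}'
null_cfg = '{"sliding_window": null}'

windowed_mask = attention_mask(3, read_sliding_window(windowed_cfg))
full_causal_mask = attention_mask(3, read_sliding_window(null_cfg))
```

With the null config every earlier token stays visible to later queries, while the windowed config hides tokens that fall outside the window. A converter following this idea would need to write some sentinel for the null case into the output file; which value this commit uses cannot be read from the page as captured here.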
2 files changed, +22 -3 lines changed

The diff bodies and file names were not preserved; only the line ranges of each hunk survive.

| File | Original lines | New lines | Change |
|---|---|---|---|
| 1 of 2 | (none) | 2203-2211 | 9 added |
| 1 of 2 | 2319 | 2328-2332 | 1 removed, 5 added |
| 2 of 2 | 13345 | 13345-13351 | 1 removed, 7 added |
| 2 of 2 | 13403 | 13409 | 1 removed, 1 added |