Commit 5f429c2
committed
Expose last-token candidates context method
* adds call to llama_get_logits, which accesses the
result of the most recent llama_decode call
* "Token logits obtained from the last call to llama_decode()
The logits for which llama_batch.logits[i] != 0
are stored contiguously in the order they have
appeared in the batch. Rows: number of tokens for
which llama_batch.logits[i] != 0. Cols: n_vocab"
* https://github.com/ggerganov/llama.cpp/blob/c35e586ea57221844442c65a1172498c54971cb0/include/llama.h#L844-L8491 parent b1420f3 commit 5f429c2
1 file changed
+39
-0
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
187 | 187 | | |
188 | 188 | | |
189 | 189 | | |
| 190 | + | |
| 191 | + | |
| 192 | + | |
| 193 | + | |
| 194 | + | |
| 195 | + | |
| 196 | + | |
| 197 | + | |
| 198 | + | |
| 199 | + | |
| 200 | + | |
| 201 | + | |
| 202 | + | |
| 203 | + | |
| 204 | + | |
| 205 | + | |
| 206 | + | |
| 207 | + | |
| 208 | + | |
| 209 | + | |
| 210 | + | |
| 211 | + | |
| 212 | + | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
| 218 | + | |
| 219 | + | |
| 220 | + | |
| 221 | + | |
| 222 | + | |
| 223 | + | |
| 224 | + | |
| 225 | + | |
| 226 | + | |
| 227 | + | |
| 228 | + | |
190 | 229 | | |
191 | 230 | | |
192 | 231 | | |
| |||
0 commit comments