You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
|`type_k`|[ggml_type](https://github.com/ggml-org/llama.cpp/blob/00681dfc16ba4cebb9c7fbd2cf2656e06a0692a4/ggml/include/ggml.h#L377)| Data type for K cache. |
145
+
|`type_v`|[ggml_type](https://github.com/ggml-org/llama.cpp/blob/00681dfc16ba4cebb9c7fbd2cf2656e06a0692a4/ggml/include/ggml.h#L377)| Data type for V cache. |
146
+
147
+
### Flags
148
+
149
+
> Place booleans at the end of your option string if you’re copy-by-value mirroring a struct; otherwise order doesn’t matter.
|`embeddings`|`1 or 0`| If `1`, extract embeddings (with logits). Used by the embedding preset. |
154
+
|`offload_kqv`|`1 or 0`| Offload KQV ops (incl. KV cache) to GPU. |
155
+
|`no_perf`|`1 or 0`| Disable performance timing. |
156
+
|`op_offload`|`1 or 0`| Offload host tensor ops to device. |
157
+
|`swa_full`|`1 or 0`| Use full-size SWA cache. When `false` and `n_seq_max > 1`, performance may degrade. |
158
+
|`kv_unified`|`1 or 0`| Use a unified buffer across input sequences during attention. Try disabling when `n_seq_max > 1` and sequences do not share a long prefix. |
**Parameters:****`context_settings` (optional):** Comma-separated `key=value` pairs to override or extend default settings (see [context settings](#context_settings) in `llm_context_create`).
104
175
105
176
**Returns:**`NULL`
106
177
@@ -109,7 +180,7 @@ Creates a new inference context specifically set for embedding generation.
109
180
110
181
It is equivalent to `SELECT llm_context_create('generate_embedding=1,normalize_embedding=1,pooling_type=mean');`
111
182
112
-
Context must explicitly created before performing any AI operation!
183
+
**Context must explicitly created before performing any AI operation!**
**Parameters:****`context_settings` (optional):** Comma-separated `key=value` pairs to override or extend default settings (see [context settings](#context_settings) in `llm_context_create`).
**Parameters:****`context_settings` (optional):** Comma-separated `key=value` pairs to override or extend default settings (see [context settings](#context_settings) in `llm_context_create`).
0 commit comments