* feat: `comment` segment budget
* feat(Electron template): comment segments
* feat(Electron template): improve completions speed when using functions
* feat(Electron template): start with inspect script
* feat(Electron template): add a link to download `gpt-oss`
* fix: using segment budgets with `gpt-oss` models
* fix: detect more variations of Harmony chat template
* fix: use a model message for user prompt completion on unsupported models by default
* fix(Electron template): don't render `<br/>` in a thought excerpt
Since `gpt-oss` models are reasoning models, they generate thoughts as part of their response.
These thoughts are useful for debugging and understanding the model's reasoning process,
and can be used to iterate on the system prompt and inputs you provide to the model to improve its responses.

However, OpenAI [emphasizes](https://openai.com/index/chain-of-thought-monitoring/#:~:text=leaving%20CoTs%20unrestricted%20may%20make%20them%20unfit%20to%20be%20shown%20to%20end%2Dusers%2C%20as%20they%20might%20violate%20some%20misuse%20policies)
that the thoughts generated by these models may not be safe to show to end users as they are unrestricted
and might include sensitive information, unfiltered language, hallucinations, or other issues.
Thus, OpenAI recommends not showing these to users without further filtering, moderation, or summarization.

Check out the [segment streaming example](../guide/chat-session.md#stream-response-segments) to learn how to use segments.

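As a rough sketch of keeping thoughts away from end users while still retaining them for debugging, the snippet below separates thought text from the main response text. The `ResponseChunk` interface and the `splitThoughts` helper are hypothetical names used only for illustration; the chunk shape is an assumption modeled on the chunks a streaming callback such as `onResponseChunk` receives, so check the linked guide for the actual types.

```typescript
// Local stand-in for a streamed response chunk.
// Assumption: segment chunks carry `type: "segment"` and a `segmentType`,
// while main response text has neither.
interface ResponseChunk {
    type?: "segment";
    segmentType?: "thought" | "comment";
    text: string;
}

// Hypothetical helper: collects main response text for the user
// and keeps thought text aside for debugging only.
function splitThoughts(chunks: ResponseChunk[]): {userText: string, debugThoughts: string} {
    let userText = "";
    let debugThoughts = "";

    for (const chunk of chunks) {
        if (chunk.type === "segment" && chunk.segmentType === "thought")
            debugThoughts += chunk.text;
        else if (chunk.type === undefined)
            userText += chunk.text;
        // other segment types (such as `comment`) are ignored in this sketch
    }

    return {userText, debugThoughts};
}

const exampleChunks: ResponseChunk[] = [
    {type: "segment", segmentType: "thought", text: "The user wants a greeting."},
    {text: "Hello"},
    {text: " there!"}
];

const result = splitThoughts(exampleChunks);
console.log("Shown to the user:", result.userText);
console.log("Kept for debugging:", result.debugThoughts);
```

In a real application, the debugging text would go to a log or a developer-only view, or through a moderation or summarization step before anything derived from it reaches a user.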
## `comment` Segments
`gpt-oss` models output "preamble" messages in their response;
these are exposed as `comment` segments in the model's response.

The model might choose to generate these segments to inform the user about the functions it's about to call.
For example, when it plans to use multiple functions, it may generate a plan in advance.

These are intended for the user to see, but not as part of the main response.

Check out the [segment streaming example](../guide/chat-session.md#stream-response-segments) to learn how to use segments.

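One way to show `comment` segments to the user without mixing them into the main response is to group consecutive chunks by kind before rendering. The sketch below is illustrative only: `ResponseChunk`, `RenderBlock`, and `groupIntoBlocks` are hypothetical names, and the chunk shape is an assumption rather than the library's exact type.

```typescript
// Local stand-in for a streamed response chunk (assumed shape).
interface ResponseChunk {
    type?: "segment";
    segmentType?: "thought" | "comment";
    text: string;
}

type BlockKind = "response" | "thought" | "comment";

interface RenderBlock {
    kind: BlockKind;
    text: string;
}

// Hypothetical helper: merges consecutive chunks of the same kind into one block,
// so a UI can render comments apart from the main response.
function groupIntoBlocks(chunks: ResponseChunk[]): RenderBlock[] {
    const blocks: RenderBlock[] = [];

    for (const chunk of chunks) {
        const kind: BlockKind = (chunk.type === "segment" && chunk.segmentType != null)
            ? chunk.segmentType
            : "response";
        const lastBlock = blocks[blocks.length - 1];

        if (lastBlock != null && lastBlock.kind === kind)
            lastBlock.text += chunk.text;
        else
            blocks.push({kind, text: chunk.text});
    }

    return blocks;
}

const streamedChunks: ResponseChunk[] = [
    {type: "segment", segmentType: "comment", text: "I'll check the weather, "},
    {type: "segment", segmentType: "comment", text: "then the calendar."},
    {text: "It is sunny"},
    {text: " today."}
];

for (const block of groupIntoBlocks(streamedChunks))
    console.log(`[${block.kind}] ${block.text}`);
```

A UI could then render `comment` blocks as status notes next to function calls and `response` blocks as the answer itself.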
::: info Experiment with `comment` segments
The [Electron app template](../guide/electron.md) has been updated to properly segment comments in the response.

Try it out by downloading the latest build [from GitHub](https://github.com/withcatai/node-llama-cpp/releases/latest),
or by [scaffolding a new project](../guide/index.md#scaffold-new-project) based on the Electron template:

```shell
npm create node-llama-cpp@latest
```
:::

## Customizing gpt-oss
You can adjust `gpt-oss`'s responses by configuring the options of [`HarmonyChatWrapper`](../api/classes/HarmonyChatWrapper.md):