You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Scaleway's Generative APIs service allows users to interact with language models benefitting from additional reasoning capabilities.
11
+
Scaleway's Generative APIs service allows users to interact with language models benefiting from additional reasoning capabilities.
12
12
13
-
A reasoning model is a language model that is capable of carrying out multiple inference steps and systematically verifying intermediate results before producing answers. You can specify how much effort it should put into reasoning via dedicated parameters, and access reasoning content in its outputs. Even with default parameters, such models are designed to perform better on reasoning tasks like maths and logic problems, than non-reasoning language models.
13
+
A reasoning model is a language model that is capable of carrying out multiple inference steps and systematically verifying intermediate results before producing answers. You can specify how much effort it should put into reasoning via dedicated parameters, and access reasoning content in its outputs. Even with default parameters, such models are designed to perform better on reasoning tasks like maths and logic problems than non-reasoning language models.
14
14
15
15
Language models supporting the reasoning feature include `gpt-oss-120b`. See [Supported Models](/generative-apis/reference-content/supported-models/) for a full list.
16
16
@@ -45,7 +45,7 @@ The web playground displays.
45
45
4. Click **View code** to get code snippets configured according to your settings in the playground.
46
46
47
47
<Messagetype="note">
48
-
You cannot currently set values for parameters such as `reasoning_effort`, or access reasoning metadata in the model's output, via the console playground. Query the models programatically as shown below in order to access the full reasoning featureset.
48
+
You cannot currently set values for parameters such as `reasoning_effort`, or access reasoning metadata in the model's output, via the console playground. Query the models programmatically as shown below in order to access the full reasoning feature set.
49
49
</Message>
50
50
51
51
## Querying reasoning language models via API
@@ -57,7 +57,7 @@ In the example that follows, we will use the OpenAI Python client.
57
57
58
58
Both the [Chat Completions API](/https://www.scaleway.com/en/developers/api/generative-apis/#path-chat-completions-create-a-chat-completion) and the [Responses API](https://www.scaleway.com/en/developers/api/generative-apis/#path-chat-completions-create-a-chat-completion) allow you to access and control reasoning for supported models. Scaleway's support of the Responses API is currently in beta.
59
59
60
-
Note however, that the Responses API was introduced in part to better support features for reasoning workflows, among other tasks. It provides richer support for reasoning than Chat Completions, for example by providing chain-of-thought reasoning content in its responses.
60
+
Note however, that the Responses API was introduced in part to better support features for reasoning workflows, among other tasks. It provides richer support for reasoning than Chat Completions, for example, by providing chain-of-thought reasoning content in its responses.
61
61
62
62
For more information on Chat Completions versus Responses API, see the information provided in the [querying language models](/generative-apis/how-to/query-language-models/#chat-completions-api-or-responses-api) documentation.
63
63
@@ -139,7 +139,7 @@ You can now create a chat completion with reasoning, using either the Chat Compl
139
139
This code sends a message to the model, as well as specifying the effort to make with reasoning, and returns an answer based on your input. The model's reasoning metadata can be accessed as well as its answer, with outputs such as:
140
140
141
141
```python
142
-
Reasoning: The user asks: "Briefly describe a futuristic city with advanced technology and green energy solutions." They want a brief description. Should be concise but vivid. Provide details: architecture, transport, energy, AI, sustainability. Probably a paragraph or a few sentences. Ensure it's brief. Let's produce a short description.
142
+
Reasoning: The user asks: "Briefly describe a futuristic city with advanced technology and green energy solutions." They want a brief description. Should be concise but vivid. Provide details: architecture, transport, energy, AI, andsustainability. Probably a paragraph or a few sentences. Ensure it's brief. Let's produce a short description.
143
143
144
144
Answer: **Solaris Arcadia** rises from a reclaimed river delta, its skyline a lattice of translucent, self‑healing bioglass towers that
145
145
```
@@ -154,12 +154,12 @@ Some legacy models such as `deepseek-r1-distill-llama-70b` do not output reasoni
154
154
response.content = "<think> The user asks for questions about mathematics (...) </think> Answer is 42."
155
155
```
156
156
157
-
The reasoning content is inside the `<think>`...`</think>` tags, and you can parse the response accordingly to access such content. There is, however, a known bug which can lead to the model to omit the opening `<think>` tag, so we suggest taking care when parsing such outputs.
157
+
The reasoning content is inside the `<think>`...`</think>` tags, and you can parse the response accordingly to access such content. There is, however, a known bug that can lead the model to omit the opening `<think>` tag, so we suggest taking care when parsing such outputs.
158
158
159
159
Note that the `reasoning_effort` parameter is not available for this model.
160
160
161
161
162
162
## Impact on token generation
163
163
164
-
Reasoning models generate reasoning tokens, which are billable. Generally these are in the model's output as part of the reasoning content. To limit generation of reasoning tokens, you can adjust settings for the **reasoning effort** and **max completion/output tokens** parameters. Alternatively, use a non-reasoning model to avoid generation of reasoning tokens and subsequent billing.
164
+
Reasoning models generate reasoning tokens, which are billable. Generally these are in the model's output as part of the reasoning content. To limit the generation of reasoning tokens, you can adjust settings for the **reasoning effort** and **max completion/output tokens** parameters. Alternatively, use a non-reasoning model to avoid the generation of reasoning tokens and subsequent billing.
0 commit comments