Commit f54123b

Add available models for Moonshot Chat Completion Models.
Signed-off-by: Xiaojie Wang <[email protected]>
1 parent 1046015 commit f54123b

1 file changed, +19 -9 lines changed

models/spring-ai-moonshot/src/main/java/org/springframework/ai/moonshot/api/MoonshotApi.java

Lines changed: 19 additions & 9 deletions
@@ -214,25 +214,35 @@ public enum ChatCompletionFinishReason {
  * <li><b>MOONSHOT_V1_128K</b> - moonshot-v1-128k</li>
  * </ul>
  *
- * {@code moonshot-v1-auto} can select the appropriate model based on the number of Tokens occupied by the current context. The available models for selection include:
+ * {@code moonshot-v1-auto} can select the appropriate model based on the number of
+ * Tokens occupied by the current context. The available models for selection include:
  * <ul>
  * <li>{@code moonshot-v1-8k}</li>
  * <li>{@code moonshot-v1-32k}</li>
  * <li>{@code moonshot-v1-128k}</li>
  * </ul>
- * <p>{@code moonshot-v1-auto} can be regarded as a model router, which decides which specific model to select based on the number of Tokens occupied by the current context. In terms of performance and output, {@code moonshot-v1-auto} is indistinguishable from the aforementioned models.</p>
- * The routing rules for the model selected by {@code moonshot-v1-auto} are as follows:
+ * <p>
+ * {@code moonshot-v1-auto} can be regarded as a model router, which decides which
+ * specific model to select based on the number of Tokens occupied by the current
+ * context. In terms of performance and output, {@code moonshot-v1-auto} is
+ * indistinguishable from the aforementioned models.
+ * </p>
+ * The routing rules for the model selected by {@code moonshot-v1-auto} are as
+ * follows:
  * <ul>
  * <li>If {@code total_tokens ≤ 8 * 1024}, choose {@code moonshot-v1-8k}.</li>
- * <li>If {@code 8 * 1024 < total_tokens ≤ 32 * 1024}, choose {@code moonshot-v1-32k}.</li>
+ * <li>If {@code 8 * 1024 < total_tokens ≤ 32 * 1024}, choose
+ * {@code moonshot-v1-32k}.</li>
  * <li>If {@code total_tokens > 32 * 1024}, choose {@code moonshot-v1-128k}.</li>
  * </ul>
- * The calculation formula is:
- * {@code total_tokens = prompt_tokens + max_tokens}
- * <p>The total number of Tokens is composed of two parts:
+ * The calculation formula is: {@code total_tokens = prompt_tokens + max_tokens}
+ * <p>
+ * The total number of Tokens is composed of two parts:
  * <ul>
- * <li>{@code prompt_tokens}: The number of Tokens occupied by the input prompt (Prompt).</li>
- * <li>{@code max_tokens}: The maximum number of Tokens expected to be generated as output.</li>
+ * <li>{@code prompt_tokens}: The number of Tokens occupied by the input prompt
+ * (Prompt).</li>
+ * <li>{@code max_tokens}: The maximum number of Tokens expected to be generated as
+ * output.</li>
  * </ul>
  */
 public enum ChatModel implements ChatModelDescription {
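
The routing rules described in the Javadoc above can be illustrated with a small client-side sketch. This is only an approximation of the documented thresholds: the class `MoonshotAutoRouting` and its `route` method are hypothetical names introduced here, they are not part of `MoonshotApi`, and the actual selection is performed by the Moonshot service rather than by the caller.

```java
// Hypothetical illustration of the documented moonshot-v1-auto routing rules.
// Not part of MoonshotApi; the real routing happens on the server side.
final class MoonshotAutoRouting {

    private static final long LIMIT_8K = 8L * 1024;   // 8192 tokens
    private static final long LIMIT_32K = 32L * 1024; // 32768 tokens

    // Returns the model name the Javadoc says moonshot-v1-auto would select,
    // using total_tokens = prompt_tokens + max_tokens.
    static String route(int promptTokens, int maxTokens) {
        if (promptTokens < 0 || maxTokens < 0) {
            throw new IllegalArgumentException("token counts must be non-negative");
        }
        // Widen to long so the sum cannot overflow int.
        long totalTokens = (long) promptTokens + maxTokens;
        if (totalTokens <= LIMIT_8K) {
            return "moonshot-v1-8k";
        }
        if (totalTokens <= LIMIT_32K) {
            return "moonshot-v1-32k";
        }
        return "moonshot-v1-128k";
    }

    public static void main(String[] args) {
        // Example inputs are made up for illustration.
        System.out.println(route(6000, 1024));  // total 7024
        System.out.println(route(8000, 1024));  // total 9024
        System.out.println(route(30000, 4096)); // total 34096
    }
}
```

Because `max_tokens` counts toward `total_tokens`, a short prompt with a large `max_tokens` value can still land in a larger-context tier under these rules.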
