articles/ai-studio/reference/reference-model-inference-chat-completions.md (0 additions, 2 deletions)

@@ -37,7 +37,6 @@ POST /chat/completions?api-version=2024-04-01-preview
 | messages | True |[ChatCompletionRequestMessage](#chatcompletionrequestmessage)| A list of messages comprising the conversation so far. Returns a 422 error if at least some of the messages can't be understood by the model. |
 | frequency\_penalty || number | Helps prevent word repetitions by reducing the chance of a word being selected if it has already been used. The higher the frequency penalty, the less likely the model is to repeat the same words in its output. Returns a 422 error if the value or parameter isn't supported by the model. |
 | max\_tokens || integer | The maximum number of tokens that can be generated in the chat completion.<br><br>The total length of input tokens and generated tokens is limited by the model's context length. Passing null causes the model to use its max context length. |
-| model || string | Kept for compatibility reasons. This parameter is ignored. |
 | presence\_penalty || number | Helps prevent the same topics from being repeated by penalizing a word if it exists in the completion already, even just once. Returns a 422 error if the value or parameter isn't supported by the model. |
 | seed || integer | If specified, our system will make a best effort to sample deterministically, such that repeated requests with the same `seed` and parameters should return the same result. Determinism is not guaranteed, and you should refer to the `system_fingerprint` response parameter to monitor changes in the backend. |

@@ -288,7 +287,6 @@ The API call fails when the prompt triggers a content filter as configured. Modi
 | frequency\_penalty | number | 0 | Helps prevent word repetitions by reducing the chance of a word being selected if it has already been used. The higher the frequency penalty, the less likely the model is to repeat the same words in its output. Returns a 422 error if the value or parameter isn't supported by the model. |
 | max\_tokens | integer || The maximum number of tokens that can be generated in the chat completion.<br><br>The total length of input tokens and generated tokens is limited by the model's context length. Passing null causes the model to use its max context length. |
 | messages | ChatCompletionRequestMessage\[\] || A list of messages comprising the conversation so far. Returns a 422 error if at least some of the messages can't be understood by the model. |
-| model | string || Kept for compatibility reasons. This parameter is ignored. |
 | presence\_penalty | number | 0 | Helps prevent the same topics from being repeated by penalizing a word if it exists in the completion already, even just once. Returns a 422 error if the value or parameter isn't supported by the model. |
 | response\_format |[ChatCompletionResponseFormat](#chatcompletionresponseformat)| text ||
 | seed | integer || If specified, our system will make a best effort to sample deterministically, such that repeated requests with the same `seed` and parameters should return the same result. Determinism is not guaranteed, and you should refer to the `system_fingerprint` response parameter to monitor changes in the backend. |
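The parameters left in the chat-completions table can be combined into a minimal request body. A sketch in Python, with the endpoint host and key as placeholders (they are not part of this diff); only `messages` is required, and `model` is omitted because the service ignores it:

```python
import json

# Placeholder endpoint and key -- substitute your deployment's values.
ENDPOINT = "https://<your-endpoint>/chat/completions?api-version=2024-04-01-preview"
API_KEY = "<your-api-key>"

# `messages` is the only required field; the optional fields below may
# return a 422 error on models that don't support them.
payload = {
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "How does seed-based determinism work?"},
    ],
    "max_tokens": 256,         # null would mean "use the model's max context length"
    "frequency_penalty": 0.5,  # discourage repeating the same words
    "presence_penalty": 0.2,   # discourage repeating the same topics
    "seed": 42,                # best effort only; watch system_fingerprint
}
body = json.dumps(payload)
headers = {"Content-Type": "application/json", "Authorization": f"Bearer {API_KEY}"}
# The request itself could then be sent with any HTTP client.
```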
articles/ai-studio/reference/reference-model-inference-completions.md (0 additions, 2 deletions)

@@ -37,7 +37,6 @@ POST /completions?api-version=2024-04-01-preview
 | prompt | True || The prompts to generate completions for, encoded as a string, array of strings, array of tokens, or array of token arrays. Note that `<\|endoftext\|>` is the document separator that the model sees during training, so if a prompt is not specified the model generates as if from the beginning of a new document. |
 | frequency\_penalty || number | Positive values penalize new tokens based on their existing frequency in the text so far, decreasing the model's likelihood to repeat the same line verbatim. |
 | max\_tokens || integer | The maximum number of tokens that can be generated in the completion. The token count of your prompt plus `max_tokens` cannot exceed the model's context length. |
-| model || string | Kept for compatibility reasons. This parameter is ignored. |
 | presence\_penalty || number | Positive values penalize new tokens based on whether they appear in the text so far, increasing the model's likelihood to talk about new topics. |
 | seed || integer | If specified, the model makes a best effort to sample deterministically, such that repeated requests with the same `seed` and parameters should return the same result.<br><br>Determinism is not guaranteed, and you should refer to the `system_fingerprint` response parameter to monitor changes in the backend. |
 | stop ||| Sequences where the API will stop generating further tokens. The returned text will not contain the stop sequence. |

@@ -199,7 +198,6 @@ The API call fails when the prompt triggers a content filter as configured. Modi
 | --- | --- | --- | --- |
 | frequency\_penalty | number | 0 | Positive values penalize new tokens based on their existing frequency in the text so far, decreasing the model's likelihood to repeat the same line verbatim. |
 | max\_tokens | integer | 256 | The maximum number of tokens that can be generated in the completion. The token count of your prompt plus `max_tokens` cannot exceed the model's context length. |
-| model | string || Kept for compatibility reasons. This parameter is ignored. |
 | presence\_penalty | number | 0 | Positive values penalize new tokens based on whether they appear in the text so far, increasing the model's likelihood to talk about new topics. |
 | prompt ||`<\|endoftext\|>`| The prompts to generate completions for, encoded as a string, array of strings, array of tokens, or array of token arrays. Note that `<\|endoftext\|>` is the document separator that the model sees during training, so if a prompt is not specified the model generates as if from the beginning of a new document. |
 | seed | integer || If specified, our system will make a best effort to sample deterministically, such that repeated requests with the same `seed` and parameters should return the same result.<br><br>Determinism is not guaranteed, and you should refer to the `system_fingerprint` response parameter to monitor changes in the backend. |
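A text-completions request built from this table can be sketched the same way; the endpoint host is a placeholder, `prompt` is the only required field (it defaults to the `<|endoftext|>` document separator when unset), and `max_tokens` is shown at its documented default of 256:

```python
import json

# Placeholder endpoint -- the real host comes from your deployment.
ENDPOINT = "https://<your-endpoint>/completions?api-version=2024-04-01-preview"

payload = {
    "prompt": "Say this is a test",  # string, string[], tokens, or token arrays
    "max_tokens": 256,               # documented default
    "frequency_penalty": 0,          # documented default
    "presence_penalty": 0,           # documented default
    "seed": 42,                      # best-effort determinism
    "stop": ["\n\n"],                # output will not contain this sequence
}
body = json.dumps(payload)
```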
articles/ai-studio/reference/reference-model-inference-embeddings.md (8 additions, 10 deletions)

@@ -38,7 +38,6 @@ POST /embeddings?api-version=2024-04-01-preview
 | dimensions || integer | The number of dimensions the resulting output embeddings should have. Returns a 422 error if the model doesn't support the value or parameter. |
 | encoding_format ||[EmbeddingEncodingFormat](#embeddingencodingformat)| The format to return the embeddings in. Either base64, float, int8, uint8, binary, or ubinary. Returns a 422 error if the model doesn't support the value or parameter. |
 | input_type ||[EmbeddingInputType](#embeddinginputtype)| The type of the input. Either `text`, `query`, or `document`. Returns a 422 error if the model doesn't support the value or parameter. |
-| model || string | Kept for compatibility reasons. This parameter is ignored. |

@@ … @@
-| dimensions | integer || The number of dimensions the resulting output embeddings should have. Returns a 422 error if the model doesn't support the value or parameter. |
-| encoding\_format |[EmbeddingEncodingFormat](#embeddingencodingformat)| float | The format to return the embeddings in. Either base64, float, int8, uint8, binary, or ubinary. Returns a 422 error if the model doesn't support the value or parameter. |
-| input ||| Input text to embed, encoded as a string or array of tokens. To embed multiple inputs in a single request, pass an array of strings or array of token arrays. |
-| input\_type |[EmbeddingInputType](#embeddinginputtype)| text | The type of the input. Either `text`, `query`, or `document`. Returns a 422 error if the model doesn't support the value or parameter. |
-| model | string || Kept for compatibility reasons. This parameter is ignored. |
+| input | True | string[] | Input text to embed, encoded as a string or array of tokens. To embed multiple inputs in a single request, pass an array of strings or array of token arrays. |
+| dimensions || integer | The number of dimensions the resulting output embeddings should have. Returns a 422 error if the model doesn't support the value or parameter. |
+| encoding_format ||[EmbeddingEncodingFormat](#embeddingencodingformat)| The format to return the embeddings in. Either base64, float, int8, uint8, binary, or ubinary. Returns a 422 error if the model doesn't support the value or parameter. |
+| input_type ||[EmbeddingInputType](#embeddinginputtype)| The type of the input. Either `text`, `query`, or `document`. Returns a 422 error if the model doesn't support the value or parameter. |
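An embeddings request using the parameters in these tables might look like the following sketch; the endpoint host is a placeholder, `input` is the only required field, and the optional fields return a 422 error on models that don't support them:

```python
import json

# Placeholder endpoint -- substitute your deployment's URL.
ENDPOINT = "https://<your-endpoint>/embeddings?api-version=2024-04-01-preview"

# `input` accepts a string or an array of strings to batch several texts.
payload = {
    "input": ["The quick brown fox", "jumped over the lazy dog"],
    "dimensions": 1024,          # honored only by models with variable dimensions
    "encoding_format": "float",  # documented default
    "input_type": "document",    # `text`, `query`, or `document`
}
body = json.dumps(payload)
```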
articles/ai-studio/reference/reference-model-inference-images-embeddings.md (0 additions, 2 deletions)

@@ -38,7 +38,6 @@ POST /images/embeddings?api-version=2024-04-01-preview
 | input | True |[EmbeddingInput](#embeddinginput)\[\]| Input image to embed. To embed multiple inputs in a single request, pass an array. The input must not exceed the max input tokens for the model. |
 | dimensions || integer | The number of dimensions the resulting output embeddings should have. Passing null causes the model to use its default value. Returns a 422 error if the model doesn't support the value or parameter. |
 | encoding\_format ||[EmbeddingEncodingFormat](#embeddingencodingformat)| The format to return the embeddings in. Either base64, float, int8, uint8, binary, or ubinary. Returns a 422 error if the model doesn't support the value or parameter. |
-| model || string | Kept for compatibility reasons. This parameter is ignored. |

 ## Responses

@@ -186,7 +185,6 @@ The API call fails when the prompt triggers a content filter as configured. Modi
 | dimensions | integer || The number of dimensions the resulting output embeddings should have. Passing null causes the model to use its default value. Returns a 422 error if the model doesn't support the value or parameter. |
 | encoding\_format |[EmbeddingEncodingFormat](#embeddingencodingformat)| float | The format to return the embeddings in. Either base64, float, int8, uint8, binary, or ubinary. Returns a 422 error if the model doesn't support the value or parameter. |
 | input |[EmbeddingInput](#embeddinginput)\[\]|| Input image to embed. To embed multiple inputs in a single request, pass an array. The input must not exceed the max input tokens for the model. |
-| model | string || Kept for compatibility reasons. This parameter is ignored. |
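An image-embeddings request follows the same pattern. A hedged sketch: the endpoint host is a placeholder, the image bytes are a stand-in for a real file, and the assumption that each `EmbeddingInput` element carries a base64-encoded `image` field is not spelled out in this diff:

```python
import base64
import json

# Placeholder endpoint -- substitute your deployment's URL.
ENDPOINT = "https://<your-endpoint>/images/embeddings?api-version=2024-04-01-preview"

fake_image_bytes = b"\x89PNG..."  # stand-in for a real image file's bytes

payload = {
    "input": [  # EmbeddingInput[]; pass several elements to batch images
        # Assumed shape: a base64 `image` field per element (not shown in this diff).
        {"image": base64.b64encode(fake_image_bytes).decode("ascii")},
    ],
    "dimensions": 512,           # null would mean "use the model's default"
    "encoding_format": "float",  # documented default
}
body = json.dumps(payload)
```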