@@ -255,16 +255,16 @@ Retrieves a list of all available Foundry Local models in the catalog.
- ` version ` : The version number of the model.
- ` modelType ` : The format or type of the model (e.g., ONNX).
- ` promptTemplate ` :
- - ` assistant ` : The template for the assistant's response.
- - ` prompt ` : The template for the user-assistant interaction.
+ - ` assistant ` : The template for the assistant's response.
+ - ` prompt ` : The template for the user-assistant interaction.
- ` publisher ` : The entity or organization that published the model.
- ` task ` : The primary task the model is designed to perform (e.g., chat-completion).
- ` runtime ` :
- - ` deviceType ` : The type of hardware the model is designed to run on (e.g., CPU).
- - ` executionProvider ` : The execution provider used for running the model.
+ - ` deviceType ` : The type of hardware the model is designed to run on (e.g., CPU).
+ - ` executionProvider ` : The execution provider used for running the model.
- ` fileSizeMb ` : The size of the model file in megabytes.
- ` modelSettings ` :
- - ` parameters ` : A list of configurable parameters for the model.
+ - ` parameters ` : A list of configurable parameters for the model.
- ` alias ` : An alternative name or shorthand for the model.
- ` supportsToolCalling ` : Indicates whether the model supports tool-calling functionality.
- ` license ` : The license type under which the model is distributed.
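
As a rough illustration of how a client might read the fields listed above, the sketch below flattens one catalog entry into a small summary object. It assumes the entry has already been decoded from JSON into a Python dict, and that `deviceType` and `executionProvider` are nested under `runtime` as the list indicates. The names `ModelSummary` and `summarize_model` and all example values are hypothetical, and the field types (for instance, `fileSizeMb` as a number) are assumptions rather than documented guarantees.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class ModelSummary:
    """Subset of a catalog entry, limited to fields named in the list above."""
    alias: str
    task: str
    device_type: str
    execution_provider: str
    file_size_mb: float
    supports_tool_calling: bool


def summarize_model(entry: dict[str, Any]) -> ModelSummary:
    """Flatten one decoded catalog entry; missing fields fall back to defaults."""
    # `runtime` is a nested object holding deviceType and executionProvider.
    runtime = entry.get("runtime") or {}
    return ModelSummary(
        alias=entry.get("alias", ""),
        task=entry.get("task", ""),
        device_type=runtime.get("deviceType", ""),
        execution_provider=runtime.get("executionProvider", ""),
        file_size_mb=float(entry.get("fileSizeMb", 0)),
        supports_tool_calling=bool(entry.get("supportsToolCalling", False)),
    )


# Hypothetical entry: the values are illustrative, not real catalog data.
example_entry = {
    "alias": "example-model",
    "task": "chat-completion",
    "runtime": {"deviceType": "CPU", "executionProvider": "ExampleProvider"},
    "fileSizeMb": 1024,
    "supportsToolCalling": False,
}

summary = summarize_model(example_entry)
print(summary.alias, summary.device_type, summary.file_size_mb)
```

Falling back to defaults instead of raising on a missing key is a design choice for this sketch; a stricter client might prefer to fail when a field it depends on is absent.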
@@ -552,7 +552,7 @@ Counts tokens for a given chat completion request without performing inference.
"content": "Hello, what is Microsoft?"
}
],
- "model": "cpu-int4-rtn-block-32-acc-level-4"
+ "model": "Phi-4-mini-instruct-cuda-gpu"
}
```
- Response body