Description
I followed the Windows instructions at
https://docs.openvino.ai/2025/model-server/ovms_demos_continuous_batching_agent.html
for GPT-OSS 20B.
After invoking:
ovms.exe --rest_port 8000 --source_model openai/gpt-oss-20b --model_repository_path models --tool_parser gptoss --reasoning_parser gptoss --target_device GPU --task text_generation --pipeline_type LM
it failed with:
[2025-12-15 09:30:12.280][20660][serving][error][servable_initializer.cpp:104] Error during llm node initialization for models_path: C:\OpenVino\models\openai\gpt-oss-20b\./ exception: Exception from src\inference\src\cpp\core.cpp:117: Exception from src\inference\src\dev\plugin.cpp:53: Exception from src\core\src\pass\graph_rewrite.cpp:298: [ConvertFullyConnectedToFullyConnectedCompressed] END: node: gpu_opset::FullyConnected __module.model.layers.0.mlp.experts/aten::bmm/MatMul (opset1::Reshape __module.model.layers.0.mlp.experts/aten::view/Reshape[0]:f16[32,?,2880], opset1::Reshape Reshape_215611[0]:f16[32,5760,2880], gpu_opset::Placeholder Placeholder_113014[0]:dynamic[]) -> (f16[32,?,5760]) CALLBACK HAS THROWN: Check 'other_size == new_size' failed at src\core\src\op\constant.cpp:347: ov::Shape size 1061683200 is not equal to 16588800
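A rough arithmetic check on the two sizes in the last line of the log, in case it helps triage. This is only a sketch using the numbers printed above, not a diagnosis. The variable names are mine, and the interpretation of the shapes (32 as a leading batch/expert dimension, 5760 x 2880 as one weight slice) is an assumption based on the node description in the message.

```python
# Shapes and sizes copied verbatim from the error message above.
weight_shape = (32, 5760, 2880)   # f16[32,5760,2880] input of the FullyConnected node
reported_size = 1061683200        # "ov::Shape size" in the failed check
expected_size = 16588800          # the size it was compared against

# Element count of a single [5760, 2880] slice and of the full 3D weight.
per_slice = weight_shape[1] * weight_shape[2]
full_tensor = weight_shape[0] * per_slice

print(per_slice == expected_size)      # True: expected size matches one 2D slice
print(reported_size // full_tensor)    # 2: reported size is twice the full 3D weight
print(reported_size // expected_size)  # 64: overall ratio between the two sizes
```

So the expected size equals one 2D slice (5760 x 2880), while the reported size is 2 x 32 x 5760 x 2880. I do not know where the extra factor of 2 comes from; the numbers only suggest that the 3D weight shape and a 2D constant are being mismatched somewhere in the ConvertFullyConnectedToFullyConnectedCompressed pass.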