fix: add NVIDIA API compatibility for minimax-M2 model #9001
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Fixes #8998
Problem
Users were unable to use the minimax-M2 model through NVIDIA API endpoints (integrate.api.nvidia.com) when using the OpenAI Compatible provider. The API was returning body errors when attempting to stream responses.
Root Cause
The OpenAI handler was sending
stream_options: { include_usage: true }parameter with streaming requests, which is not supported by NVIDIA API endpoints.Solution
_isNvidiaApi()method to detect NVIDIA API endpointsstream_optionsparameter for NVIDIA endpoints (similar to existing Grok XAI workaround)Testing
Additional Context
The user provided a Python example showing they were using
https://integrate.api.nvidia.com/v1as the base URL with the minimax-M2 model. This fix ensures compatibility by not sending unsupported parameters to NVIDIA endpoints.Important
Adds NVIDIA API compatibility to
OpenAiHandlerby excluding unsupportedstream_optionsin requests.stream_optionsparameter for NVIDIA API endpoints inOpenAiHandler._isNvidiaApi()method to detect NVIDIA API endpoints._isNvidiaApi()checks forintegrate.api.nvidia.com,build.nvidia.com, and*.api.nvidia.com.createMessage()andhandleO3FamilyMessage()to excludestream_optionsfor NVIDIA APIs.This description was created by
for ff2ec78. You can customize this summary. It will automatically update as commits are pushed.