Skip to content

Conversation

@Jan-Kazlouski-elastic
Copy link
Contributor

Creation of new NVIDIA inference provider integration allowing completion (both streaming and non-streaming) and chat_completion (only streaming) to be executed as part of inference API.
This is draft PR, rerank and text_embedding tasks are yet to be added

…ation

# Conflicts:
#	server/src/main/java/org/elasticsearch/TransportVersions.java
#	x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/InferenceNamedWriteablesProvider.java
#	x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/InferencePlugin.java
@elasticsearchmachine elasticsearchmachine added needs:triage Requires assignment of a team area label v9.2.0 external-contributor Pull request authored by a developer outside the Elasticsearch team labels Aug 4, 2025
@Jan-Kazlouski-elastic Jan-Kazlouski-elastic marked this pull request as draft August 4, 2025 10:01
@gareth-ellis gareth-ellis added :ml Machine learning Team:ML Meta label for the ML team >enhancement labels Aug 15, 2025
…ation

# Conflicts:
#	server/src/main/java/org/elasticsearch/TransportVersions.java
@pxsalehi pxsalehi removed the needs:triage Requires assignment of a team area label label Sep 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

>enhancement external-contributor Pull request authored by a developer outside the Elasticsearch team :ml Machine learning Team:ML Meta label for the ML team v9.3.0

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants