Skip to content

Conversation

sjberman
Copy link
Collaborator

Add the design for supporting the Gateway API Inference Extension. This would allow NGF to configure NGINX to route traffic to AI workloads in Kubernetes, using specialized load-balancing.

Closes #3716

Checklist

Before creating a PR, run through this checklist and mark each as complete.

  • I have read the CONTRIBUTING doc
  • I have added tests that prove my fix is effective or that my feature works
  • I have checked that all unit tests pass after adding my changes
  • I have updated necessary documentation
  • I have rebased my branch onto main
  • I will ensure my PR is targeting the main branch and pulling from my branch from my own fork

Release notes

If this PR introduces a change that affects users and needs to be mentioned in the release notes,
please add a brief note that summarizes the change.


@sjberman sjberman requested review from a team as code owners August 27, 2025 17:03
@github-actions github-actions bot added the documentation Improvements or additions to documentation label Aug 27, 2025
@salonichf5
Copy link
Contributor

this is really well written :)

@sjberman
Copy link
Collaborator Author

sjberman commented Sep 2, 2025

@shaun-nx @salonichf5 Added a diagram, let me know if that helps or if there's anything missing.

@sjberman sjberman force-pushed the proposal/inference-extension branch from 0a69101 to 12db9ec Compare September 3, 2025 16:58
Add the design for supporting the Gateway API Inference Extension. This would allow NGF to configure NGINX to route traffic to AI workloads in Kubernetes, using specialized load-balancing.
@sjberman sjberman force-pushed the proposal/inference-extension branch from 12db9ec to 7cb0166 Compare September 3, 2025 16:58
Copy link
Contributor

@bjee19 bjee19 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm, nice work

Copy link
Contributor

@ciarams87 ciarams87 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work! 🚀

Copy link
Contributor

@shaun-nx shaun-nx left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Really great work @sjberman ! 🎉

@sjberman sjberman enabled auto-merge (squash) September 4, 2025 15:27
@sjberman sjberman merged commit 2c44c7b into nginx:main Sep 4, 2025
41 checks passed
@github-project-automation github-project-automation bot moved this from 🆕 New to ✅ Done in NGINX Gateway Fabric Sep 4, 2025
@sjberman sjberman deleted the proposal/inference-extension branch September 4, 2025 16:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

Enhancement Proposal: Gateway API Inference Extension
5 participants