
Add private OCI OKE cookbook for Llama Nemotron Nano 8B #117

Open
fede-kamel wants to merge 4 commits into NVIDIA-NeMo:main from fede-kamel:fk/oci-phoenix-private-nemotron

Conversation

@fede-kamel

fede-kamel commented Mar 16, 2026

Summary

  • add a Nemotron-specific OCI cookbook for nvidia/Llama-3.1-Nemotron-Nano-8B-v1
  • document a validated private-only OKE deployment in us-phoenix-1 with no public control-plane endpoint, no public worker IPs, and no public inference endpoint
  • add a checked-in Terraform wrapper for the private Phoenix OKE infrastructure using Oracle's official OKE module plus OCI Bastion service
  • include a known-good vLLM values file for a single VM.GPU.A10.1 node
  • surface the new OCI cookbook in the root README and cookbook index

Validation

Validated against a live Phoenix OKE deployment of nvidia/Llama-3.1-Nemotron-Nano-8B-v1 using a private cluster plus OCI Bastion/tunnel access:

  • terraform plan
  • terraform apply
  • private OKE cluster active
  • OCI Bastion service active
  • CPU node pool active
  • GPU node pool active in PHX-AD-2
  • /health
  • /v1/models
  • chat completion
  • tool calling
  • streaming
  • async concurrent requests

Notes

  • this contribution is intentionally Nemotron-specific and OCI-specific
  • the deployment guidance is private-only and does not use public IPs for the Kubernetes API or inference endpoint
  • the OCI path is documented as a reproducible option comparable to common AWS GPU/Kubernetes deployment patterns, without claiming that AWS Terraform already exists in this repo
  • the Bastion resource is the OCI Bastion service, not a public bastion VM

Signed-off-by: Federico Kamelhar <federico.kamelhar@oracle.com>
@fede-kamel
Author

@chrisalexiuk-nvidia @anushapant @shashank3959 — Friendly follow-up! This PR adds an OCI OKE deployment cookbook for Llama Nemotron Nano 8B. Would love to get a review when you get a chance. Thanks!

@fede-kamel
Author

fede-kamel commented Mar 27, 2026

Hey team 👋 — just checking in on this one. Happy to address any feedback or make adjustments to scope if that helps move things along. Let me know if there's anything I can do on my end! @chrisalexiuk-nvidia @anushapant @shashank3959

@fede-kamel
Author

✅ Cross-validated with NeMo Agent Toolkit OCI integration

This OKE deployment is now serving as the live inference backend for NVIDIA/NeMo-Agent-Toolkit#1804, which adds first-class OCI Generative AI support to the Agent Toolkit.

The full Agent Toolkit OCI test suite — 11/11 tests pass — was validated against the nvidia/Llama-3.1-Nemotron-Nano-8B-v1 endpoint running on this exact private OKE infrastructure in us-phoenix-1. Both PRs together deliver a complete story: reproducible OCI deployment (this PR) powering a production-ready LLM provider and LangChain integration (Toolkit PR).

@fede-kamel
Author

@chrisalexiuk-nvidia @anushapant @shashank3959 — Quick update — we just used this exact OKE deployment to validate the full OCI integration for NeMo Agent Toolkit! 🚀

The nvidia/Llama-3.1-Nemotron-Nano-8B-v1 model running on the private Phoenix cluster documented in this cookbook passed 11/11 tests as the live inference backend for NVIDIA/NeMo-Agent-Toolkit#1804 — covering the OCI provider, LangChain wrapper, and an end-to-end agent workflow.

Really exciting to see both pieces come together: this PR provides the reproducible OCI deployment, and the Toolkit PR builds on top of it with a first-class integration. Two repos, one Nemotron story on OCI. Looking forward to your review!
