Skip to content

Commit 7b941d7

Browse files
authored
docs: fix kubernetes docs links and bump example image tags to 1.0.0 (#7400)
Signed-off-by: Dan Gil <dagil@nvidia.com>
1 parent 5e326d6 commit 7b941d7

File tree

7 files changed

+16
-16
lines changed

7 files changed

+16
-16
lines changed

deploy/helm/charts/platform/components/operator/crds/nvidia.com_dynamographdeploymentrequests.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -131,7 +131,7 @@ spec:
131131
WorkersImage specifies the container image to use for DynamoGraphDeployment worker components.
132132
This image is used for both temporary DGDs created during online profiling and the final DGD.
133133
If omitted, the image from the base config file (e.g., disagg.yaml) is used.
134-
Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.9.0"
134+
Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.0"
135135
type: string
136136
type: object
137137
enableGpuDiscovery:
@@ -206,7 +206,7 @@ spec:
206206
description: |-
207207
ProfilerImage specifies the container image to use for profiling jobs.
208208
This image contains the profiler code and dependencies needed for SLA-based profiling.
209-
Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.9.0"
209+
Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.0"
210210
type: string
211211
resources:
212212
description: |-

deploy/operator/api/v1alpha1/dynamographdeploymentrequest_types.go

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -64,7 +64,7 @@ type ProfilingConfigSpec struct {
6464

6565
// ProfilerImage specifies the container image to use for profiling jobs.
6666
// This image contains the profiler code and dependencies needed for SLA-based profiling.
67-
// Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.9.0"
67+
// Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.0"
6868
// +kubebuilder:validation:Required
6969
ProfilerImage string `json:"profilerImage"`
7070

@@ -132,7 +132,7 @@ type DeploymentOverridesSpec struct {
132132
// WorkersImage specifies the container image to use for DynamoGraphDeployment worker components.
133133
// This image is used for both temporary DGDs created during online profiling and the final DGD.
134134
// If omitted, the image from the base config file (e.g., disagg.yaml) is used.
135-
// Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.9.0"
135+
// Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.0"
136136
// +kubebuilder:validation:Optional
137137
WorkersImage string `json:"workersImage,omitempty"`
138138
}

deploy/operator/config/crd/bases/nvidia.com_dynamographdeploymentrequests.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -131,7 +131,7 @@ spec:
131131
WorkersImage specifies the container image to use for DynamoGraphDeployment worker components.
132132
This image is used for both temporary DGDs created during online profiling and the final DGD.
133133
If omitted, the image from the base config file (e.g., disagg.yaml) is used.
134-
Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.9.0"
134+
Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.0"
135135
type: string
136136
type: object
137137
enableGpuDiscovery:
@@ -206,7 +206,7 @@ spec:
206206
description: |-
207207
ProfilerImage specifies the container image to use for profiling jobs.
208208
This image contains the profiler code and dependencies needed for SLA-based profiling.
209-
Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.9.0"
209+
Example: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.0"
210210
type: string
211211
resources:
212212
description: |-

deploy/operator/config/samples/nvidia.com_v1beta1_dynamographdeploymentrequest.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ spec:
2525
backend: trtllm
2626

2727
# Image is the container image reference for the profiling job
28-
image: "nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:0.9.0"
28+
image: "nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:1.0.0"
2929

3030
# SearchStrategy controls the profiling search depth
3131
# "rapid" for fast sweep; "thorough" for deeper exploration

docs/kubernetes/inference-gateway.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -72,7 +72,7 @@ kubectl create secret generic hf-token-secret \
7272
```
7373

7474
Create a model configuration file similar to the vllm_agg_qwen.yaml for your model.
75-
This file demonstrates the values needed for the Vllm Agg setup in [agg.yaml](../../examples/backends/vllm/deploy/agg.yaml)
75+
This file demonstrates the values needed for the vLLM aggregated setup in [agg.yaml](https://github.com/ai-dynamo/dynamo/blob/main/examples/backends/vllm/deploy/agg.yaml)
7676
Take a note of the model's block size provided in the model card.
7777

7878
### 4. Build EPP image (Optional)

docs/kubernetes/snapshot.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -45,7 +45,7 @@ make docker-push-placeholder \
4545
PLACEHOLDER_IMG="${PLACEHOLDER_IMAGE}"
4646
```
4747

48-
This flow is defined in [deploy/snapshot/Makefile](../../deploy/snapshot/Makefile) and [deploy/snapshot/Dockerfile](../../deploy/snapshot/Dockerfile). The placeholder image preserves the base runtime entrypoint and command contract, and adds the CRIU, `cuda-checkpoint`, and `nsrestore` tooling needed for restore.
48+
This flow is defined in [deploy/snapshot/Makefile](https://github.com/ai-dynamo/dynamo/blob/main/deploy/snapshot/Makefile) and [deploy/snapshot/Dockerfile](https://github.com/ai-dynamo/dynamo/blob/main/deploy/snapshot/Dockerfile). The placeholder image preserves the base runtime entrypoint and command contract, and adds the CRIU, `cuda-checkpoint`, and `nsrestore` tooling needed for restore.
4949

5050
### 2. Enable checkpointing in the platform and verify it
5151

@@ -75,7 +75,7 @@ kubectl get configmap "${OPERATOR_CONFIG}" -n "${PLATFORM_NAMESPACE}" \
7575

7676
Verify that the rendered config includes `enabled: true` and the same PVC name and base path you plan to use for the snapshot chart.
7777

78-
For the full platform/operator configuration surface, see [deploy/helm/charts/platform/README.md](../../deploy/helm/charts/platform/README.md) and [deploy/helm/charts/platform/components/operator/values.yaml](../../deploy/helm/charts/platform/components/operator/values.yaml).
78+
For the full platform/operator configuration surface, see [deploy/helm/charts/platform/README.md](https://github.com/ai-dynamo/dynamo/blob/main/deploy/helm/charts/platform/README.md) and [deploy/helm/charts/platform/components/operator/values.yaml](https://github.com/ai-dynamo/dynamo/blob/main/deploy/helm/charts/platform/components/operator/values.yaml).
7979

8080
### 3. Install the snapshot chart
8181

@@ -97,11 +97,11 @@ kubectl get pvc snapshot-pvc -n ${NAMESPACE}
9797
kubectl rollout status daemonset/snapshot-agent -n ${NAMESPACE}
9898
```
9999

100-
For the full snapshot chart configuration surface, see [deploy/helm/charts/snapshot/README.md](../../deploy/helm/charts/snapshot/README.md) and [deploy/helm/charts/snapshot/values.yaml](../../deploy/helm/charts/snapshot/values.yaml).
100+
For the full snapshot chart configuration surface, see [deploy/helm/charts/snapshot/README.md](https://github.com/ai-dynamo/dynamo/blob/main/deploy/helm/charts/snapshot/README.md) and [deploy/helm/charts/snapshot/values.yaml](https://github.com/ai-dynamo/dynamo/blob/main/deploy/helm/charts/snapshot/values.yaml).
101101

102102
### 4. Apply a snapshot-compatible `DynamoGraphDeployment`
103103

104-
This example is adapted from [examples/backends/vllm/deploy/agg.yaml](../../examples/backends/vllm/deploy/agg.yaml). The worker must use the placeholder image from step 1, and the checkpoint identity must describe the runtime state you want to reuse.
104+
This example is adapted from [examples/backends/vllm/deploy/agg.yaml](https://github.com/ai-dynamo/dynamo/blob/main/examples/backends/vllm/deploy/agg.yaml). The worker must use the placeholder image from step 1, and the checkpoint identity must describe the runtime state you want to reuse.
105105

106106
```yaml
107107
apiVersion: nvidia.com/v1alpha1
@@ -490,6 +490,6 @@ Or use `mode: Auto` with the same identity and snapshot-hash label, and the oper
490490

491491
## Related Documentation
492492

493-
- [Dynamo Snapshot Helm Chart README](../../deploy/helm/charts/snapshot/README.md) - Chart configuration
493+
- [Dynamo Snapshot Helm Chart README](https://github.com/ai-dynamo/dynamo/blob/main/deploy/helm/charts/snapshot/README.md) - Chart configuration
494494
- [Installation Guide](installation-guide.md) - Platform installation
495495
- [API Reference](api-reference.md) - Complete CRD specifications

fern/components/profiler/profiler_guide.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -66,7 +66,7 @@ Each DGDR requires a container image for profiling and deployment:
6666

6767
```yaml
6868
spec:
69-
image: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.9.0"
69+
image: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.0"
7070
```
7171
7272
#### Quick Start: Deploy with DGDR
@@ -83,7 +83,7 @@ metadata:
8383
spec:
8484
model: "Qwen/Qwen3-0.6B"
8585
backend: vllm
86-
image: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.9.0"
86+
image: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.0"
8787

8888
workload:
8989
isl: 3000
@@ -229,7 +229,7 @@ metadata:
229229
spec:
230230
model: "Qwen/Qwen3-0.6B"
231231
backend: vllm
232-
image: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.9.0"
232+
image: "nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.0"
233233
234234
workload: { ... }
235235
sla: { ... }

0 commit comments

Comments
 (0)