Commit d63c8b1

Merge pull request #14 from oracle-devrel/cw24
feat: added files from nvidia-nim-oke
2 parents 0bf194f + 6a74070 commit d63c8b1

32 files changed: +895 -0 lines changed

nvidia-nim-oke/README.md

Lines changed: 481 additions & 0 deletions
Large diffs are not rendered by default.

nvidia-nim-oke/helm/Chart.yaml

Lines changed: 15 additions & 0 deletions
@@ -0,0 +1,15 @@
+apiVersion: v2
+name: nim-llm
+description: A Helm chart for NVIDIA NIM for LLMs
+type: application
+kubeVersion: ">=v1.23.0-0"
+# This is the chart version. This version number should be incremented each time you make changes
+# to the chart and its templates, including the app version.
+# Versions are expected to follow Semantic Versioning (https://semver.org/)
+version: 0.2.1
+
+# This is the version number of the application being deployed. This version number should be
+# incremented each time you make changes to the application. Versions are not expected to
+# follow Semantic Versioning. They should reflect the version the application is using.
+# It is recommended to use it with quotes.
+appVersion: "1.0.0"
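
The versioning comments in Chart.yaml can be illustrated with a small sketch. Assuming a later change touches only the chart's templates, the chart `version` would get a patch bump under Semantic Versioning while `appVersion` stays the same. The value `0.2.2` is a hypothetical next version, not something taken from this commit.

```yaml
# Hypothetical follow-up release: only the chart's templates changed,
# so the chart version gets a patch bump and appVersion is left alone.
version: 0.2.2       # was 0.2.1; assumed next value, not from this commit
appVersion: "1.0.0"  # unchanged; the deployed application is the same
```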

nvidia-nim-oke/helm/values.yaml

Lines changed: 27 additions & 0 deletions
@@ -0,0 +1,27 @@
+image:
+  repository: nvcr.io/nim/meta/llama3-8b-instruct:latest
+  pullPolicy: IfNotPresent
+  model: llama3-8b-instruct
+  numGpus: 1
+
+service:
+  type: ClusterIP
+  openaiPort: 8000
+  annotations: {}
+  labels: {}
+  name: "" # override the default service name
+  # below options are deprecated
+  # http_port: 8000 # exposes http interface used in healthchecks to the service
+  # grpc_port: 8001 # exposes the triton grpc interface
+  # metrics_port: 8002 # expose metrics through the main service
+  # openai_port: 8005
+  # nemo_port: 8006
+
+mount:
+  name: /opt/nim/cache
+
+imageCredentials:
+  registry: nvcr.io
+  username: $oauthtoken
+  password: <YOUR_KEY_FROM_NVIDIA>
+  email: <YOUR_EMAIL>
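
Because values.yaml ships with placeholder credentials, one common approach is to leave the file as committed and supply the real values in a separate override file passed to Helm with `-f`. The sketch below assumes a file named `custom-values.yaml` (a hypothetical name); Helm merges it over the chart's values.yaml, so only the keys that differ need to appear. The placeholders are kept as-is and must be filled in by the user.

```yaml
# custom-values.yaml (hypothetical file name, not part of this commit).
# Helm merges these keys over the chart's values.yaml; keys that are
# omitted here keep their defaults from the chart.
imageCredentials:
  password: <YOUR_KEY_FROM_NVIDIA>   # placeholder, replace with your key
  email: <YOUR_EMAIL>                # placeholder, replace with your email

service:
  type: LoadBalancer                 # example override of the ClusterIP default
```

Such a file would typically be applied with something like `helm install <release-name> nvidia-nim-oke/helm -f custom-values.yaml`, where the release name is chosen by the user.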
Lines changed: 17 additions & 0 deletions
@@ -0,0 +1,17 @@
+image:
+  imageName: nvcr.io/nvidia/tritonserver:latest
+  pullPolicy: IfNotPresent
+  model: llama3-8b-instruct
+  numGpus: 1
+
+service:
+  type: LoadBalancer
+
+mount:
+  name: /opt/nim/cache
+
+imageCredentials:
+  registry: nvcr.io
+  username: $oauthtoken
+  password: <YOUR_KEY_FROM_NVIDIA>
+  email: <YOUR_EMAIL>
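
The chart templates that consume `imageCredentials` are not shown in this diff, so the following is only a sketch of the usual Kubernetes pattern: the registry, username, password, and email are combined into a Docker config JSON, base64-encoded, and stored in an image pull secret. The secret name `registry-secret` is hypothetical, and the data value is left as a placeholder.

```yaml
# Sketch of an image pull secret for nvcr.io; the chart's real template may differ.
apiVersion: v1
kind: Secret
metadata:
  name: registry-secret                  # hypothetical name
type: kubernetes.io/dockerconfigjson
data:
  # Base64 of a Docker config JSON built from the imageCredentials values.
  .dockerconfigjson: <BASE64_ENCODED_DOCKER_CONFIG_JSON>
```

A pod would then reference the secret under `spec.imagePullSecrets` so the kubelet can pull the image from the private registry.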

nvidia-nim-oke/img/1.PNG

83.1 KB

nvidia-nim-oke/img/AccessCluster.png

114 KB

nvidia-nim-oke/img/api_key.PNG

135 KB

nvidia-nim-oke/img/cloud-init.PNG

925 KB
78 KB

nvidia-nim-oke/img/creation_1.PNG

172 KB
