Skip to content

Commit e0536d3

Browse files
authored
Merge pull request #898 from chtyler/RHAIENG-306-modify-docs-on-deploying-llamastackdistrubution-instance
RHAI-ENG-306-modify-docs-on-deploying-llamastackdistribution-instanc…
2 parents 1af3622 + ef570e8 commit e0536d3

File tree

2 files changed

+8
-13
lines changed

2 files changed

+8
-13
lines changed

modules/deploying-a-llama-model-with-kserve.adoc

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ To use Llama Stack and retrieval-augmented generation (RAG) workloads in {produc
1010

1111
* You have logged in to {productname-long}.
1212
* You have cluster administrator privileges for your {openshift-platform} cluster.
13-
* You have installed the Llama Stack Operator.
13+
* You have activated the Llama Stack Operator.
1414
ifdef::upstream[]
1515
For more information, see link:{odhdocshome}/working-with-rag/#installing-the-llama-stack-operator_rag[Installing the Llama Stack Operator].
1616
endif::[]

modules/deploying-a-llamastackdistribution-instance.adoc

Lines changed: 7 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -4,14 +4,7 @@
44
= Deploying a LlamaStackDistribution instance
55

66
[role='_abstract']
7-
You can integrate LlamaStack and its retrieval-augmented generation (RAG) capabilities with your deployed Llama 3.2 model served by vLLM. This integration enables you to build intelligent applications that combine large language models (LLMs) with real-time data retrieval, providing more accurate and contextually relevant responses for your AI workloads.
8-
9-
When you create a `LlamaStackDistribution` custom resource (CR), specify the Llama Stack image `quay.io/opendatahub/llama-stack:odh` in the `spec.server.distribution.image` field. The image is hosted on link:https://quay.io[Quay.io], a secure registry that provides vulnerability scanning, role‑based access control, and globally distributed content delivery. Using this {org-name}–validated image ensures that your deployment automatically receives the latest security patches and compatibility updates. For more information about working with Quay.io, see link:https://docs.redhat.com/en/documentation/red_hat_quay/3/html/about_quay_io/quayio-overview[Quay.io overview].
10-
11-
[IMPORTANT]
12-
====
13-
The Llama Stack image is hosted on link:https://quay.io[Quay.io] only during the Developer Preview phase of the Llama Stack integration with {productname-short}. When the Llama Stack integration reaches general availability, the image will be available on link:https://registry.redhat.io[registry.redhat.io].
14-
====
7+
You can integrate LlamaStack and its retrieval-augmented generation (RAG) capabilities with your deployed Llama 3.2 model served by vLLM. You can use this integration to build intelligent applications that combine large language models (LLMs) with real-time data retrieval, providing more accurate and contextually relevant responses for your AI workloads. When you create a `LlamaStackDistribution` custom resource (CR), specify `rh-dev` in the `spec.server.distribution.name` field.
158

169
ifdef::self-managed[]
1710
ifdef::disconnected[]
@@ -42,7 +35,6 @@ endif::[]
4235

4336
.Procedure
4437

45-
4638
. Open a new terminal window.
4739
.. Log in to your {openshift-platform} cluster from the CLI:
4840
.. In the upper-right corner of the OpenShift web console, click your user name and select *Copy login command*.
@@ -119,10 +111,13 @@ spec:
119111
name: llama-stack
120112
port: 8321
121113
distribution:
122-
image: quay.io/opendatahub/llama-stack:odh
123-
storage:
124-
size: "5Gi"
114+
name: rh-dev
125115
----
116+
+
117+
[NOTE]
118+
====
119+
The `rh-dev` value is an internal image reference. When you create the `LlamaStackDistribution` custom resource, the {productname-short} Operator automatically resolves `rh-dev` to the container image in the appropriate registry. This internal image reference allows the underlying image to update without requiring changes to your custom resource.
120+
====
126121

127122
. Click *Create*.
128123

0 commit comments

Comments
 (0)