GPU_provisioning.md: 9 additions & 2 deletions
@@ -43,11 +43,18 @@ spec:
Use `kubectl` or `oc` command line to create new machineset `oc apply -f gpu_machineset.yaml`
-Depending on type of EC2 instance creation of the new machines make take some time. Please note that all nodes with GPU will have labels(`node-role.kubernetes.io/odh-notebook`in our case) and taints (`odh-notebook `) that we have specified in machineset applied automatically
+Depending on type of EC2 instance creation of the new machines make take some
+time. Please note that all nodes with GPU will have
+labels(`node-role.kubernetes.io/odh-notebook` in our case) and taints
+(`odh-notebook`) that we have specified in machineset applied automatically
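
As a minimal sketch of how one might confirm that the labels and taints described above landed on the new nodes, the commands below list the machinesets, the labeled nodes, and their taint keys. It assumes the label key `node-role.kubernetes.io/odh-notebook` from the paragraph above, that machinesets live in the usual `openshift-machine-api` namespace, and that `oc` is already logged in to the cluster; `GPU_LABEL` is a variable name introduced here for illustration only.

```sh
# Label key taken from the paragraph above (assumed to match your machineset).
GPU_LABEL="node-role.kubernetes.io/odh-notebook"

if command -v oc >/dev/null 2>&1; then
  # Show the machinesets; the new GPU machineset should appear here.
  oc get machinesets -n openshift-machine-api

  # List only the nodes that carry the GPU label.
  oc get nodes -l "$GPU_LABEL"

  # Print the taint keys set on those nodes.
  oc get nodes -l "$GPU_LABEL" \
    -o custom-columns='NAME:.metadata.name,TAINTS:.spec.taints[*].key'
else
  echo "oc not found; skipping cluster checks" >&2
fi
```

New machines can take several minutes to join the cluster, so an empty node list right after applying the machineset is not necessarily an error.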
## Install Node Feature Operator
-From OperatorHub install `Node Feature Discovery Operator` , accepting defaults . Once Operator has been installed , create `NodeFeatureDiscovery`instance . Use default entries unless you something specific is needed . Node Feature Discovery Operator will add labels to nodes based on available hardware resources
+From OperatorHub install `Node Feature Discovery Operator` , accepting
+defaults. Once Operator has been installed, create
+`NodeFeatureDiscovery`instance. Use default entries unless you something
+specific is needed. Node Feature Discovery Operator will add labels to nodes
_Figure 5. Schematic diagram for RAG demo augmented query._
-
In Figure 5, we can see RAG augmented query. [IBM Granite 3.1-8B-Instruct](https://huggingface.co/ibm-granite/granite-3.1-8b-instruct) model is used for language processing, LangChain to
integrate different tools of the LLM-based application together and to process the PDF
files and web pages, vector database provider such as EDB Postgres for Kubernetes or Redis, is used to store vectors, and [Red Hat OpenShift AI](https://www.redhat.com/en/technologies/cloud-computing/openshift/openshift-ai) to serve the [IBM Granite 3.1-8B-Instruct](https://huggingface.co/ibm-granite/granite-3.1-8b-instruct) model, Gradio is used for user interface and object storage to store language model and other datasets.
Solution components are deployed as microservices in the Red Hat OpenShift cluster.
-
#### Download diagrams
+
View and download all of the diagrams above in our open source tooling site.
-To run the demo, ensure the Podman is running on your machine.Fork the [rag-llm-gitops](https://github.com/validatedpatterns/rag-llm-gitops)repo into your organization
+To run the demo, ensure the Podman is running on your machine.Fork the [rag-llm-gitops](https://github.com/validatedpatterns/rag-llm-gitops)repository into your organization
### Login to OpenShift cluster
-Replace the token and the api server url in the command below to login to the OpenShift cluster.
+Replace the token and the API server URL in the command below to login to the OpenShift cluster.
```sh
oc login --token=<token> --server=<api_server_url> # login to Openshift cluster
@@ -112,7 +105,7 @@ cd rag-llm-gitops
### Configuring model
-This pattern deploys [IBM Granite 3.1-8B-Instruct](https://huggingface.co/ibm-granite/granite-3.1-8b-instruct) out of box. Run the following command to configure vault with the model Id.
+This pattern deploys [IBM Granite 3.1-8B-Instruct](https://huggingface.co/ibm-granite/granite-3.1-8b-instruct) out of box. Run the following command to configure vault with the model ID.
```sh
# Copy values-secret.yaml.template to ~/values-secret-rag-llm-gitops.yaml.
@@ -254,10 +247,10 @@ By default, Grafana application is deployed in `llm-monitoring` namespace.To lau
## Test Plan
+
GOTO: [Test Plan](./TESTPLAN.md)
## Licenses
EDB Postgres for Kubernetes is distributed under the EDB Limited Usage License
Agreement, available at [enterprisedb.com/limited-use-license](https://www.enterprisedb.com/limited-use-license).