README.md
Lines changed: 27 additions & 24 deletions
@@ -1,41 +1,44 @@
1
-
# Retrieval Augmented Generation (RAG) stack
1
+
# Retrieval Augmented Generation Pattern for Watsonx on IBM Cloud
2
2
3
-
To run the full stack, follow these steps. These steps will be updated as development progresses on the stack and underlying DAs.
3
+
The following [deployable architecture](https://cloud.ibm.com/docs/secure-enterprise?topic=secure-enterprise-understand-module-da#what-is-da) automates the deployment of a sample GenAI Pattern on IBM Cloud, including all underlying infrastructure. This architecture implements the best practices for Watsonx GenAI Pattern deployment on IBM Cloud, as described in the [reference architecture](https://cloud.ibm.com/docs/pattern-genai-rag?topic=pattern-genai-rag-genai-pattern).
4
4
5
-
## 1. Deploy the stack in a new project from catalog
Click the "Add to Project" button and select "Create in new project."
14
+
15
+
## 2. Prerequisites in Target Account
12
16
13
17
Before deploying the stack, ensure you have:
14
-
- Created an API key in the target account with sufficient permissions. Note the API key, as it will be used later.
15
-
- For now, grant it admin privileges. The exact permissions required will be refined in future versions.
16
-
- Install the IBM Cloud CLI's Project addon using `ibmcloud plugin install project` command. More info here: https://cloud.ibm.com/docs/cli?topic=cli-projects-cli
17
18
19
+
* Created an API key in the target account with sufficient permissions. Note the API key, as it will be used later. For now, grant it admin privileges. The exact permissions required will be refined in future versions.
20
+
* Installed the IBM Cloud CLI's Project add-on using the `ibmcloud plugin install project` command. More information is available here: https://cloud.ibm.com/docs/cli?topic=cli-projects-cli
18
21
19
-
## 3. Set the input configuration for the stack
22
+
## 3. Set the Input Configuration for the Stack
20
23
21
-
- Clone this repository locally.
22
-
- Create a file with name ".def.json" with the following content.
24
+
* Clone this repository locally.
25
+
* Create a file named ".def.json" with the following content:
23
26
24
27
**Important**:
25
-
- Ensure region is either us-south or eu-de as watsonx can only be deployed in those 2 locations for now.
26
-
- Ensure that the prefix is globally unique. It is used for the container registry namespace (which needs to be globally unique) in this alpha version.
27
-
- If specifying `existing_secrets_manager_crn`, the ibmcloud_api_key that is passed as an input must have the documented read and write access to the instance
28
-
- If specifying `existing_secrets_manager_crn`, ensure that the default security group does not contain secrets named `signing-key` and `ibmcloud-api-key`. The RAG DA currently always attempt to create secret with those names (temporary issue - to be fixed).
28
+
* Ensure the region is either us-south or eu-de, as Watsonx can only be deployed in those two locations for now.
29
+
* Ensure that the prefix is globally unique. It is used for the container registry namespace (which needs to be globally unique) in this alpha version.
30
+
* If specifying `existing_secrets_manager_crn`, the `ibmcloud_api_key` that is passed as an input must have the documented read and write access to the instance.
31
+
* If specifying `existing_secrets_manager_crn`, ensure that the default security group does not contain secrets named `signing-key` and `ibmcloud-api-key`. The RAG DA currently always attempts to create a secret with those names (temporary issue - to be fixed).
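The region and prefix checks in the list above could be run as a small pre-flight step before filling in the inputs. This is a minimal sketch, not part of this repository or of `deploy-many.sh`: the function names `check_region` and `check_prefix` are hypothetical, and the prefix check only catches an empty or placeholder value, since global uniqueness cannot be verified locally.

```bash
#!/usr/bin/env bash
# Hypothetical pre-flight helpers (assumed names; not shipped with this repository).

# Succeeds only for the two regions the README lists as supported.
check_region() {
  case "$1" in
    us-south|eu-de) return 0 ;;
    *) return 1 ;;
  esac
}

# Succeeds when the prefix is non-empty and is not an unedited <placeholder>.
# This is a sanity check only; it cannot prove the prefix is globally unique.
check_prefix() {
  case "$1" in
    ""|"<"*) return 1 ;;
    *) return 0 ;;
  esac
}
```

For example, `check_region "$REGION" || echo "unsupported region: $REGION" >&2` would flag a bad value before any deployment is attempted.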
29
32
30
33
```json
31
34
{
32
35
"inputs": {
33
36
"prefix": "<prefix for resources name - ensure unique>",
34
37
"ibmcloud_api_key": "<API Key of the target account with sufficient permissions>",
35
-
"resource_group_name": "<target resource group - name of a new resource group that the stack will creates>",
38
+
"resource_group_name": "<target resource group - name of a new resource group that the stack will create>",
36
39
"region": "<region where all resources are deployed>",
- Ensure you are login into the account containing the Cloud project with the stack using ibmcloud login --sso
67
-
- Execute ./deploy-many.sh with project name, stack name and optional configuration name pattern. The selected non-stack configruations will be processed by their name in alphabetical order. Using configuration name pattern (regex can be used - make sure to enclose it in quotes) you can chose which configurations are deployed
69
+
* Ensure you are logged in to the account containing the Cloud project with the stack using `ibmcloud login --sso`.
70
+
* Execute `./deploy-many.sh` with the project name, stack name, and optional configuration name pattern. The selected non-stack configurations will be processed by their name in alphabetical order. Using the configuration name pattern (regex can be used - make sure to enclose it in quotes), you can choose which configurations are deployed.
68
71
69
-
Example 1 - update stack inputs for stack configuration `RAG` and process all non-stack configurations in the project:
72
+
Example 1 - Update stack inputs for stack configuration `RAG` and process all non-stack configurations in the project:
70
73
```bash
71
74
./deploy-many.sh my-test-project RAG
72
75
```
73
76
74
-
Example 2 - update stack inputs and process some configurations in the project:
77
+
Example 2 - Update stack inputs and process some configurations in the project:
Example 3 - simulate updating stack inputs and validating some configurations in the project in dry-run mode (no changes or actual validation or deployments is done):
82
+
Example 3 - Simulate updating stack inputs and validating some configurations in the project in dry-run mode (no changes or actual validation or deployments are done):
ibm_catalog.json
Lines changed: 2 additions & 2 deletions
@@ -14,7 +14,7 @@
14
14
"watson",
15
15
"ai"
16
16
],
17
-
"short_description": "An automated solution that deploys a sample application illustrating how to implement the RAG Pattern with watsonx.ai and IBM Cloud services.",
17
+
"short_description": "An automated solution that deploys a sample application illustrating how to implement the RAG Pattern with watson.ai and IBM Cloud services.",