This implementation of watsonx Assistant with watsonx Discovery (Elasticsearch) provides the following features:
- Conversation history
- Query rewrite
- Semantic search
- Answer generation
Edit the notebook watsonx-rag-elasticsearch.ipynb, and update the required variables.
This notebook demonstrates how to:
- keep fine-grained control over the ingestion process, such as chunking
- use pipelines to offload text embedding to Elasticsearch
- create an Elasticsearch index
- search for query terms using pipelines
- deploy and test prompts on Watson Machine Learning deployment spaces
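The ingestion steps the notebook covers can be sketched roughly as below. This is illustrative only: the chunking strategy, pipeline id (`text-embedding`), index name (`search-rag`), endpoint URL, and embedding dimensions are assumptions, not the notebook's exact code — only the model id matches the one used later in this guide.

```python
def chunk(text, size=512, overlap=64):
    """Fixed-size character chunking with overlap -- one simple strategy."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

# Ingest pipeline that offloads embedding to Elasticsearch via an inference processor.
pipeline = {
    "processors": [{
        "inference": {
            "model_id": "intfloat__multilingual-e5-base",  # assumed deployed model
            "target_field": "text_embedding",
            "field_map": {"text": "text_field"},
        }
    }]
}

# Index mapping with a dense_vector field for kNN search (e5-base outputs 768 dims).
mapping = {
    "properties": {
        "text": {"type": "text"},
        "text_embedding": {"properties": {"predicted_value": {
            "type": "dense_vector", "dims": 768, "index": True,
            "similarity": "cosine"}}},
    }
}

def ingest(raw_text, es_url="http://localhost:9200"):
    """Create the pipeline and index, then index each chunk through the pipeline."""
    from elasticsearch import Elasticsearch  # assumes the elasticsearch client is installed
    es = Elasticsearch(es_url)
    es.ingest.put_pipeline(id="text-embedding", processors=pipeline["processors"])
    es.indices.create(index="search-rag", mappings=mapping)
    for c in chunk(raw_text):
        es.index(index="search-rag", pipeline="text-embedding", document={"text": c})
```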
Create the following session variables:
- `query` of text type
- `context` of any type
- `history` of any type
- Elasticsearch
- watsonx.ai
This action consists of 3 steps for the RAG process:
- Rewrite the query based on conversational history
- Retrieve information using semantic search
- Generate an answer based on the retrieved information
Set the `history` variable to the following expression:
${history} = ${system_session_history}.transform("role", "<|start_header_id|>user<|end_header_id|>\n\n", "<|start_header_id|>assistant<|end_header_id|>\n\n").joinToArray("%e.role%%e.content%").join("<|eot_id|>") + "<|eot_id|>"
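For reference, the expression above is equivalent to the following Python sketch, which shows the Llama 3 chat-template string it produces (assuming `system_session_history` is a list of `{role, content}` entries, as watsonx Assistant stores it):

```python
def build_history(session_history):
    """Render session history with Llama 3 role headers, one <|eot_id|> per turn."""
    role_tag = {
        "user": "<|start_header_id|>user<|end_header_id|>\n\n",
        "assistant": "<|start_header_id|>assistant<|end_header_id|>\n\n",
    }
    # transform(...) swaps each role for its header tag; joinToArray concatenates
    # role + content per element; join(...) inserts <|eot_id|> between turns,
    # and a final <|eot_id|> closes the last turn.
    turns = [role_tag[m["role"]] + m["content"] for m in session_history]
    return "<|eot_id|>".join(turns) + "<|eot_id|>"
```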
In the And then section, select Use an extension.
- Select the watsonx.ai extension that was created in Step 2.2.
- Select the Text Generation operation.
- Under parameters:
  - set the `deployment_id` for `rewrite-prompt` that was created in Step 1
  - set the `version` to `2023-05-29`
- Under optional parameters, set `parameters.prompt_variables.history` to the `history` variable.
- Under stream response, set `text` to `results[0].generated_text`.
Set the `query` variable to the following expression:
${query} = ${step_950_result_2.body.results}[0]["generated_text"]
Edit the expression using the editor to select the previous step, then select body.results.
The final expression should look like the image below.
In the And then section, select Use an extension.
- Select the Elasticsearch extension that was created in Step 2.2.
- Select the Search request operation.
- Under parameters, fill in the `index_name` that was used in Step 1.
- Under optional parameters, fill in the `knn`, `fields`, and `source` fields.
For `knn`:

```json
{
  "field": "text_embedding",
  "query_vector_builder": {
    "text_embedding": {
      "model_id": "intfloat__multilingual-e5-base",
      "model_text": ${query}
    }
  },
  "k": 5,
  "num_candidates": 50
}
```

Note: set `model_id` according to the embedding model used in Step 1.
For `fields`, fill in `["text"]`.
Note: `["text"]` will depend on which field names were used in Step 1.
For `source`, select False.
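The equivalent Search request can be sketched with the Elasticsearch Python client as below. The index name `search-rag` and the client endpoint are assumptions for illustration; the `knn` clause, `fields`, and `source` settings mirror the values above.

```python
# kNN clause matching the extension's knn parameter; Elasticsearch embeds the
# query text server-side via query_vector_builder, so no vector is sent.
knn_clause = {
    "field": "text_embedding",
    "query_vector_builder": {
        "text_embedding": {
            "model_id": "intfloat__multilingual-e5-base",  # match the Step 1 model
            "model_text": "placeholder",  # replaced with the rewritten query
        }
    },
    "k": 5,
    "num_candidates": 50,
}

def semantic_search(query_text, es_url="http://localhost:9200"):
    """Run the kNN search, returning only the requested fields (no _source)."""
    from elasticsearch import Elasticsearch  # assumes the elasticsearch client is installed
    es = Elasticsearch(es_url)
    knn_clause["query_vector_builder"]["text_embedding"]["model_text"] = query_text
    return es.search(index="search-rag", knn=knn_clause,
                     fields=["text"], source=False)
```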
Click on New step +
Set the `context` variable to the following expression:
${context} = "\n<documents><document>\n" + ${step_181_result_1.body.hits.hits}.joinToArray("%e.fields.text[0]%").join("\n</document>\n\n<document>\n") + "\n</document></documents>\n"
Edit expression using the editor to select the previous step and select body.hits.hits.
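The `${context}` expression wraps each hit's text field in `<document>` tags. In Python terms it amounts to the sketch below (assuming the `fields` layout returned by the Search request above):

```python
def build_context(hits):
    """Join hit texts into a <documents>...</documents> block for the prompt."""
    # Each hit carries fields.text as a list because `fields` was requested
    # instead of _source; take the first value per hit.
    docs = [h["fields"]["text"][0] for h in hits]
    return ("\n<documents><document>\n"
            + "\n</document>\n\n<document>\n".join(docs)
            + "\n</document></documents>\n")
```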
Set the `history` variable to the following expression:
${history} = ${system_session_history}.transform("role", "<|start_header_id|>user<|end_header_id|>\n\n", "<|start_header_id|>assistant<|end_header_id|>\n\n").joinToArray("%e.role%%e.content%").join("<|eot_id|>") + "<|eot_id|>"
In the And then section, select Use an extension.
- Select the watsonx.ai extension that was created in Step 2.2.
- Select the Text Generation Stream operation.
- Under parameters:
  - set the `deployment_id` for `conversational-prompt` that was created in Step 1
  - set the `version` to `2023-05-29`
- Under optional parameters, set `parameters.prompt_variables` to the expression `{ "context": ${context}, "history": ${history} }`.
- Under stream response, set `text` to `results[0].generated_text`.
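Behind the extension, this step posts the prompt variables to the watsonx.ai deployment's streaming text-generation endpoint. A rough sketch follows; the host, token handling, and endpoint path are assumptions based on the watsonx.ai deployments REST API, and the deployment id is the `conversational-prompt` deployment from Step 1.

```python
def generation_payload(context, history):
    """Request body carrying the two prompt variables set above."""
    return {"parameters": {"prompt_variables": {"context": context,
                                                "history": history}}}

def stream_answer(deployment_id, payload, token,
                  host="https://us-south.ml.cloud.ibm.com"):
    """POST to the deployment's streaming endpoint (path is an assumption)."""
    import requests  # assumes the requests package is installed
    url = f"{host}/ml/v1/deployments/{deployment_id}/text/generation_stream"
    return requests.post(url, json=payload,
                         params={"version": "2023-05-29"},
                         headers={"Authorization": f"Bearer {token}"},
                         stream=True)
```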