Skip to content

Commit 0bb7509

Browse files
committed
Update RAG LP
1 parent 530f6ec commit 0bb7509

File tree

3 files changed

+9
-4
lines changed

3 files changed

+9
-4
lines changed

content/learning-paths/servers-and-cloud-computing/rag/_index.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3,11 +3,11 @@ title: Deploy a RAG-based Chatbot with llama-cpp-python using KleidiAI on Arm Se
33

44
minutes_to_complete: 45
55

6-
who_is_this_for: This Learning Path is for software developers, ML engineers, and those looking to deploy production-ready LLM chatbots with RAG capabilities, knowledge base integration, and performance optimization for Arm Architecture.
6+
who_is_this_for: This Learning Path is for software developers, ML engineers, and those looking to deploy production-ready LLM chatbots with Retrieval Augmented Generation (RAG) capabilities, knowledge base integration, and performance optimization for Arm Architecture.
77

88
learning_objectives:
99
- Set up llama-cpp-python optimized for Arm servers.
10-
- Implement RAG architecture using the FAISS vector database.
10+
- Implement RAG architecture using the Facebook AI Similarity Search (FAISS) vector database.
1111
- Optimize model performance through 4-bit quantization.
1212
- Build a web interface for document upload and chat.
1313
- Monitor and analyze inference performance metrics.

content/learning-paths/servers-and-cloud-computing/rag/chatbot.md

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -13,9 +13,14 @@ Open the web application in your browser using either the local URL or the exter
1313
http://localhost:8501 or http://75.101.253.177:8501
1414
```
1515

16+
{{% notice Note %}}
17+
18+
To access the links you may need to allow inbound TCP traffic in your instance's security rules. Always review these permissions with caution as they may introduce security vulnerabilities.
19+
20+
{{% /notice %}}
1621
## Upload a PDF File and Create a New Index
1722

18-
Now you can upload a PDF file in the web browser by selecting the **Create New Store** option.
23+
Now you can upload a PDF file in the web browser by selecting the **Create New Store** option.
1924

2025
Follow these steps to create a new index:
2126

content/learning-paths/servers-and-cloud-computing/rag/rag_llm.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@ This learning path demonstrates how to build and deploy a Retrieval Augmented Ge
1414

1515
## Overview
1616

17-
In this Learning Path, you learn how to build a Retrieval Augmented Generation (RAG) chatbot using llama-cpp-python, a Python binding for llama.cpp that enables efficient LLM inference on Arm CPUs.
17+
In this Learning Path, you learn how to build a RAG chatbot using llama-cpp-python, a Python binding for llama.cpp that enables efficient LLM inference on Arm CPUs.
1818

1919
The tutorial demonstrates how to integrate the FAISS vector database with the Llama-3.1-8B model for document retrieval, while leveraging llama-cpp-python's optimized C++ backend for high-performance inference.
2020

0 commit comments

Comments
 (0)