Note that the configuration should be similar to the one used in `embed.py` to ensure vectors will be read in the same format as the one used to create and store them.
</Message>
### Configure the LLM client and create a basic RAG pipeline
3. Edit `rag.py` to configure the LLM client using `ChatOpenAI` and create a simple RAG pipeline:
scw instance server stop example-28f3-4e91-b2af-4c3502562d72
This will shut down the Instance with the specified instance-uuid. Note that this command stops the Instance but does not shut it down completely.
```
This command is correct and can be used with the Scaleway CLI.
Note that vector embeddings enabled the system to retrieve the proper document chunks even though the Scaleway cheatsheet never mentions `shut down` but only `power off`.
This command is incorrect and 'hallucinates' in several ways to fit the question prompt content: `scaleway` instead of `scw`, `instance` instead of `instance server`, `shutdown` instead of `stop`, and the `--instance-uuid` parameter does not exist.
### Query the RAG system with your own prompt template
print(r, end="", flush=True)
```
- `PromptTemplate` enables you to customize how the retrieved context and question are passed through the LLM prompt.
- `retriever.invoke` lets you customize which part of the LLM input is used to retrieve documents.
- `create_stuff_documents_chain` provides the prompt template to the LLM.
6. You can now execute your custom RAG pipeline with:
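The exact command depends on your setup; assuming the script name used in step 3, it is typically:

```
python rag.py
```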
Note that with the Scaleway cheatsheet example, the CLI answer should be similar, but without the additional explanations about the command performed.
Congratulations! You have built a custom RAG pipeline to improve LLM answers based on specific documentation.
## Going further
- Specialize your RAG pipeline for your use case, such as providing better answers for customer support, finding relevant content through internal documentation, helping users generate more creative and personalized content, and much more.
- Store chat history to increase prompt relevancy.
- Add a complete testing pipeline to test which prompts, models, and retrieval strategies provide a better experience for your users. You can, for instance, leverage [Serverless Jobs](https://www.scaleway.com/en/serverless-jobs/) to do so.
## Troubleshooting
If you encounter any issues, first ensure that you have:
- The necessary [IAM permissions](/identity-and-access-management/iam/reference-content/policy/), specifically **ContainersRegistryFullAccess** and **ContainersFullAccess**
- An [IAM API key capable of interacting with Object Storage](/identity-and-access-management/iam/api-cli/using-api-key-object-storage/)
- Stored the right credentials in your `.env` file, allowing you to connect to your [Managed Database Instance with admin rights](/managed-databases/postgresql-and-mysql/how-to/add-users/)
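For reference, a `.env` file covering these credentials might look like the sketch below. The `SCW_*` Object Storage variables are the ones referenced in the error messages further down; the database variable names are assumptions, so match them to what your scripts actually read:

```
# Object Storage / API credentials
SCW_ACCESS_KEY=<your-api-access-key>
SCW_SECRET_KEY=<your-api-secret-key>
SCW_REGION=fr-par
SCW_BUCKET_NAME=<your-bucket-name>
SCW_BUCKET_ENDPOINT=https://s3.fr-par.scw.cloud

# Managed Database Instance connection (admin user) - names are assumptions
SCW_DB_HOST=<your-database-host>
SCW_DB_PORT=5432
SCW_DB_USER=<admin-username>
SCW_DB_PASSWORD=<admin-password>
SCW_DB_NAME=rdb
```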
Below are some known error messages and their corresponding solutions:
**Error**: `botocore.exceptions.ClientError: An error occurred (SignatureDoesNotMatch) when calling the ListObjectsV2 operation: The request signature we calculated does not match the signature you provided. Check your key and signing method.`
**Solution**: Ensure that your `SCW_BUCKET_NAME`, `SCW_REGION`, `SCW_BUCKET_ENDPOINT`, and `SCW_SECRET_KEY` are properly configured, the corresponding IAM Principal has the necessary rights, and that your [IAM API key can interact with Object Storage](/identity-and-access-management/iam/api-cli/using-api-key-object-storage/).
**Error**: `urllib.error.URLError: <urlopen error [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:992)>`
**Solution**: On macOS, ensure your Python installation has loaded certificates trusted by macOS. You can fix this by opening `Applications/Python 3.X` (where `X` is your version number) and double-clicking `Certificates.command`.
**Error**: `ERROR:root:An error occurred: bge-multilingual-gemma2 is not a local folder and is not a valid model identifier listed on 'https://huggingface.co/models'. If this is a private repository, make sure to pass a token having permission to this repo either by logging in with huggingface-cli login or by passing token=<your_token>`
**Solution**: This is caused by the LangChain OpenAI adapter trying to tokenize content. Ensure you set `check_embedding_ctx_length=False` in the `OpenAIEmbeddings` configuration to avoid tokenizing content, as tokenization will be performed server-side in Generative APIs.