Commit 5ccc3c2

Merge branch 'main' into release-preview-new-mistral-models
2 parents ffd1991 + 1423cf2 commit 5ccc3c2

12 files changed: +1518 -24 lines changed

articles/ai-services/openai/how-to/evaluations.md

Lines changed: 1 addition & 0 deletions
@@ -22,6 +22,7 @@ Azure OpenAI evaluation enables developers to create evaluation runs to test aga

 ### Regional availability

+- East US2
 - North Central US
 - Sweden Central
 - Switzerland West

articles/ai-services/openai/how-to/fine-tuning.md

Lines changed: 0 additions & 3 deletions
@@ -26,9 +26,6 @@ In contrast to few-shot learning, fine tuning improves the model by training on

 We use LoRA, or low rank approximation, to fine-tune models in a way that reduces their complexity without significantly affecting their performance. This method works by approximating the original high-rank matrix with a lower rank one, thus only fine-tuning a smaller subset of *important* parameters during the supervised training phase, making the model more manageable and efficient. For users, this makes training faster and more affordable than other techniques.

-> [!NOTE]
-> Azure OpenAI currently only supports text-to-text fine-tuning for all supported models including GPT-4o mini.
-
 ::: zone pivot="programming-language-studio"

 [!INCLUDE [Azure OpenAI Studio fine-tuning](../includes/fine-tuning-unified.md)]
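
The context paragraph in this hunk describes LoRA as training a low-rank stand-in for a large weight matrix. As a rough illustration only, the sketch below assumes the commonly described formulation in which a frozen `d x k` weight `W` receives a trainable update `B @ A` of rank `r`. The function names, the 4096 dimensions, and the rank 8 are hypothetical and are not taken from the Azure OpenAI documentation or service.

```python
"""Toy illustration of a low-rank weight update; not Azure OpenAI's implementation."""


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def lora_param_counts(d, k, r):
    """Return (full, low_rank) trainable-parameter counts for a d x k weight.

    Full fine-tuning trains every entry of W (d * k values). The assumed
    low-rank scheme trains only B (d x r) and A (r x k), i.e. r * (d + k) values.
    """
    return d * k, r * (d + k)


def apply_low_rank_update(w, b, a):
    """Return W + B @ A, leaving the original W untouched."""
    delta = matmul(b, a)
    return [
        [w_ij + d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


if __name__ == "__main__":
    # Illustrative sizes only: one 4096 x 4096 layer with rank 8.
    full, low = lora_param_counts(d=4096, k=4096, r=8)
    print(f"full fine-tuning: {full} trainable values")
    print(f"rank-8 update:    {low} trainable values")
```

Under these assumed sizes the low-rank update trains 65,536 values instead of 16,777,216, which is the kind of reduction the paragraph alludes to when it says training becomes faster and more affordable.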

articles/ai-services/openai/how-to/realtime-audio.md

Lines changed: 1 addition & 0 deletions
@@ -161,4 +161,5 @@ An example `session.update` that configures several aspects of the session, incl
 ## Related content

 * Try the [real-time audio quickstart](../realtime-audio-quickstart.md)
+* See the [Realtime API reference](../realtime-audio-reference.md)
 * Learn more about Azure OpenAI [quotas and limits](../quotas-limits.md)

articles/ai-services/openai/realtime-audio-quickstart.md

Lines changed: 1 addition & 0 deletions
@@ -128,4 +128,5 @@ You can run the sample code locally on your machine by following these steps. Re
 ## Related content

 * Learn more about [How to use the Realtime API](./how-to/realtime-audio.md)
+* See the [Realtime API reference](./realtime-audio-reference.md)
 * Learn more about Azure OpenAI [quotas and limits](quotas-limits.md)

articles/ai-services/openai/realtime-audio-reference.md

Lines changed: 1487 additions & 0 deletions
Large diffs are not rendered by default.

articles/ai-services/openai/toc.yml

Lines changed: 2 additions & 0 deletions
@@ -348,6 +348,8 @@ items:
 displayName: RAG, rag
 - name: Azure OpenAI monitoring data reference
 href: monitor-openai-reference.md
+- name: Realtime API (preview) WebSocket reference
+href: realtime-audio-reference.md
 - name: Resources
 items:
 - name: Support and help options

articles/ai-studio/concepts/retrieval-augmented-generation.md

Lines changed: 4 additions & 4 deletions
@@ -8,30 +8,30 @@ ms.custom:
 - ignite-2023
 - build-2024
 ms.topic: conceptual
-ms.date: 5/21/2024
+ms.date: 12/12/2024
 ms.reviewer: sgilley
 ms.author: sgilley
 author: sdgilley
 ---

 # Retrieval augmented generation and indexes

-This article talks about the importance and need for Retrieval Augmented Generation (RAG) and index in generative AI.
+This article talks about the importance and need for Retrieval Augmented Generation (RAG) and index in generative AI.

 ## What is RAG?

 Some basics first. Large language models (LLMs) like ChatGPT are trained on public internet data that was available at the point in time when they were trained. They can answer questions related to the data they were trained on. This public data might not be sufficient to meet all your needs. You might want questions answered based on your private data. Or, the public data might simply have gotten out of date. The solution to this problem is Retrieval Augmented Generation (RAG), a pattern used in AI that uses an LLM to generate answers with your own data.

 ## How does RAG work?

-RAG is a pattern that uses your data with an LLM to generate answers specific to your data. When a user asks a question, the data store is searched based on user input. The user question is then combined with the matching results and sent to the LLM using a prompt (explicit instructions to an AI or machine learning model) to generate the desired answer. This can be illustrated as follows.
+RAG is a pattern that uses your data with an LLM to generate answers specific to your data. When a user asks a question, the data store is searched based on user input. The user question is then combined with the matching results and sent to the LLM using a prompt (explicit instructions to an AI or machine learning model) to generate the desired answer. This process can be illustrated as follows.

 :::image type="content" source="../media/index-retrieve/rag-pattern.png" alt-text="Screenshot of the RAG pattern." lightbox="../media/index-retrieve/rag-pattern.png":::


 ## What is an index and why do I need it?

-RAG uses your data to generate answers to the user question. For RAG to work well, we need to find a way to search and send your data in an easy and cost efficient manner to the LLMs. This is achieved by using an index. An index is a data store that allows you to search data efficiently. This is very useful in RAG. An index can be optimized for LLMs by creating vectors (text data converted to number sequences using an embedding model). A good index usually has efficient search capabilities like keyword searches, semantic searches, vector searches or a combination of these. This optimized RAG pattern can be illustrated as follows.
+RAG uses your data to generate answers to the user question. For RAG to work well, we need to find a way to search and send your data in an easy and cost efficient manner to the LLMs. This is achieved by using an index. An index is a data store that allows you to search data efficiently. This index is very useful in RAG. An index can be optimized for LLMs by creating vectors (text data converted to number sequences using an embedding model). A good index usually has efficient search capabilities like keyword searches, semantic searches, vector searches, or a combination of these. This optimized RAG pattern can be illustrated as follows.

 :::image type="content" source="../media/index-retrieve/rag-pattern-with-index.png" alt-text="Screenshot of the RAG pattern with index." lightbox="../media/index-retrieve/rag-pattern-with-index.png":::

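
The article text in this hunk describes the RAG flow in three steps: search a data store with the user's question, combine the question with the matching results, and send the combined prompt to an LLM. The sketch below is one minimal way to picture the first two steps with a vector-style index. Everything in it is an assumption made for illustration: `embed` is a toy bag-of-words stand-in for a real embedding model, `DOCS` is invented sample data, and the final LLM call is deliberately left out. It does not use Azure AI Search or any Azure SDK.

```python
"""Toy retrieval-augmented prompt builder; illustrative only, no Azure services used."""
import math
import re
from collections import Counter


def embed(text):
    """Toy 'embedding': word counts. A real system would call an embedding model."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm = math.sqrt(sum(c * c for c in u.values())) * math.sqrt(
        sum(c * c for c in v.values())
    )
    return dot / norm if norm else 0.0


def build_index(documents):
    """The 'index': each document stored next to its precomputed vector."""
    return [(doc, embed(doc)) for doc in documents]


def retrieve(index, question, top_k=2):
    """Return the top_k documents most similar to the question."""
    q = embed(question)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [doc for doc, _ in ranked[:top_k]]


def build_prompt(question, passages):
    """Combine the user question with the retrieved passages into one prompt."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer the question using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {question}"
    )


# Hypothetical private data, invented for this example.
DOCS = [
    "Our return policy allows refunds within 30 days of purchase.",
    "Support is available by email on weekdays.",
    "Shipping to Europe takes five business days.",
]

if __name__ == "__main__":
    index = build_index(DOCS)
    question = "Can I get refunds after purchase?"
    # The resulting prompt is what would be sent to the LLM (call not shown).
    print(build_prompt(question, retrieve(index, question, top_k=1)))
```

Precomputing the vectors in `build_index` mirrors the article's point that an index makes search efficient: the documents are embedded once, and only the question is embedded at query time.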

articles/ai-studio/index.yml

Lines changed: 2 additions & 2 deletions
@@ -14,7 +14,7 @@ metadata:
 ms.reviewer: sgilley
 ms.author: sgilley
 author: sdgilley
-ms.date: 5/21/2024
+ms.date: 12/12/2024
 # linkListType: architecture | concept | deploy | download | get-started | how-to-guide | learn | overview | quickstart | reference | tutorial | video | whats-new

 landingContent:
@@ -72,7 +72,7 @@ landingContent:
 - linkListType: tutorial
 links:
 - text: Build a custom chat app with the Azure AI SDK
-url: tutorials/copilot-sdk-build-rag.md
+url: tutorials/copilot-sdk-create-resources.md

 - linkListType: concept
 links:

articles/ai-studio/toc.yml

Lines changed: 14 additions & 12 deletions
@@ -309,8 +309,6 @@ items:
 href: how-to/develop/trace-local-sdk.md
 - name: Visualize your traces
 href: how-to/develop/visualize-traces.md
-- name: Continuously monitor your applications
-href: how-to/online-evaluation.md
 - name: Evaluate generative AI apps
 items:
 - name: Evaluations concepts
@@ -341,16 +339,20 @@ items:
 href: concepts/a-b-experimentation.md
 - name: Deploy and monitor generative AI apps
 items:
-- name: Deploy a flow for real-time inference
-href: how-to/flow-deploy.md
-displayName: endpoint
-- name: Enable tracing and collect feedback for a flow deployment
-href: how-to/develop/trace-production-sdk.md
-displayName: code
-- name: Monitor prompt flow deployments
-href: how-to/monitor-quality-safety.md
-- name: Troubleshoot deployments and monitoring
-href: how-to/troubleshoot-deploy-and-monitor.md
+- name: Continuously monitor your applications
+href: how-to/online-evaluation.md
+- name: Deploy and monitor flows
+items:
+- name: Deploy a flow for real-time inference
+href: how-to/flow-deploy.md
+displayName: endpoint
+- name: Enable tracing and collect feedback for a flow deployment
+href: how-to/develop/trace-production-sdk.md
+displayName: code
+- name: Monitor prompt flow deployments
+href: how-to/monitor-quality-safety.md
+- name: Troubleshoot deployments and monitoring
+href: how-to/troubleshoot-deploy-and-monitor.md
 - name: Costs and quotas
 items:
 - name: Plan and manage costs

articles/ai-studio/tutorials/copilot-sdk-build-rag.md

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 ---
-title: "Part 2: Build a ca custom knowledge retrieval (RAG) app with the Azure AI Foundry SDK"
+title: "Part 2: Build a custom knowledge retrieval (RAG) app with the Azure AI Foundry SDK"
 titleSuffix: Azure AI Foundry
 description: Learn how to build a RAG-based chat app using the Azure AI Foundry SDK. This tutorial is part 2 of a 3-part tutorial series.
 manager: scottpolly
