Skip to content

Commit 094f45d

Browse files
Merge branch 'main' into apic-devproxy-articles
2 parents 00d29ab + fc4ae2a commit 094f45d

File tree

212 files changed

+2885
-1525
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

212 files changed

+2885
-1525
lines changed

.openpublishing.publish.config.json

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1225,6 +1225,12 @@
12251225
"url": "https://github.com/MicrosoftDocs/SupportArticles-docs",
12261226
"branch": "main",
12271227
"branch_mapping": {}
1228+
},
1229+
{
1230+
"path_to_root": "msdocs-python-fastapi-webapp-quickstart",
1231+
"url": "https://github.com/Azure-Samples/msdocs-python-fastapi-webapp-quickstart",
1232+
"branch": "main",
1233+
"branch_mapping": {}
12281234
}
12291235
],
12301236
"branch_target_mapping": {

.openpublishing.redirection.baremetal-infrastructure.json

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -134,6 +134,11 @@
134134
"source_path_from_root": "/articles/baremetal-infrastructure/workloads/sap/baremetal-overview-architecture.md",
135135
"redirect_url": "/azure/baremetal-infrastructure/concepts-baremetal-infrastructure-overview",
136136
"redirect_document_id": false
137-
}
137+
},
138+
{
139+
"source_path": "articles/baremetal-infrastructure/workloads/nc2-on-azure/solution-design.md",
140+
"redirect_url": "/azure/baremetal-infrastructure/workloads/nc2-on-azure/architecture",
141+
"redirect_document_id": false
142+
}
138143
]
139144
}

.openpublishing.redirection.json

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1189,6 +1189,16 @@
11891189
"redirect_url": "/azure/azure-maps/",
11901190
"redirect_document_id": true
11911191
},
1192+
{
1193+
"source_path_from_root": "/articles/azure-maps/tutorial-creator-indoor-maps.md",
1194+
"redirect_url": "/azure/azure-maps/creator-indoor-maps",
1195+
"redirect_document_id": false
1196+
},
1197+
{
1198+
"source_path_from_root": "/articles/azure-maps/about-creator.md",
1199+
"redirect_url": "/azure/azure-maps/creator-indoor-maps",
1200+
"redirect_document_id": true
1201+
},
11921202
{
11931203
"source_path_from_root": "/articles/azure-maps/search-categories.md",
11941204
"redirect_url": "/azure/azure-maps/supported-search-categories",

.openpublishing.redirection.operator-nexus.json

Lines changed: 6 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -29,6 +29,11 @@
2929
"source_path_from_root": "/articles/operator-nexus/template-cloud-native-network-function-deployment.md",
3030
"redirect_url": "/azure/operator-nexus/quickstarts-kubernetes-cluster-deployment-cli",
3131
"redirect_document_id": false
32+
},
33+
{
34+
"source_path_from_root": "/articles/operator-nexus/concepts-nexus-network.md",
35+
"redirect_url": "/azure/operator-nexus/concepts-nexus-networking",
36+
"redirect_document_id": false
3237
}
3338
]
34-
}
39+
}

articles/ai-services/openai/how-to/deployment-types.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -33,7 +33,7 @@ Azure OpenAI offers three types of deployments. These provide a varied level of
3333
| **Best suited for** | Applications that don’t require data residency. Recommended starting place for customers. | For customers with data residency requirements. Optimized for low to medium volume. | Real-time scoring for large consistent volume. Includes the highest commitments and limits.|
3434
| **How it works** | Traffic may be routed anywhere in the world | | |
3535
| **Getting started** | [Model deployment](./create-resource.md) | [Model deployment](./create-resource.md) | [Provisioned onboarding](./provisioned-throughput-onboarding.md) |
36-
| **Cost** | [Baseline](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) | [Regional Pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) | May experience cost savings for consistent usage |
36+
| **Cost** | [Global deployment pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) | [Regional pricing](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) | May experience cost savings for consistent usage |
3737
| **What you get** | Easy access to all new models with highest default pay-per-call limits.<br><br> Customers with high volume usage may see higher latency variability | Easy access with [SLA on availability](https://azure.microsoft.com/support/legal/sla/). Optimized for low to medium volume workloads with high burstiness. <br><br>Customers with high consistent volume may experience greater latency variability. | Regional access with very high & predictable throughput. Determine throughput per PTU using the provided [capacity calculator](./provisioned-throughput-onboarding.md#estimate-provisioned-throughput-and-cost) |
3838
| **What you don’t get** | ❌Data residency guarantees | ❌High volume w/consistent low latency | ❌Pay-per-call flexibility |
3939
| **Per-call Latency** | Optimized for real-time calling & low to medium volume usage. Customers with high volume usage may see higher latency variability. Threshold set per model | Optimized for real-time calling & low to medium volume usage. Customers with high volume usage may see higher latency variability. Threshold set per model | Optimized for real-time. |

articles/ai-studio/how-to/model-catalog-overview.md

Lines changed: 9 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -47,12 +47,12 @@ Some models in the **Curated by Azure AI** and **Open models from the Hugging Fa
4747
Model Catalog offers two distinct ways to deploy models from the catalog for your use: managed compute and serverless APIs. The deployment options available for each model vary; learn more about the features of the deployment options, and the options available for specific models, in the following tables. Learn more about [data processing]( concept-data-privacy.md) with the deployment options.
4848
<!-- docutune:disable -->
4949

50-
Features | Managed compute | serverless API (pay-as-you-go)
50+
Features | Managed compute | Serverless API (pay-as-you-go)
5151
--|--|--
52-
Deployment experience and billing | Model weights are deployed to dedicated Virtual Machines with Managed Online Endpoints. The managed online endpoint, which can have one or more deployments, makes available a REST API for inference. You're billed for the Virtual Machine core hours used by the deployments. | Access to models is through a deployment that provisions an API to access the model. The API provides access to the model hosted and managed by Microsoft, for inference. This mode of access is referred to as "Models as a Service". You're billed for inputs and outputs to the APIs, typically in tokens; pricing information is provided before you deploy.
52+
Deployment experience and billing | Model weights are deployed to dedicated Virtual Machines with Managed Online Endpoints. The managed online endpoint, which can have one or more deployments, makes available a REST API for inference. You're billed for the Virtual Machine core hours used by the deployments. | Access to models is through a deployment that provisions an API to access the model. The API provides access to the model hosted and managed by Microsoft, for inference. You're billed for inputs and outputs to the APIs, typically in tokens; pricing information is provided before you deploy.
5353
| API authentication | Keys and Microsoft Entra ID authentication.| Keys only.
54-
Content safety | Use Azure Content Safety service APIs. | Azure AI Content Safety filters are available integrated with inference APIs. Azure AI Content Safety filters may be billed separately.
55-
Network isolation | [Configure managed networks for Azure AI Studio hubs.](configure-managed-network.md) | MaaS endpoint will follow your hub's public network access (PNA) flag setting. For more information, see the [Network isolation for models deployed via Serverless APIs](#network-isolation-for-models-deployed-via-serverless-apis) section.
54+
Content safety | Use Azure Content Safety service APIs. | Azure AI Content Safety filters are available integrated with inference APIs. Azure AI Content Safety filters is billed separately.
55+
Network isolation | [Configure managed networks for Azure AI Studio hubs.](configure-managed-network.md) | Endpoints will follow your hub's public network access (PNA) flag setting. For more information, see the [Network isolation for models deployed via Serverless APIs](#network-isolation-for-models-deployed-via-serverless-apis) section.
5656

5757
Model | Managed compute | Serverless API (pay-as-you-go)
5858
--|--|--
@@ -100,25 +100,19 @@ Prompt flow offers a great experience for prototyping. You can use models deploy
100100

101101
## Serverless APIs with Pay-as-you-go billing
102102

103-
Certain models in the Model Catalog can be deployed as serverless APIs with pay-as-you-go billing; this method of deployment is called Models-as-a Service (MaaS), providing a way to consume them as an API without hosting them on your subscription. Models available through MaaS are hosted in infrastructure managed by Microsoft, which enables API-based access to the model provider's model. API based access can dramatically reduce the cost of accessing a model and significantly simplify the provisioning experience. Most MaaS models come with token-based pricing.
103+
Certain models in the Model Catalog can be deployed as serverless APIs with pay-as-you-go billing, providing a way to consume them as an API without hosting them on your subscription. Models are hosted in infrastructure managed by Microsoft, which enables API-based access to the model provider's model. API based access can dramatically reduce the cost of accessing a model and significantly simplify the provisioning experience.
104104

105-
### How are third-party models made available in MaaS?
105+
Models that are available for deployment as serverless APIs with pay-as-you-go billing are offered by the model provider but hosted in Microsoft-managed Azure infrastructure and accessed via API. Model providers define the license terms and set the price for use of their models, while Azure Machine Learning service manages the hosting infrastructure, makes the inference APIs available, and acts as the data processor for prompts submitted and content output by models deployed via MaaS. Learn more about data processing for MaaS at the [data privacy](concept-data-privacy.md) article.
106106

107107
:::image type="content" source="../media/explore/model-publisher-cycle.png" alt-text="A diagram showing model publisher service cycle." lightbox="../media/explore/model-publisher-cycle.png":::
108108

109-
Models that are available for deployment as serverless APIs with pay-as-you-go billing are offered by the model provider but hosted in Microsoft-managed Azure infrastructure and accessed via API. Model providers define the license terms and set the price for use of their models, while Azure Machine Learning service manages the hosting infrastructure, makes the inference APIs available, and acts as the data processor for prompts submitted and content output by models deployed via MaaS. Learn more about data processing for MaaS at the [data privacy](concept-data-privacy.md) article.
110-
111-
### Pay for model usage in MaaS
109+
### Billing
112110

113111
The discovery, subscription, and consumption experience for models deployed via MaaS is in the Azure AI Studio and Azure Machine Learning studio. Users accept license terms for use of the models, and pricing information for consumption is provided during deployment. Models from third party providers are billed through Azure Marketplace, in accordance with the [Commercial Marketplace Terms of Use](/legal/marketplace/marketplace-terms); models from Microsoft are billed using Azure meters as First Party Consumption Services. As described in the [Product Terms](https://www.microsoft.com/licensing/terms/welcome/welcomepage), First Party Consumption Services are purchased using Azure meters but aren't subject to Azure service terms; use of these models is subject to the license terms provided.
114112

115-
### Deploy models for inference through MaaS
116-
117-
Deploying a model through MaaS allows users to get access to ready to use inference APIs without the need to configure infrastructure or provision GPUs, saving engineering time and resources. These APIs can be integrated with several LLM tools and usage is billed as described in the previous section.
118-
119-
### Fine-tune models through MaaS with Pay-as-you-go
113+
### Fine-tune models
120114

121-
For models that are available through MaaS and support fine-tuning, users can take advantage of hosted fine-tuning with pay-as-you-go billing to tailor the models using data they provide. For more information, see the [fine-tuning overview](../concepts/fine-tuning-overview.md).
115+
Certain models support also serverless fine-tuning where users can take advantage of hosted fine-tuning with pay-as-you-go billing to tailor the models using data they provide. For more information, see the [fine-tuning overview](../concepts/fine-tuning-overview.md).
122116

123117
### RAG with models deployed as serverless APIs
124118

articles/aks/TOC.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -182,7 +182,7 @@
182182
href: concepts-sustainable-software-engineering.md
183183
- name: Preview API life cycle
184184
href: concepts-preview-api-life-cycle.md
185-
- name: Dapr
185+
- name: Dapr extension
186186
href: dapr-overview.md
187187
- name: GitOps
188188
href: ../azure-arc/kubernetes/conceptual-gitops-flux2.md?toc=/azure/aks/toc.json&bc=/azure/aks/breadcrumb/toc.json
@@ -879,7 +879,7 @@
879879
href: deploy-application-az-cli.md
880880
- name: Dapr
881881
items:
882-
- name: Create the Dapr extension
882+
- name: Install the Dapr extension
883883
href: dapr.md
884884
- name: Configure the Dapr extension
885885
href: dapr-settings.md

articles/aks/dapr-overview.md

Lines changed: 9 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -1,12 +1,12 @@
11
---
2-
title: Dapr extension for Azure Kubernetes Service (AKS) overview
2+
title: Dapr extension for Azure Kubernetes Service (AKS) and Arc-enabled Kubernetes
33
description: Learn more about using Dapr on your Azure Kubernetes Service (AKS) cluster to develop applications.
44
ms.author: nickoman
55
ms.topic: overview
66
ms.date: 04/22/2024
77
---
88

9-
# Dapr
9+
# Dapr extension for Azure Kubernetes Service (AKS) and Arc-enabled Kubernetes
1010

1111
[Distributed Application Runtime (Dapr)][dapr-docs] offers APIs that help you write and implement simple, portable, resilient, and secured microservices. Dapr APIs run as a sidecar process in tandem with your applications and abstract away common complexities you may encounter when building distributed applications, such as:
1212
- Service discovery
@@ -19,7 +19,7 @@ Dapr is incrementally adoptable. You can use any of the API building blocks as n
1919

2020
## Capabilities and features
2121

22-
[Using the Dapr extension to provision Dapr on your AKS or Arc-enabled Kubernetes cluster](../azure-arc/kubernetes/conceptual-extensions.md) eliminates the overhead of:
22+
[Using the Dapr extension to provision Dapr on your AKS or Arc-enabled Kubernetes cluster][dapr-create-extension] eliminates the overhead of:
2323
- Downloading Dapr tooling
2424
- Manually installing and managing the Dapr runtime on your AKS cluster
2525

@@ -63,7 +63,7 @@ Microsoft provides best-effort support for [the latest version of Dapr and two p
6363
- 1.12.x
6464
- 1.11.x
6565

66-
You can run Azure CLI commands to retreive a list of available versions in [a cluster](/cli/azure/k8s-extension/extension-types#az-k8s-extension-extension-types-list-versions-by-cluster) or [a location](/cli/azure/k8s-extension/extension-types#az-k8s-extension-extension-types-list-versions-by-location).
66+
You can run Azure CLI commands to retrieve a list of available versions in [a cluster](/cli/azure/k8s-extension/extension-types#az-k8s-extension-extension-types-list-versions-by-cluster) or [a location](/cli/azure/k8s-extension/extension-types#az-k8s-extension-extension-types-list-versions-by-location).
6767

6868
To view a list of the stable Dapr versions available to your managed AKS cluster, run the following command:
6969

@@ -221,7 +221,9 @@ If you install Dapr through the AKS extension, our recommendation is to continue
221221

222222
## Next Steps
223223

224-
After learning about Dapr and some of the challenges it solves, try [Deploying an application with the Dapr cluster extension][dapr-quickstart].
224+
> [!div class="nextstepaction"]
225+
> [Walk through the Dapr extension quickstart to demo how it works][dapr-quickstart]
226+
225227

226228
<!-- Links Internal -->
227229
[csi-secrets-store]: ./csi-secrets-store-driver.md
@@ -231,6 +233,7 @@ After learning about Dapr and some of the challenges it solves, try [Deploying a
231233
[dapr-migration]: ./dapr-migration.md
232234
[aks-msi]: ./use-managed-identity.md
233235
[dapr-configuration-options]: ./dapr-settings.md
236+
[dapr-create-extension]: ./dapr.md
234237

235238
<!-- Links External -->
236239
[dapr-docs]: https://docs.dapr.io/
@@ -248,4 +251,4 @@ After learning about Dapr and some of the challenges it solves, try [Deploying a
248251
[dapr-subscriptions]: https://docs.dapr.io/developing-applications/building-blocks/pubsub/subscription-methods/#declarative-subscriptions
249252
[dapr-supported-version]: https://docs.dapr.io/operations/support/support-release-policy/
250253
[dapr-observability]: https://docs.dapr.io/operations/observability/
251-
[dapr-alpha-beta]: https://docs.dapr.io/operations/support/alpha-beta-apis/
254+
[dapr-alpha-beta]: https://docs.dapr.io/operations/support/alpha-beta-apis/

articles/aks/dapr-settings.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -374,7 +374,9 @@ The Dapr extension requires the following outbound URLs on `https://:443` to fun
374374

375375
## Next Steps
376376

377-
Once you successfully provisioned Dapr in your AKS cluster, try deploying a [sample application][sample-application].
377+
- [Walk through the tutorial for deploying Dapr Workflow via the extension][dapr-workflow]
378+
- [Determine if you need to migrate from Dapr open source to the Dapr extension][dapr-migration].
379+
378380

379381
<!-- LINKS INTERNAL -->
380382
[deploy-cluster]: ./tutorial-kubernetes-deploy-cluster.md
@@ -389,7 +391,7 @@ Once you successfully provisioned Dapr in your AKS cluster, try deploying a [sam
389391
[dapr-migration]: ./dapr-migration.md
390392
[dapr-settings]: ./dapr-settings.md
391393
[aks-azurelinux]: ./cluster-configuration.md#azure-linux-container-host-for-aks
392-
394+
[dapr-workflow]: ./dapr-workflow.md
393395

394396
<!-- LINKS EXTERNAL -->
395397
[kubernetes-production]: https://docs.dapr.io/operations/hosting/kubernetes/kubernetes-production

articles/aks/dapr-workflow.md

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -185,7 +185,8 @@ Notice that the workflow status is marked as completed.
185185

186186
## Next steps
187187

188-
[Learn how to add configuration settings to the Dapr extension on your AKS cluster][dapr-config].
188+
- [Configure the Dapr extension on your AKS cluster][dapr-config].
189+
- [Determine if you need to migrate from Dapr open source to the Dapr extension][dapr-migration].
189190

190191
<!-- Links Internal -->
191192
[deploy-cluster]: ./tutorial-kubernetes-deploy-cluster.md
@@ -194,6 +195,7 @@ Notice that the workflow status is marked as completed.
194195
[cluster]: ./tutorial-kubernetes-deploy-cluster.md
195196
[k8s-sp]: ./dapr.md#register-the-kubernetesconfiguration-resource-provider
196197
[dapr-config]: ./dapr-settings.md
198+
[dapr-migration]: ./dapr-migration.md
197199
[az-cloud-shell]: ./learn/quick-kubernetes-deploy-powershell.md#azure-cloud-shell
198200
[kubectl]: ./tutorial-kubernetes-deploy-cluster.md#connect-to-cluster-using-kubectl
199201

0 commit comments

Comments
 (0)