Skip to content

Commit 34035e6

Browse files
committed
Merge branch 'main' of https://github.com/MicrosoftDocs/azure-docs-pr into apiccli
2 parents 6c63e3d + 8022fd7 commit 34035e6

File tree

155 files changed

+2647
-1915
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

155 files changed

+2647
-1915
lines changed

.openpublishing.publish.config.json

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1250,6 +1250,7 @@
12501250
"articles/event-grid/.openpublishing.redirection.event-grid.json",
12511251
"articles/event-hubs/.openpublishing.redirection.event-hubs.json",
12521252
"articles/governance/policy/.openpublishing.redirection.policy.json",
1253+
"articles/governance/policy/.openpublishing.redirection.resource-graph.json",
12531254
"articles/hdinsight/.openpublishing.redirection.hdinsight.json",
12541255
"articles/hdinsight-aks/.openpublishing.redirection.hdinsight-aks.json",
12551256
"articles/healthcare-apis/.openpublishing.redirection.healthcare-apis.json",

articles/ai-services/content-safety/includes/severity-levels.md

Lines changed: 18 additions & 18 deletions
Large diffs are not rendered by default.

articles/ai-services/openai/concepts/content-filter.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -796,6 +796,10 @@ data: {"id":"","object":"","created":0,"model":"","choices":[{"index":0,"finish_
796796
797797
data: [DONE]
798798
```
799+
800+
> [!IMPORTANT]
801+
> When content filtering is triggered for a prompt and a `"status": 400` is received as part of the response there may be a charge for this request as the prompt was evaluated by the service. [Charges will also occur](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/) when a `"status":200` is received with `"finish_reason": "content_filter"`. In this case the prompt did not have any issues, but the completion generated by the model was detected to violate the content filtering rules which results in the completion being filtered.
802+
799803
## Best practices
800804
801805
As part of your application design, consider the following best practices to deliver a positive experience with your application while minimizing potential harms:

articles/ai-services/openai/concepts/models.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -101,7 +101,7 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
101101
**<sup>2</sup>** GPT-4 Turbo with Vision Preview = `gpt-4` (vision-preview). To deploy this model, under **Deployments** select model **gpt-4**. For **Model version** select **vision-preview**.
102102

103103
> [!CAUTION]
104-
> We don't recommend using these models in production. We will upgrade all deployments of these models to a future stable version. Models designated preview do not follow the standard Azure OpenAI model lifecycle.
104+
> We don't recommend using preview models in production. We will upgrade all deployments of preview models to a future stable version. Models designated preview do not follow the standard Azure OpenAI model lifecycle.
105105
106106
> [!NOTE]
107107
> Regions where GPT-4 (0314) & (0613) are listed as available have access to both the 8K and 32K versions of the model
@@ -110,8 +110,8 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
110110

111111
| Model Availability | gpt-4 (0314) | gpt-4 (0613) | gpt-4 (1106-preview) | gpt-4 (vision-preview) |
112112
|---|:---|:---|:---|:---|
113-
| Available to all subscriptions with Azure OpenAI access | | Australia East <br> Canada East <br> France Central <br> Sweden Central <br> Switzerland North | Australia East <br> Canada East <br> East US 2 <br> France Central <br> Norway East <br> South India <br> Sweden Central <br> UK South <br> West US | Switzerland North <br> West US |
114-
| Available to subscriptions with current access to the model version in the region | East US <br> France Central <br> South Central US <br> UK South | East US <br> East US 2 <br> Japan East <br> UK South | | Australia East <br>Sweden Central|
113+
| Available to all subscriptions with Azure OpenAI access | | Australia East <br> Canada East <br> France Central <br> Sweden Central <br> Switzerland North | Australia East <br> Canada East <br> East US 2 <br> France Central <br> Norway East <br> South India <br> Sweden Central <br> UK South <br> West US | Sweden Central <br> Switzerland North <br> West US |
114+
| Available to subscriptions with current access to the model version in the region | East US <br> France Central <br> South Central US <br> UK South | East US <br> East US 2 <br> Japan East <br> UK South | | Australia East |
115115

116116
### GPT-3.5 models
117117

articles/ai-services/openai/includes/dall-e-python.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -64,8 +64,6 @@ Open a command prompt and browse to your project folder. Create a new python fil
6464

6565
## Install the Python SDK
6666

67-
> [!IMPORTANT]
68-
> The latest release of the [OpenAI Python library](https://pypi.org/project/openai/) does not currently support DALL-E when used with Azure OpenAI. To access DALL-E with Azure OpenAI use version `0.28.1`.
6967

7068
Install the OpenAI Python SDK by using the following command:
7169

@@ -77,6 +75,9 @@ pip install openai
7775

7876
#### [DALL-E 2](#tab/dalle2)
7977

78+
> [!IMPORTANT]
79+
> The latest release of the [OpenAI Python library](https://pypi.org/project/openai/) does not currently support DALL-E 2 when used with Azure OpenAI. To access DALL-E 2 with Azure OpenAI use version `0.28.1`. Or, follow the [migration guide](/azure/ai-services/openai/how-to/migration?tabs=python%2Cdalle-fix) to use DALL-E 2 with OpenAI 1.x.
80+
8081
```bash
8182
pip install openai==0.28.1
8283
```

articles/ai-services/openai/quotas-limits.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: conceptual
13-
ms.date: 12/06/2023
13+
ms.date: 01/12/2024
1414
ms.author: mbullwin
1515
---
1616

@@ -107,7 +107,7 @@ The default quota for models varies by model and region. Default quota limits ar
107107
<tr>
108108
<td>gpt-4 (vision-preview)<br>GPT-4 Turbo with Vision</td>
109109
<td>Sweden Central, Switzerland North, Australia East, West US</td>
110-
<td>10 K</td>
110+
<td>30 K</td>
111111
</tr>
112112
<tr>
113113
<td rowspan="2">text-embedding-ada-002</td>

articles/aks/TOC.yml

Lines changed: 18 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -15,20 +15,20 @@
1515
- name: Add-ons, extensions, and other integrations
1616
href: integrations.md
1717
displayName: add-on, extensions, prometheus, grafana, openfaas, spark, istio, linkerd, consul, github, actions
18-
- name: Getting started
19-
items:
20-
- name: Well-architected considerations
21-
href: /azure/architecture/framework/services/compute/azure-kubernetes-service/azure-kubernetes-service?toc=/azure/aks/toc.json&bc=/azure/aks/breadcrumb/toc.json
22-
- name: Plan your implementation
23-
href: /azure/architecture/reference-architectures/containers/aks-start-here?toc=/azure/aks/toc.json&bc=/azure/aks/breadcrumb/toc.json
24-
- name: Optimize costs
25-
href: best-practices-cost.md
26-
- name: Windows vs. Linux containers
27-
href: windows-vs-linux-containers.md
18+
- name: Getting started
19+
items:
20+
- name: Well-architected considerations
21+
href: /azure/architecture/framework/services/compute/azure-kubernetes-service/azure-kubernetes-service?toc=/azure/aks/toc.json&bc=/azure/aks/breadcrumb/toc.json
22+
- name: Plan your implementation
23+
href: /azure/architecture/reference-architectures/containers/aks-start-here?toc=/azure/aks/toc.json&bc=/azure/aks/breadcrumb/toc.json
24+
- name: Windows container considerations
25+
href: windows-vs-linux-containers.md
26+
- name: Optimize costs
27+
href: best-practices-cost.md
2828
- name: Quickstarts
2929
expanded: true
3030
items:
31-
- name: Deploy a Linux-based AKS Cluster
31+
- name: Create a Linux-based AKS Cluster
3232
expanded: true
3333
items:
3434
- name: Use the Azure CLI
@@ -48,15 +48,15 @@
4848
href: learn/quick-kubernetes-deploy-rm-template.md
4949
- name: Use Terraform
5050
href: learn/quick-kubernetes-deploy-terraform.md
51-
- name: Deploy a Windows-based AKS Cluster
51+
- name: Create a Windows-based AKS Cluster
5252
items:
5353
- name: Use the Azure CLI
5454
href: learn/quick-windows-container-deploy-cli.md
5555
- name: Use Azure PowerShell
5656
href: learn/quick-windows-container-deploy-powershell.md
5757
- name: Use the Azure portal
5858
href: learn/quick-windows-container-deploy-portal.md
59-
- name: Develop applications
59+
- name: Work with package managers
6060
expanded: true
6161
items:
6262
- name: Develop with Helm
@@ -346,7 +346,11 @@
346346
- name: Proximity placement groups
347347
href: reduce-latency-ppg.md
348348
- name: Cluster Autoscaler
349-
href: cluster-autoscaler.md
349+
items:
350+
- name: Cluster Autoscaler overview
351+
href: cluster-autoscaler-overview.md
352+
- name: Use the Cluster Autoscaler on AKS
353+
href: cluster-autoscaler.md
350354
- name: Node autoprovision
351355
href: node-autoprovision.md
352356
- name: Availability Zones

articles/aks/availability-zones.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -68,6 +68,9 @@ AKS clusters deployed using availability zones can distribute nodes across multi
6868

6969
If a single zone becomes unavailable, your applications continue to run on clusters configured to spread across multiple zones.
7070

71+
> [!NOTE]
72+
> When implementing **availability zones with the [cluster autoscaler](./cluster-autoscaler-overview.md)**, we recommend using a single node pool for each zone. You can set the `--balance-similar-node-groups` parameter to `True` to maintain a balanced distribution of nodes across zones for your workloads during scale up operations. When this approach isn't implemented, scale down operations can disrupt the balance of nodes across zones.
73+
7174
## Create an AKS cluster across availability zones
7275

7376
When you create a cluster using the [az aks create][az-aks-create] command, the `--zones` parameter specifies the availability zones to deploy agent nodes into. The availability zones that the managed control plane components are deployed into are **not** controlled by this parameter. They are automatically spread across all availability zones (if present) in the region during cluster deployment.

articles/aks/azure-cni-overlay.md

Lines changed: 22 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
---
22
title: Configure Azure CNI Overlay networking in Azure Kubernetes Service (AKS)
3-
description: Learn how to configure Azure CNI Overlay networking in Azure Kubernetes Service (AKS), including deploying an AKS cluster into an existing virtual network and subnet.
3+
description: Learn how to configure Azure CNI Overlay networking in Azure Kubernetes Service (AKS), including deploying an AKS cluster into an existing virtual network and subnets.
44
author: asudbring
55
ms.author: allensu
66
ms.subservice: aks-networking
@@ -17,7 +17,7 @@ With Azure CNI Overlay, the cluster nodes are deployed into an Azure Virtual Net
1717

1818
## Overview of Overlay networking
1919

20-
In Overlay networking, only the Kubernetes cluster nodes are assigned IPs from a subnet. Pods receive IPs from a private CIDR provided at the time of cluster creation. Each node is assigned a `/24` address space carved out from the same CIDR. Extra nodes created when you scale out a cluster automatically receive `/24` address spaces from the same CIDR. Azure CNI assigns IPs to pods from this `/24` space.
20+
In Overlay networking, only the Kubernetes cluster nodes are assigned IPs from subnets. Pods receive IPs from a private CIDR provided at the time of cluster creation. Each node is assigned a `/24` address space carved out from the same CIDR. Extra nodes created when you scale out a cluster automatically receive `/24` address spaces from the same CIDR. Azure CNI assigns IPs to pods from this `/24` space.
2121

2222
A separate routing domain is created in the Azure Networking stack for the pod's private CIDR space, which creates an Overlay network for direct communication between pods. There's no need to provision custom routes on the cluster subnet or use an encapsulation method to tunnel traffic between pods, which provides connectivity performance between pods on par with VMs in a VNet. Workloads running within the pods are not even aware that network address manipulation is happening.
2323

@@ -43,7 +43,7 @@ Like Azure CNI Overlay, Kubenet assigns IP addresses to pods from an address spa
4343

4444
## IP address planning
4545

46-
- **Cluster Nodes**: When setting up your AKS cluster, make sure your VNet subnet has enough room to grow for future scaling. Keep in mind that clusters can't scale across subnets, but you can always add new node pools in another subnet within the same VNet for extra space. A `/24`subnet can fit up to 251 nodes since the first three IP addresses are reserved for management tasks.
46+
- **Cluster Nodes**: When setting up your AKS cluster, make sure your VNet subnets have enough room to grow for future scaling. You can assign each node pool to a dedicated subnet. A `/24`subnet can fit up to 251 nodes since the first three IP addresses are reserved for management tasks.
4747
- **Pods**: The Overlay solution assigns a `/24` address space for pods on every node from the private CIDR that you specify during cluster creation. The `/24` size is fixed and can't be increased or decreased. You can run up to 250 pods on a node. When planning the pod address space, ensure the private CIDR is large enough to provide `/24` address spaces for new nodes to support future cluster expansion.
4848
- When planning IP address space for pods, consider the following factors:
4949
- The same pod CIDR space can be used on multiple independent AKS clusters in the same VNet.
@@ -112,6 +112,25 @@ az aks create -n $clusterName -g $resourceGroup \
112112
--pod-cidr 192.168.0.0/16
113113
```
114114

115+
## Add a new nodepool to a dedicated subnet
116+
117+
After your have created a cluster with Azure CNI Overlay, you can create another nodepool and assign the nodes to a new subnet of the same VNet.
118+
This approach can be usefull if you want to control the ingress or egress IPs of the host from/ towards targets in the same VNET or peered VNets.
119+
120+
```azurecli-interactive
121+
clusterName="myOverlayCluster"
122+
resourceGroup="myResourceGroup"
123+
location="westcentralus"
124+
nodepoolName="newpool1"
125+
subscriptionId=$(az account show --query id -o tsv)
126+
vnetName="yourVnetName"
127+
subnetName="yourNewSubnetName"
128+
subnetResourceId="/subscriptions/$subscriptionId/resourceGroups/$resourceGroup/providers/Microsoft.Network/virtualNetworks/$vnetName/subnets/$subnetName"
129+
az aks nodepool add -g $resourceGroup --cluster-name $clusterName \
130+
--name $nodepoolName --node-count 1 \
131+
--mode system --vnet-subnet-id $subnetResourceId
132+
```
133+
115134
## Upgrade an existing cluster to CNI Overlay
116135

117136
> [!NOTE]

articles/aks/best-practices-performance-scale-large.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,7 @@ Keeping the above considerations in mind, customers are typically able to deploy
7676
Always upgrade your Kubernetes clusters to the latest version. Newer versions contain many improvements that address performance and throttling issues. If you're using an upgraded version of Kubernetes and still see throttling due to the actual load or the number of clients in the subscription, you can try the following options:
7777

7878
* **Analyze errors using AKS Diagnose and Solve Problems**: You can use [AKS Diagnose and Solve Problems](./aks-diagnostics.md) to analyze errors, identity the root cause, and get resolution recommendations.
79-
* **Increase the Cluster Autoscaler scan interval**: If the diagnostic reports show that [Cluster Autoscaler throttling has been detected](/troubleshoot/azure/azure-kubernetes/429-too-many-requests-errors#analyze-and-identify-errors-by-using-aks-diagnose-and-solve-problems), you can [increase the scan interval](./cluster-autoscaler.md#change-the-cluster-autoscaler-settings) to reduce the number of calls to Virtual Machine Scale Sets from the Cluster Autoscaler.
79+
* **Increase the Cluster Autoscaler scan interval**: If the diagnostic reports show that [Cluster Autoscaler throttling has been detected](/troubleshoot/azure/azure-kubernetes/429-too-many-requests-errors#analyze-and-identify-errors-by-using-aks-diagnose-and-solve-problems), you can [increase the scan interval](./cluster-autoscaler.md#update-the-cluster-autoscaler-settings) to reduce the number of calls to Virtual Machine Scale Sets from the Cluster Autoscaler.
8080
* **Reconfigure third-party applications to make fewer calls**: If you filter by *user agents* in the ***View request rate and throttle details*** diagnostic and see that [a third-party application, such as a monitoring application, makes a large number of GET requests](/troubleshoot/azure/azure-kubernetes/429-too-many-requests-errors#analyze-and-identify-errors-by-using-aks-diagnose-and-solve-problems), you can change the settings of these applications to reduce the frequency of the GET calls. Make sure the application clients use exponential backoff when calling Azure APIs.
8181
* **Split your clusters into different subscriptions or regions**: If you have a large number of clusters and node pools that use Virtual Machine Scale Sets, you can split them into different subscriptions or regions within the same subscription. Most Azure API limits are shared at the subscription-region level, so you can move or scale your clusters to different subscriptions or regions to get unblocked on Azure API throttling. This option is especially helpful if you expect your clusters to have high activity. There are no generic guidelines for these limits. If you want specific guidance, you can create a support ticket.
8282

0 commit comments

Comments
 (0)