Skip to content

Commit cd5ed06

Browse files
Merge pull request #279985 from shubhirajMsft/patch-11
Add network isolation docs.md
2 parents d43b491 + 64e0f62 commit cd5ed06

File tree

2 files changed

+39
-3
lines changed

2 files changed

+39
-3
lines changed

articles/ai-studio/how-to/model-catalog-overview.md

Lines changed: 20 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -52,7 +52,7 @@ Features | Managed compute | serverless API (pay-as-you-go)
5252
Deployment experience and billing | Model weights are deployed to dedicated Virtual Machines with Managed Online Endpoints. The managed online endpoint, which can have one or more deployments, makes available a REST API for inference. You're billed for the Virtual Machine core hours used by the deployments. | Access to models is through a deployment that provisions an API to access the model. The API provides access to the model hosted and managed by Microsoft, for inference. This mode of access is referred to as "Models as a Service". You're billed for inputs and outputs to the APIs, typically in tokens; pricing information is provided before you deploy.
5353
| API authentication | Keys and Microsoft Entra ID authentication.| Keys only.
5454
Content safety | Use Azure Content Safety service APIs. | Azure AI Content Safety filters are available integrated with inference APIs. Azure AI Content Safety filters may be billed separately.
55-
Network isolation | Configure Managed Network. [Learn more.]( configure-managed-network.md) |
55+
Network isolation | [Configure managed networks for Azure AI Studio hubs.](configure-managed-network.md) | MaaS endpoint will follow your hub's public network access (PNA) flag setting. For more information, see the [Network isolation for models deployed via Serverless APIs](#network-isolation-for-models-deployed-via-serverless-apis) section.
5656

5757
Model | Managed compute | Serverless API (pay-as-you-go)
5858
--|--|--
@@ -145,14 +145,31 @@ Phi-3-mini-128k-instruct <br> Phi-3-medium-4k-instruct <br> Phi-3-medium-128k-in
145145

146146
<!-- docutune:enable -->
147147

148-
### Content safety for models deployed via Serverless API
148+
### Content safety for models deployed via Serverless APIs
149149

150150
[!INCLUDE [Feature preview](~/reusable-content/ce-skilling/azure/includes/ai-studio/includes/feature-preview.md)]
151151

152152
Azure AI Studio implements a default configuration of [Azure AI Content Safety](../../ai-services/content-safety/overview.md) text moderation filters for harmful content (hate, self-harm, sexual, and violence) in language models deployed with MaaS. To learn more about content filtering (preview), see [harm categories in Azure AI Content Safety](../../ai-services/content-safety/concepts/harm-categories.md). Content filtering (preview) occurs synchronously as the service processes prompts to generate content, and you may be billed separately as per [AACS pricing](https://azure.microsoft.com/pricing/details/cognitive-services/content-safety/) for such use. You can disable content filtering for individual serverless endpoints when you first deploy a language model or in the deployment details page by clicking the content filtering toggle. You may be at higher risk of exposing users to harmful content if you turn off content filters.
153153

154+
### Network isolation for models deployed via Serverless APIs
154155

156+
Endpoints for models deployed as Serverless APIs follow the public network access (PNA) flag setting of the AI Studio Hub that has the project in which the deployment exists. To secure your MaaS endpoint, disable the PNA flag on your AI Studio Hub. You can secure inbound communication from a client to your endpoint by using a private endpoint for the hub.
155157

156-
## Next steps
158+
To set the PNA flag for the Azure AI hub:
159+
160+
* Go to the [Azure portal](https://ms.portal.azure.com/)
161+
* Search for the Resource group to which the hub belongs, and select your Azure AI hub from the resources listed for this Resource group.
162+
* On the hub Overview page, use the left navigation pane to go to **Settings** > **Networking**.
163+
* Under the __Public access__ tab, you can configure settings for the public network access flag.
164+
* Save your changes. Your changes might take up to five minutes to propagate.
165+
166+
#### Limitations
167+
168+
* If you have an AI Studio hub with a private endpoint created before July 11, 2024, new MaaS endpoints added to projects in this hub won't follow the networking configuration of the hub. Instead, you need to create a new private endpoint for the hub and create new serverless API deployments in the project so that the new deployments can follow the hub's networking configuration.
169+
* If you have an AI studio hub with MaaS deployments created before July 11, 2024, and you enable a private endpoint on this hub, the existing MaaS deployments won't follow the hub's networking configuration. For serverless API deployments in the hub to follow the hub's networking configuration, you need to create the deployments again.
170+
* Currently [On Your Data](#rag-with-models-deployed-as-serverless-apis) support isn't available for MaaS deployments in private hubs, since private hubs have the PNA flag disabled.
171+
* Any network configuration change (for example, enabling or disabling the PNA flag) might take up to five minutes to propagate.
172+
173+
## Next step
157174

158175
- [Explore Azure AI foundation models in Azure AI Studio](models-foundation-azure-ai.md)

articles/machine-learning/concept-model-catalog.md

Lines changed: 19 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -157,6 +157,25 @@ For language models deployed to MaaS, Azure Machine Learning implements a defaul
157157

158158
Content filtering (preview) occurs synchronously as the service processes prompts to generate content, and you might be billed separately as per [AACS pricing](https://azure.microsoft.com/pricing/details/cognitive-services/content-safety/) for such use. You can disable content filtering (preview) for individual serverless endpoints either at the time when you first deploy a language model or in the deployment details page by selecting the content filtering toggle. If you use a model in MaaS via an API other than the [Azure AI Model Inference API](../ai-studio/reference/reference-model-inference-api.md), content filtering isn't enabled unless you implement it separately by using [Azure AI Content Safety](../ai-services/content-safety/quickstart-text.md). If you use a model in MaaS without content filtering, you run a higher risk of exposing users to harmful content.
159159

160+
### Network isolation for models deployed via Serverless APIs
161+
162+
Endpoints for models deployed as Serverless APIs follow the public network access (PNA) flag setting of the workspace in which the deployment exists. To secure your MaaS endpoint, disable the PNA flag on your workspace. You can secure inbound communication from a client to your endpoint by using a private endpoint for the workspace.
163+
164+
To set the PNA flag for the workspace:
165+
166+
* Go to the [Azure portal](https://ms.portal.azure.com/).
167+
* Search for _Azure Machine Learning_, and select your workspace from the list of workspaces.
168+
* On the Overview page, use the left navigation pane to go to **Settings** > **Networking**.
169+
* Under the **Public access** tab, you can configure settings for the public network access flag.
170+
* Save your changes. Your changes might take up to five minutes to propagate.
171+
172+
#### Limitations
173+
174+
* If you have a workspace with a private endpoint created before July 11, 2024, new MaaS endpoints added to this workspace won't follow its networking configuration. Instead, you need to create a new private endpoint for the workspace and create new serverless API deployments in the workspace so that the new deployments can follow the workspace's networking configuration.
175+
* If you have a workspace with MaaS deployments created before July 11, 2024, and you enable a private endpoint on this workspace, the existing MaaS deployments won't follow the workspace's networking configuration. For serverless API deployments in the workspace to follow the workspace's configuration, you need to create the deployments again.
176+
* Currently [On Your Data](#rag-with-models-deployed-through-maas) support isn't available for MaaS deployments in private workspaces, since private workspaces have the PNA flag disabled.
177+
* Any network configuration change (for example, enabling or disabling the PNA flag) might take up to five minutes to propagate.
178+
160179
## Learn more
161180

162181
* Learn [how to use foundation Models in Azure Machine Learning](./how-to-use-foundation-models.md) for fine-tuning, evaluation, and deployment using Azure Machine Learning studio UI or code based methods.

0 commit comments

Comments
 (0)