Skip to content

Commit e77b66c

Browse files
author
Larry
committed
edits for clarity
1 parent 7b836d1 commit e77b66c

File tree

1 file changed

+7
-7
lines changed

1 file changed

+7
-7
lines changed

articles/machine-learning/how-to-deploy-azure-kubernetes-service.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -234,22 +234,22 @@ Analyze and promote model versions in a controlled fashion using endpoints. You
234234
* Configure the __percentage of scoring traffic sent to each endpoint__. For example, route 20% of the traffic to endpoint 'test' and 80% to 'production'.
235235

236236
> [!NOTE]
237-
> If you do not account for 100% of the traffic, any remaining percentage is routed to the __default__ endpoint. For example, if you configure endpoint 'test' to get 10% of the traffic, the remaining 90% is sent to the default endpoint.
237+
> If you do not account for 100% of the traffic, any remaining percentage is routed to the __default__ endpoint version. For example, if you configure endpoint version 'test' to get 10% of the traffic, and 'prod' for 30%, the remaining 60% is sent to the default endpoint version.
238238
>
239-
> The first endpoint created is automatically configured as the default. You can change this by setting `is_default=True` when creating or updating an endpoint version.
239+
> The first endpoint version created is automatically configured as the default. You can change this by setting `is_default=True` when creating or updating an endpoint version.
240240
241-
* Tag an endpoint as either __control__ or __treatment__. For example, the current production endpoint might be the control, while potential new models are deployed as treatments. After evaluating performance of the treatment endpoints, if one outperforms the current control, it might be promoted to the new production/control.
241+
* Tag an endpoint version as either __control__ or __treatment__. For example, the current production endpoint version might be the control, while potential new models are deployed as treatment versions. After evaluating performance of the treatment versions, if one outperforms the current control, it might be promoted to the new production/control.
242242

243243
> [!NOTE]
244-
> You can only have __one__ control endpoint. You can have multiple treatments.
244+
> You can only have __one__ control. You can have multiple treatments.
245245
246246
You can enable app insights to view operational metrics of endpoints and deployed versions.
247247

248248
### Create an endpoint
249249
Once you are ready to deploy your models, create a scoring endpoint and deploy your first version. The following example shows how to deploy and create the endpoint using the SDK. The first deployment will be defined as the default version, which means that unspecified traffic percentile across all versions will go to the default version.
250250

251251
> [!TIP]
252-
> In the following example, the configuration sets this endpoint to handle 20% of the traffic. Since this is the first endpoint, it's also the default endpoint. And since we don't have any other endpoints for the other 80% of traffic, it is routed to the default endpoint. Until other endpoints that take a percentage of traffic are deployed, this one effectively receives 100% of the traffic.
252+
> In the following example, the configuration sets the initial endpoint version to handle 20% of the traffic. Since this is the first endpoint, it's also the default version. And since we don't have any other versions for the other 80% of traffic, it is routed to the default as well. Until other versions that take a percentage of traffic are deployed, this one effectively receives 100% of the traffic.
253253
254254
```python
255255
import azureml.core,
@@ -280,7 +280,7 @@ endpoint_deployment_config = AksEndpoint.deploy_configuration(cpu_cores = 0.1, m
280280
Add another version to your endpoint and configure the scoring traffic percentile going to the version. There are two types of versions, a control and a treatment version. There can be multiple treatment versions to help compare against a single control version.
281281

282282
> [!TIP]
283-
> The previous endpoint is set to receive 20% of the traffic. It's also the default, so it gets any leftover traffic not handled by other endpoints. The second endpoint version created by the next code snippet accepts 10% of traffic. After it is created, the first endpoint version is configured for 20% of the traffic and the new version for 10%. The remaining 70% is sent to the first endpoint version, because it is also the default version.
283+
> The second version, created by the following code snippet, accepts 10% of traffic. The first version is configured for 20%, so only 30% of the traffic is configured for specific versions. The remaining 70% is sent to the first endpoint version, because it is also the default version.
284284
285285
```python
286286
from azureml.core.webservice import AksEndpoint
@@ -299,7 +299,7 @@ endpoint.wait_for_deployment(True)
299299
Update existing versions or delete them in an endpoint. You can change the version's default type, control type, and the traffic percentile. In the following example, the second version increases its traffic to 40% and is now the default.
300300

301301
> [!TIP]
302-
> Since the second version is now default, and is configured for 40%, while the original version is configured for 20%. This means that 40% of traffic is not accounted for by version configurations, and will also be routed to the second version. It effectively receives 80% of the traffic.
302+
> After the following code snippet, the second version is now default. It is now configured for 40%, while the original version is still configured for 20%. This means that 40% of traffic is not accounted for by version configurations. The leftover traffic will be routed to the second version, because it is now default. It effectively receives 80% of the traffic.
303303
304304
```python
305305
from azureml.core.webservice import AksEndpoint

0 commit comments

Comments
 (0)