Skip to content

Commit 4668d55

Browse files
committed
Update k8s compute TSG and log info
1 parent fce59e0 commit 4668d55

File tree

1 file changed

+14
-9
lines changed

1 file changed

+14
-9
lines changed

articles/machine-learning/how-to-troubleshoot-kubernetes-extension.md

Lines changed: 14 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -250,15 +250,20 @@ volcano-scheduler.conf: |
250250
- name: nodeorder
251251
- name: binpack
252252
```
253-
254-
And you need to skip the resource validation when install the extension by configuring `--config amloperator.skipResourceValidation=true`.
255-
256-
[!NOTE]
257-
> Since the gang plugin is removed, there's potential that the deadlock happens when volcano schedules the job.
258-
>
259-
> * To avoid this situation, you can **use same instance type across the jobs**.
260-
>
261-
> Note that you need to disable `job/validate` webhook in the volcano admission if your **volcano version is lower than 1.6**.
253+
254+
To use this config after you install the Azureml extension with the configuration setting of `installVolcano=true`, you need to follow the steps below:
255+
1. Create a configmap file with the above config in the azureml namespace. This namespace will generally be created when you install the AzureML extension.
256+
1. Set `volcanoSchedulerConfig=<configmap name>` in the extension config to apply this configmap. And you need to skip the resource validation when install the extension by configuring `amloperator.skipResourceValidation=true`. For example:
257+
```azurecli
258+
az k8s-extension update --name <extension-name> --extension-type Microsoft.AzureML.Kubernetes --config volcanoSchedulerConfig=<configmap name> amloperator.skipResourceValidation=true --cluster-type managedClusters --cluster-name <your-AKS-cluster-name> --resource-group <your-RG-name> --scope cluster
259+
```
260+
261+
> [!NOTE]
262+
> Since the gang plugin is removed, there's potential that the deadlock happens when volcano schedules the job.
263+
>
264+
> * To avoid this situation, you can **use same instance type across the jobs**.
265+
>
266+
> Note that you need to disable `job/validate` webhook in the volcano admission if your **volcano version is lower than 1.6**.
262267
263268
264269

0 commit comments

Comments
 (0)