doc: consolidate installation steps #762

Open

shuynh2017 wants to merge 1 commit into llm-d:main from shuynh2017:doc-install-consolidate

Contributor

shuynh2017 commented Feb 19, 2026

This PR:

consolidates all installation steps in one place: docs/user-guide/installation.md
removes install instructions from README.md and points to docs/user-guide/installation.md
removes install instructions from charts/workload-variant-autoscaler/README.md and points to docs/user-guide/installation.md
provides separate installation steps for the WVA controller and the scale target models in docs/user-guide/installation.md
highlights the importance of updating the global prometheus-adapter and shows how to append to an existing prometheus-adapter config


doc: consolidate install (27848c5)

Contributor Author

shuynh2017 commented Feb 19, 2026

@lionelvillard pls review. Thanks.

shuynh2017 mentioned this pull request

Split the existing helm chart in two #555

Open

mamy-CS reviewed

Collaborator

mamy-CS left a comment

Some comments. Thank you for the cleanup @shuynh2017

charts/workload-variant-autoscaler/README.md

-              wva:
-                controllerInstance: "my-unique-instance-id"
-              ```
+              When running multiple WVA controllers in the same cluster (e.g., for parallel e2e tests or multi-tenant environments), use the `controllerInstance` configuration to prevent metrics conflicts between controllers. See [Multi-Controller Isolation](../../docs/user-guide/multi-controller-isolation.md) for details configuration.

Collaborator

mamy-CS Feb 19, 2026

Suggested change

      
-              When running multiple WVA controllers in the same cluster (e.g., for parallel e2e tests or multi-tenant environments), use the `controllerInstance` configuration to prevent metrics conflicts between controllers. See [Multi-Controller Isolation](../../docs/user-guide/multi-controller-isolation.md) for details configuration.
+              When running multiple WVA controllers in the same cluster (e.g., for parallel e2e tests or multi-tenant environments), use the `controllerInstance` configuration to prevent metrics conflicts between controllers. See [Multi-Controller Isolation](../../docs/user-guide/multi-controller-isolation.md) for detailed configuration.

docs/user-guide/installation.md

+              # - Prometheus and monitoring stack
+              # - vLLM emulator for testing
               # See deploy/kind-emulator/README.md for detailed instructions
-              make deploy-llm-d-wva-emulated-on-kind

Collaborator

mamy-CS Feb 19, 2026

The chart README had a cleanup section that this PR removes. Consider adding cleanup instructions here.
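As a rough starting point, a cleanup step could uninstall the two Helm releases used in the examples in this PR. This is only a sketch: `cleanup_wva` is a hypothetical helper name, the release names `wva-model-a` and `workload-variant-autoscaler` are taken from the install commands in this PR, and the removed chart README section may have covered more resources than this.

```shell
# Hypothetical helper (not part of the chart): uninstall the model release
# first, then the controller release, from the given namespace.
cleanup_wva() {
  ns="${1:?namespace required}"
  # Release name taken from the scale-target install example.
  helm uninstall wva-model-a -n "$ns"
  # Release name taken from the controller install example.
  helm uninstall workload-variant-autoscaler -n "$ns"
}
```

It would be called as `cleanup_wva "$WVA_NS"`. Uninstalling the model release before the controller is a judgment call here, not something the chart is known to require.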

docs/user-guide/installation.md

Comment on lines +74 to +82

+              ```
+              helm upgrade -i wva-model-a ./workload-variant-autoscaler \
+                -n $WVA_NS \
+                --set controller.enabled=false \
+                --set va.enabled=true \
+                --set hpa.enabled=true \
+                --set llmd.namespace=team-a \
+                --set llmd.modelName=my-model-a \
+                --set llmd.modelID="meta-llama/Llama-3.1-8"

Collaborator

mamy-CS Feb 19, 2026

This example is missing some important configuration options that were in the original chart README, such as va.accelerator. Please add the complete example:

helm upgrade -i wva-model-a ./workload-variant-autoscaler \
  -n $WVA_NS \
  --set controller.enabled=false \
  --set va.enabled=true \
  --set hpa.enabled=true \
  --set va.accelerator=L40S \
  --set llmd.namespace=team-a \
  --set llmd.modelName=my-model-a \
  --set llmd.modelID="meta-llama/Llama-3.1-8" \
  --set vllmService.enabled=true \
  --set vllmService.nodePort=30000

docs/user-guide/installation.md

+              export WVA_PROJECT=$PWD
+              helm repo add prometheus-community https://prometheus-community.github.io/helm-charts
+              helm repo update
+              ```

Collaborator

mamy-CS Feb 19, 2026

Maybe add a validation step after step 1 to verify that the CA cert file was created, to avoid issues?
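Something along these lines might do, as a minimal sketch. It assumes the CA cert is written to `/tmp/prometheus-ca.crt` (the path passed to `--set-file` in the controller install step) and that it is PEM-encoded; `check_ca_cert` is a hypothetical helper name.

```shell
# Hypothetical helper: check that the CA file exists, is non-empty,
# and contains a PEM certificate header.
check_ca_cert() {
  cert_file="${1:-/tmp/prometheus-ca.crt}"
  if [ ! -s "$cert_file" ]; then
    echo "missing or empty: $cert_file" >&2
    return 1
  fi
  # "--" stops option parsing so the leading dashes are read as the pattern.
  if ! grep -q -- '-----BEGIN CERTIFICATE-----' "$cert_file"; then
    echo "not a PEM certificate: $cert_file" >&2
    return 1
  fi
  echo "ok: $cert_file"
}
```

Running `check_ca_cert /tmp/prometheus-ca.crt` right after step 1 would stop early if the extraction produced nothing usable. It only checks for the PEM header and does not validate the certificate itself.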

docs/user-guide/installation.md

+                --set va.enabled=false \
+                --set hpa.enabled=false \
+                --set vllmService.enabled=false
+              ```

Collaborator

mamy-CS Feb 19, 2026

Maybe add a verification step after step 3 to confirm the controller is running, before moving further?
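For example, a check like the following could go after step 3. It is a sketch under assumptions: the release name `workload-variant-autoscaler` comes from the install command in this PR, `verify_wva_controller` is a hypothetical helper name, and waiting on all pods assumes the namespace holds only the controller at this point.

```shell
# Hypothetical helper: confirm the Helm release exists and the pods in the
# controller namespace become Ready before continuing.
verify_wva_controller() {
  ns="${1:?namespace required}"
  # Report the state of the controller release.
  helm status workload-variant-autoscaler -n "$ns" || return 1
  # Block until every pod in the namespace is Ready, or time out.
  kubectl wait --for=condition=Ready pods --all -n "$ns" --timeout=120s || return 1
  # Show the resulting pod list for a quick visual check.
  kubectl get pods -n "$ns"
}
```

It would be called as `verify_wva_controller "$WVA_NS"`. If other workloads share the namespace, a label selector on `kubectl wait` would be a tighter check, but the controller's labels are not shown in this PR.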

docs/user-guide/installation.md

               ## Installation Methods
-              ### Option 1: Helm Installation (Recommended)
+              ### Option 1: Helm Installation (Recommended, on OpenShift)

Collaborator

mamy-CS Feb 19, 2026

The heading says "Recommended, on OpenShift", and the content is entirely OpenShift-specific. This might be confusing for non-OpenShift users. Please clarify here.

docs/user-guide/installation.md

+              helm upgrade -i workload-variant-autoscaler ./workload-variant-autoscaler \
+                -n $WVA_NS \
+                --set-file wva.prometheus.caCert=/tmp/prometheus-ca.crt \
+                --set controller.enabled=true \

Collaborator

mamy-CS Feb 19, 2026

The --set controller.enabled=true flag is explicit but redundant (it's the default). You could remove it to reduce verbosity, but either approach works, and keeping it makes the intent clear.

Contributor Author

shuynh2017 commented Feb 21, 2026

@mamy-CS thank you for your comments. @lionelvillard also provided ideas for further organization. I will update the PR.

