```yaml
type: service
diff --git a/docs/blog/posts/beyond-kubernetes-2024-recap-and-whats-ahead.md b/docs/blog/posts/beyond-kubernetes-2024-recap-and-whats-ahead.md
index be7868e285..79e026a0b4 100644
--- a/docs/blog/posts/beyond-kubernetes-2024-recap-and-whats-ahead.md
+++ b/docs/blog/posts/beyond-kubernetes-2024-recap-and-whats-ahead.md
@@ -110,7 +110,7 @@ efficient manner.
### NVIDIA
-NVIDIA remains the top accelerator supported by `dstack`. Recently, we introduced a [NIM example](../../examples/deployment/nim/index.md)
+NVIDIA remains the top accelerator supported by `dstack`. Recently, we introduced a [NIM example](../../examples/inference/nim/index.md)
for model deployment, and we continue to enhance support for the rest of NVIDIA's ecosystem.
### AMD
diff --git a/docs/blog/posts/dstack-metrics.md b/docs/blog/posts/dstack-metrics.md
index 459ef6d3e5..07d80ab0ab 100644
--- a/docs/blog/posts/dstack-metrics.md
+++ b/docs/blog/posts/dstack-metrics.md
@@ -7,7 +7,7 @@ image: https://dstack.ai/static-assets/static-assets/images/dstack-stats-v2.png
categories:
- AMD
- NVIDIA
- - Monitoring
+ - Metrics
---
# Monitoring essential GPU metrics via CLI
diff --git a/docs/blog/posts/metrics-ui.md b/docs/blog/posts/metrics-ui.md
index ba0cfde6b6..032115e5a6 100644
--- a/docs/blog/posts/metrics-ui.md
+++ b/docs/blog/posts/metrics-ui.md
@@ -5,7 +5,7 @@ description: "TBA"
slug: metrics-ui
image: https://dstack.ai/static-assets/static-assets/images/dstack-metrics-ui-v3-min.png
categories:
- - Monitoring
+ - Metrics
- AMD
- NVIDIA
---
@@ -55,6 +55,6 @@ For persistent storage and long-term access to metrics, we still recommend setti
metrics from `dstack`.
!!! info "What's next?"
- 1. See [Monitoring](../../docs/guides/monitoring.md)
+ 1. See [Metrics](../../docs/guides/metrics.md)
2. Check [dev environments](../../docs/concepts/dev-environments.md), [tasks](../../docs/concepts/tasks.md), [services](../../docs/concepts/services.md), and [fleets](../../docs/concepts/fleets.md)
3. Join [Discord :material-arrow-top-right-thin:{ .external }](https://discord.gg/u8SmfwPpMd){:target="_blank"}
diff --git a/docs/blog/posts/mpi.md b/docs/blog/posts/mpi.md
index 4516c4297e..70b3ed1650 100644
--- a/docs/blog/posts/mpi.md
+++ b/docs/blog/posts/mpi.md
@@ -101,5 +101,5 @@ as well as use MPI for other tasks.
!!! info "What's next?"
1. Learn more about [dev environments](../../docs/concepts/dev-environments.md), [tasks](../../docs/concepts/tasks.md), [services](../../docs/concepts/services.md), and [fleets](../../docs/concepts/fleets.md)
- 2. Check the [NCCL tests](../../examples/distributed-training/nccl-tests/index.md) example
+ 2. Check the [NCCL tests](../../examples/clusters/nccl-tests/index.md) example
2. Join [Discord :material-arrow-top-right-thin:{ .external }](https://discord.gg/u8SmfwPpMd){:target="_blank"}
diff --git a/docs/blog/posts/prometheus.md b/docs/blog/posts/prometheus.md
index 23e644bcfd..fbaee63c1e 100644
--- a/docs/blog/posts/prometheus.md
+++ b/docs/blog/posts/prometheus.md
@@ -5,7 +5,7 @@ description: "TBA"
slug: prometheus
image: https://dstack.ai/static-assets/static-assets/images/dstack-prometheus-v3.png
categories:
- - Monitoring
+ - Metrics
- NVIDIA
---
@@ -46,7 +46,7 @@ Overall, `dstack` collects three groups of metrics:
| **Runs** | Run metrics include run counters for each user in each project. |
| **Jobs** | A run consists of one or more jobs, each mapped to a container. Job metrics offer insights into execution time, cost, GPU model, NVIDIA DCGM telemetry, and more. |
-For a full list of available metrics and labels, check out [Monitoring](../../docs/guides/monitoring.md).
+For a full list of available metrics and labels, check out [Metrics](../../docs/guides/metrics.md).
??? info "NVIDIA"
NVIDIA DCGM metrics are automatically collected for `aws`, `azure`, `gcp`, and `oci` backends,
@@ -60,7 +60,7 @@ For a full list of available metrics and labels, check out [Monitoring](../../do
only accessible through the UI and the [`dstack metrics`](dstack-metrics.md) CLI.
!!! info "What's next?"
- 1. See [Metrics](../../docs/guides/monitoring.md)
+ 1. See [Metrics](../../docs/guides/metrics.md)
1. Check [dev environments](../../docs/concepts/dev-environments.md),
[tasks](../../docs/concepts/tasks.md), [services](../../docs/concepts/services.md),
and [fleets](../../docs/concepts/fleets.md)
diff --git a/docs/blog/posts/tpu-on-gcp.md b/docs/blog/posts/tpu-on-gcp.md
index 8cdf1a051a..24534c7e76 100644
--- a/docs/blog/posts/tpu-on-gcp.md
+++ b/docs/blog/posts/tpu-on-gcp.md
@@ -50,7 +50,7 @@ and [vLLM :material-arrow-top-right-thin:{ .external }](https://github.com/vllm-
=== "Optimum TPU"
-
+
```yaml
type: service
@@ -83,7 +83,7 @@ and [vLLM :material-arrow-top-right-thin:{ .external }](https://github.com/vllm-
the official Docker image can be used instead of `dstackai/optimum-tpu:llama31`.
=== "vLLM"
-
+
```yaml
type: service
diff --git a/docs/changelog/index.md b/docs/changelog/index.md
deleted file mode 100644
index 05761ac57f..0000000000
--- a/docs/changelog/index.md
+++ /dev/null
@@ -1 +0,0 @@
-# Blog
diff --git a/docs/docs/concepts/gateways.md b/docs/docs/concepts/gateways.md
index 815eca4ae1..500d0d86cf 100644
--- a/docs/docs/concepts/gateways.md
+++ b/docs/docs/concepts/gateways.md
@@ -71,7 +71,7 @@ To delete a gateway, pass the gateway configuration to [`dstack delete`](../refe
```shell
-$ dstack delete -f examples/deployment/gateway.dstack.yml
+$ dstack delete -f examples/inference/gateway.dstack.yml
```
diff --git a/docs/docs/concepts/services.md b/docs/docs/concepts/services.md
index 1f7b515b84..70e0b2d5bd 100644
--- a/docs/docs/concepts/services.md
+++ b/docs/docs/concepts/services.md
@@ -558,6 +558,6 @@ If one replica of a multi-replica service fails with retry enabled,
1. Read about [dev environments](dev-environments.md), [tasks](tasks.md), and [repos](repos.md)
2. Learn how to manage [fleets](fleets.md)
3. See how to set up [gateways](gateways.md)
- 4. Check the [TGI :material-arrow-top-right-thin:{ .external }](../../examples/deployment/tgi/index.md){:target="_blank"},
- [vLLM :material-arrow-top-right-thin:{ .external }](../../examples/deployment/vllm/index.md){:target="_blank"}, and
- [NIM :material-arrow-top-right-thin:{ .external }](../../examples/deployment/nim/index.md){:target="_blank"} examples
+ 4. Check the [TGI :material-arrow-top-right-thin:{ .external }](../../examples/inference/tgi/index.md){:target="_blank"},
+ [vLLM :material-arrow-top-right-thin:{ .external }](../../examples/inference/vllm/index.md){:target="_blank"}, and
+ [NIM :material-arrow-top-right-thin:{ .external }](../../examples/inference/nim/index.md){:target="_blank"} examples
diff --git a/docs/docs/guides/monitoring.md b/docs/docs/guides/metrics.md
similarity index 100%
rename from docs/docs/guides/monitoring.md
rename to docs/docs/guides/metrics.md
diff --git a/docs/examples.md b/docs/examples.md
index 24881a7c09..128640b1ef 100644
--- a/docs/examples.md
+++ b/docs/examples.md
@@ -38,10 +38,10 @@ hide:
-## Distributed training
+## Clusters
-## Deployment
+## Inference
-
SGLang
@@ -95,7 +95,7 @@ hide:
Deploy DeepSeek distilled models with SGLang
-
vLLM
@@ -104,7 +104,7 @@ hide:
Deploy Llama 3.1 with vLLM
-
TGI
@@ -113,7 +113,7 @@ hide:
Deploy Llama 4 with TGI
-
NIM
@@ -122,7 +122,7 @@ hide:
Deploy a DeepSeek distilled model with NIM
-
TensorRT-LLM
diff --git a/docs/examples/deployment/nim/index.md b/docs/examples/clusters/a3high/index.md
similarity index 100%
rename from docs/examples/deployment/nim/index.md
rename to docs/examples/clusters/a3high/index.md
diff --git a/docs/examples/deployment/sglang/index.md b/docs/examples/clusters/a3mega/index.md
similarity index 100%
rename from docs/examples/deployment/sglang/index.md
rename to docs/examples/clusters/a3mega/index.md
diff --git a/docs/examples/deployment/tgi/index.md b/docs/examples/clusters/nccl-tests/index.md
similarity index 100%
rename from docs/examples/deployment/tgi/index.md
rename to docs/examples/clusters/nccl-tests/index.md
diff --git a/docs/examples/deployment/trtllm/index.md b/docs/examples/clusters/rccl-tests/index.md
similarity index 100%
rename from docs/examples/deployment/trtllm/index.md
rename to docs/examples/clusters/rccl-tests/index.md
diff --git a/docs/examples/deployment/vllm/index.md b/docs/examples/inference/nim/index.md
similarity index 100%
rename from docs/examples/deployment/vllm/index.md
rename to docs/examples/inference/nim/index.md
diff --git a/docs/examples/distributed-training/a3high-clusters/index.md b/docs/examples/inference/sglang/index.md
similarity index 100%
rename from docs/examples/distributed-training/a3high-clusters/index.md
rename to docs/examples/inference/sglang/index.md
diff --git a/docs/examples/distributed-training/a3mega-clusters/index.md b/docs/examples/inference/tgi/index.md
similarity index 100%
rename from docs/examples/distributed-training/a3mega-clusters/index.md
rename to docs/examples/inference/tgi/index.md
diff --git a/docs/examples/distributed-training/nccl-tests/index.md b/docs/examples/inference/trtllm/index.md
similarity index 100%
rename from docs/examples/distributed-training/nccl-tests/index.md
rename to docs/examples/inference/trtllm/index.md
diff --git a/docs/examples/distributed-training/rccl-tests/index.md b/docs/examples/inference/vllm/index.md
similarity index 100%
rename from docs/examples/distributed-training/rccl-tests/index.md
rename to docs/examples/inference/vllm/index.md
diff --git a/docs/overrides/main.html b/docs/overrides/main.html
index 3ab9f7deff..51adf057be 100644
--- a/docs/overrides/main.html
+++ b/docs/overrides/main.html
@@ -117,9 +117,9 @@