Commit 8bd37c9

refactor: move backend deploy, launch and slurm files from components to examples (#3849)
Signed-off-by: Anant Sharma <[email protected]>
1 parent 7835904 commit 8bd37c9

168 files changed, with 216 additions and 226 deletions.

Some content is hidden

Large commits have some content hidden by default, so only part of the diff appears below.
.github/filters.yaml

Lines changed: 3 additions & 3 deletions
@@ -28,22 +28,22 @@ vllm: &vllm
  - 'container/Dockerfile.vllm'
  - 'container/deps/requirements.vllm.txt'
  - 'container/deps/vllm/**'
- - 'components/backends/vllm/**'
+ - 'examples/backends/vllm/**'
  - 'components/src/dynamo/vllm/**'
  - 'container/build.sh'
  - 'tests/serve/test_vllm.py'

  sglang: &sglang
  - 'container/Dockerfile.sglang'
  - 'container/Dockerfile.sglang-wideep'
- - 'components/backends/sglang/**'
+ - 'examples/backends/sglang/**'
  - 'components/src/dynamo/sglang/**'
  - 'container/build.sh'
  - 'tests/serve/test_sglang.py'

  trtllm: &trtllm
  - 'container/Dockerfile.trtllm'
- - 'components/backends/trtllm/**'
+ - 'examples/backends/trtllm/**'
  - 'components/src/dynamo/trtllm/**'
  - 'container/build.sh'
  - 'container/build_trtllm_wheel.sh'
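
A minimal sketch of what the updated `vllm` filter implies for changed files. It assumes Python's `fnmatch` is a close enough stand-in for whatever glob matcher the CI actually uses (in `fnmatch`, `*` also matches `/`, which may differ from the real matcher). The helper name `matches_filter` is hypothetical; the patterns are copied from the hunk above.

```python
from fnmatch import fnmatch

# Patterns copied from the updated `vllm` filter in .github/filters.yaml.
VLLM_PATTERNS = [
    "container/Dockerfile.vllm",
    "container/deps/requirements.vllm.txt",
    "container/deps/vllm/**",
    "examples/backends/vllm/**",
    "components/src/dynamo/vllm/**",
    "container/build.sh",
    "tests/serve/test_vllm.py",
]


def matches_filter(path: str, patterns: list[str]) -> bool:
    """Return True if `path` matches any pattern (fnmatch approximation)."""
    return any(fnmatch(path, pattern) for pattern in patterns)
```

Under this approximation, a change to `examples/backends/vllm/deploy/disagg.yaml` matches the filter, while a path under the old `components/backends/vllm/` location no longer does.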

.github/workflows/container-validation-backends.yml

Lines changed: 1 addition & 1 deletion
@@ -429,7 +429,7 @@ jobs:
  export KUBECONFIG=$(pwd)/.kubeconfig
  kubectl config set-context --current --namespace=$NAMESPACE

- cd components/backends/$FRAMEWORK
+ cd examples/backends/$FRAMEWORK
  export FRAMEWORK_RUNTIME_IMAGE="${{ secrets.AZURE_ACR_HOSTNAME }}/ai-dynamo/dynamo:${{ github.sha }}-${FRAMEWORK}-amd64"
  export KUBE_NS=$NAMESPACE
  export GRAPH_NAME=$(yq e '.metadata.name' $DEPLOYMENT_FILE)

README.md

Lines changed: 1 addition & 1 deletion
@@ -171,7 +171,7 @@ Rerun with `curl -N` and change `stream` in the request to `true` to get the res
  ### Deploying Dynamo

  - Follow the [Quickstart Guide](docs/kubernetes/README.md) to deploy on Kubernetes.
- - Check out [Backends](components/backends) to deploy various workflow configurations (e.g. SGLang with router, vLLM with disaggregated serving, etc.)
+ - Check out [Backends](examples/backends) to deploy various workflow configurations (e.g. SGLang with router, vLLM with disaggregated serving, etc.)
  - Run some [Examples](examples) to learn about building components in Dynamo and exploring various integrations.

  ### Benchmarking Dynamo

benchmarks/README.md

Lines changed: 1 addition & 1 deletion
@@ -20,7 +20,7 @@ This directory contains benchmarking scripts and tools for performance evaluatio
  ## Quick Start

  ### Benchmark a Dynamo Deployment
- First, deploy your DynamoGraphDeployment using the [deployment documentation](../components/backends/), then:
+ First, deploy your DynamoGraphDeployment using the [deployment documentation](../docs/kubernetes/), then:

  ```bash
  # Port-forward your deployment to http://localhost:8000

benchmarks/profiler/utils/config_modifiers/sglang.py

Lines changed: 1 addition & 1 deletion
@@ -36,7 +36,7 @@
  logger.addHandler(console_handler)


- DEFAULT_SGLANG_CONFIG_PATH = "components/backends/sglang/deploy/disagg.yaml"
+ DEFAULT_SGLANG_CONFIG_PATH = "examples/backends/sglang/deploy/disagg.yaml"


  class SGLangConfigModifier:

benchmarks/profiler/utils/config_modifiers/trtllm.py

Lines changed: 1 addition & 1 deletion
@@ -38,7 +38,7 @@
  logger.addHandler(console_handler)


- DEFAULT_TRTLLM_CONFIG_PATH = "components/backends/trtllm/deploy/disagg.yaml"
+ DEFAULT_TRTLLM_CONFIG_PATH = "examples/backends/trtllm/deploy/disagg.yaml"


  class TrtllmConfigModifier:

benchmarks/profiler/utils/config_modifiers/vllm.py

Lines changed: 1 addition & 1 deletion
@@ -34,7 +34,7 @@
  logger.addHandler(console_handler)


- DEFAULT_VLLM_CONFIG_PATH = "components/backends/vllm/deploy/disagg.yaml"
+ DEFAULT_VLLM_CONFIG_PATH = "examples/backends/vllm/deploy/disagg.yaml"


  class VllmV1ConfigModifier:
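
The three `DEFAULT_*_CONFIG_PATH` constants above are repo-relative strings, so they only resolve when read relative to the repository root. The sketch below shows one way a caller might resolve and validate them. The `DEFAULT_CONFIG_PATHS` mapping and the `resolve_default_config` helper are hypothetical and not part of the profiler code; only the path strings come from the diff.

```python
from pathlib import Path

# Path strings copied from the updated constants; the dict itself is an
# illustrative grouping, not something defined in the profiler modules.
DEFAULT_CONFIG_PATHS = {
    "vllm": "examples/backends/vllm/deploy/disagg.yaml",
    "sglang": "examples/backends/sglang/deploy/disagg.yaml",
    "trtllm": "examples/backends/trtllm/deploy/disagg.yaml",
}


def resolve_default_config(repo_root: Path, backend: str) -> Path:
    """Resolve a backend's default config under `repo_root`, failing early if absent."""
    candidate = repo_root / DEFAULT_CONFIG_PATHS[backend]
    if not candidate.is_file():
        raise FileNotFoundError(f"default {backend} config not found: {candidate}")
    return candidate
```

Failing early with the full path makes a stale location, such as the old `components/backends/` prefix, easier to spot than a later parse error.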

components/README.md

Lines changed: 7 additions & 15 deletions
@@ -19,25 +19,17 @@ limitations under the License.

  This directory contains the core components that make up the Dynamo inference framework. Each component serves a specific role in the distributed LLM serving architecture, enabling high-throughput, low-latency inference across multiple nodes and GPUs.

- ## Supported Inference Engines
-
- Dynamo supports multiple inference engines (with a focus on SGLang, vLLM, and TensorRT-LLM), each with their own deployment configurations and capabilities:
-
- - **[vLLM](/docs/backends/vllm/README.md)** - High-performance LLM inference with native KV cache events and NIXL-based transfer mechanisms
- - **[SGLang](/docs/backends/sglang/README.md)** - Structured generation language framework with ZMQ-based communication
- - **[TensorRT-LLM](/docs/backends/trtllm/README.md)** - NVIDIA's optimized LLM inference engine with TensorRT acceleration
-
- Each engine provides launch scripts for different deployment patterns in their respective `/launch` & `/deploy` directories.
-
  ## Core Components

- ### [Backends](backends/)
+ ### Backends
+
+ Dynamo supports multiple inference engines, each with their own deployment configurations and capabilities:

- The backends directory contains inference engine integrations and implementations, with a key focus on:
+ - **[vLLM](/docs/backends/vllm/README.md)** - Full-featured vLLM integration with disaggregated serving, KV-aware routing, SLA-based planning, native KV cache events, and NIXL-based transfer mechanisms
+ - **[SGLang](/docs/backends/sglang/README.md)** - SGLang engine integration with ZMQ-based communication, supporting disaggregated serving and KV-aware routing
+ - **[TensorRT-LLM](/docs/backends/trtllm/README.md)** - TensorRT-LLM integration with disaggregated serving capabilities and TensorRT acceleration

- - **vLLM** - Full-featured vLLM integration with disaggregated serving, KV-aware routing, and SLA-based planning
- - **SGLang** - SGLang engine integration supporting disaggregated serving and KV-aware routing
- - **TensorRT-LLM** - TensorRT-LLM integration with disaggregated serving capabilities
+ Each engine provides launch and deploy scripts for different deployment patterns in the [examples](../examples/backends/) folder.


  ### [Frontend](src/dynamo/frontend/)

components/src/dynamo/router/README.md

Lines changed: 1 addition & 1 deletion
@@ -47,7 +47,7 @@ Clients query the `find_best_worker` endpoint to determine which worker should p
  >
  > Use this manual setup if you need explicit control over prefill routing configuration or want to manage prefill and decode routers separately.

- See [`components/backends/vllm/launch/disagg_router.sh`](/components/backends/vllm/launch/disagg_router.sh) for a complete example.
+ See [`examples/backends/vllm/launch/disagg_router.sh`](/examples/backends/vllm/launch/disagg_router.sh) for a complete example.

  ```bash
  # Start frontend router for decode workers

container/Dockerfile.sglang-wideep

Lines changed: 1 addition & 1 deletion
@@ -87,4 +87,4 @@ ENV PATH=/usr/local/bin/etcd:$PATH
  # Enable forceful shutdown of inflight requests
  ENV SGL_FORCE_SHUTDOWN=1

- WORKDIR /sgl-workspace/dynamo/components/backends/sglang
+ WORKDIR /sgl-workspace/dynamo/examples/backends/sglang
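
Because this rename touches path strings in YAML, Markdown, Python, and Dockerfiles alike, a plain text scan is one way to look for references that were missed. The following is a rough sketch, not part of the commit: the function name `find_stale_references` is hypothetical, and it assumes UTF-8 text files and skips anything it cannot decode.

```python
from pathlib import Path

# The prefix this commit moves away from.
OLD_PREFIX = "components/backends/"


def find_stale_references(root: Path, needle: str = OLD_PREFIX) -> list[tuple[Path, int, str]]:
    """Return (file, line number, stripped line) for every line containing `needle`."""
    hits = []
    for path in sorted(root.rglob("*")):
        # Skip directories and anything inside a .git directory.
        if not path.is_file() or ".git" in path.relative_to(root).parts:
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except (UnicodeDecodeError, OSError):
            continue  # skip binary or unreadable files
        for lineno, line in enumerate(text.splitlines(), start=1):
            if needle in line:
                hits.append((path, lineno, line.strip()))
    return hits
```

Calling `find_stale_references(Path("."))` from a checkout would list any remaining mentions of the old prefix. Some hits may be intentional, for example changelog entries, so the output is a list to review rather than a list of errors.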
