Skip to content

Commit 7d7e3b7

Browse files
authored
Use --ipc=host in docker run for distributed inference (#1125)
1 parent f98b745 commit 7d7e3b7

File tree

1 file changed

+2
-1
lines changed

1 file changed

+2
-1
lines changed

docs/source/getting_started/installation.rst

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -46,4 +46,5 @@ You can also build and install vLLM from source:
4646
.. code-block:: console
4747
4848
$ # Pull the Docker image with CUDA 11.8.
49-
$ docker run --gpus all -it --rm --shm-size=8g nvcr.io/nvidia/pytorch:22.12-py3
49+
$ # Use `--ipc=host` to make sure the shared memory is large enough.
50+
$ docker run --gpus all -it --rm --ipc=host nvcr.io/nvidia/pytorch:22.12-py3

0 commit comments

Comments
 (0)