forked from docker/model-runner
docker_model_reinstall-runner.yaml
command: docker model reinstall-runner
short: Reinstall Docker Model Runner (Docker Engine only)
long: |
    This command removes the existing Docker Model Runner container and reinstalls it with the specified configuration. Models and images are preserved during reinstallation.
usage: docker model reinstall-runner
pname: docker model
plink: docker_model.yaml
options:
    - option: backend
      value_type: string
      description: 'Specify backend (llama.cpp|vllm). Default: llama.cpp'
      deprecated: false
      hidden: false
      experimental: false
      experimentalcli: false
      kubernetes: false
      swarm: false
    - option: do-not-track
      value_type: bool
      default_value: "false"
      description: Do not track model usage in Docker Model Runner
      deprecated: false
      hidden: false
      experimental: false
      experimentalcli: false
      kubernetes: false
      swarm: false
    - option: gpu
      value_type: string
      default_value: auto
      description: Specify GPU support (none|auto|cuda|musa|rocm|cann)
      deprecated: false
      hidden: false
      experimental: false
      experimentalcli: false
      kubernetes: false
      swarm: false
    - option: host
      value_type: string
      default_value: 127.0.0.1
      description: Host address to bind Docker Model Runner
      deprecated: false
      hidden: false
      experimental: false
      experimentalcli: false
      kubernetes: false
      swarm: false
    - option: port
      value_type: uint16
      default_value: "0"
      description: |
        Docker container port for Docker Model Runner (default: 12434 for Docker Engine, 12435 for Cloud mode)
      deprecated: false
      hidden: false
      experimental: false
      experimentalcli: false
      kubernetes: false
      swarm: false
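examples: |-
    The invocations below are an illustrative sketch, not examples taken from the upstream documentation. They combine only the flags listed under `options`; the chosen values (`cuda`, `vllm`, port `12434`) are assumptions about the host and should be adjusted to match the actual hardware and setup.

    ```console
    $ docker model reinstall-runner --gpu cuda --port 12434
    $ docker model reinstall-runner --backend vllm --gpu cuda --do-not-track
    ```

    Per the command description, models and images are preserved in both cases; only the runner container is removed and recreated.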
deprecated: false
hidden: false
experimental: false
experimentalcli: false
kubernetes: false
swarm: false