You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The device specification is passed directly to the underlying container engine. See documentation of the supported container engine for more information.
45
+
46
+
Pass '--device=none' explicitly add no device to the container, eg for
47
+
running a CPU-only performance comparison.
48
+
43
49
#### **--env**=
44
50
45
51
Set environment variables inside of the container.
@@ -57,7 +63,7 @@ OCI container image to run with specified AI model. RamaLama defaults to using
57
63
images based on the accelerator it discovers. For example:
58
64
`quay.io/ramalama/ramalama`. See the table below for all default images.
59
65
The default image tag is based on the minor version of the RamaLama package.
60
-
Version 0.11.1 of RamaLama pulls an image with a `:0.11` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
66
+
Version 0.12.1 of RamaLama pulls an image with a `:0.12` tag from the quay.io/ramalama OCI repository. The --image option overrides this default.
61
67
62
68
The default can be overridden in the ramalama.conf file or via the
Usage: Lower numbers are good for virtual assistants where we need deterministic responses. Higher numbers are good for roleplay or creative tasks like editing stories
141
147
148
+
#### **--thinking**=*true*
149
+
Enable or disable thinking mode in reasoning models
150
+
142
151
#### **--threads**, **-t**
143
152
Maximum number of cpu threads to use.
144
153
The default is to use half the cores available on this system for the number of threads.
0 commit comments