Models are pulled from Docker Hub and then loaded dynamically into memory based on request usage.
-The first pull may take a while; after that, model files are cached locally. You can interact with the model using [OpenAI-compatible APIs](#what-api-endpoints-are-available).
+Models are pulled from Docker Hub the first time they're used and stored locally. They're loaded into memory only at runtime when a request is made, and unloaded when not in use to optimize resources. Since models can be large, the initial pull may take some time, but after that they're cached locally for faster access. You can interact with the model using [OpenAI-compatible APIs](#what-api-endpoints-are-available).

 ## Enable the feature

@@ -92,25 +91,11 @@ Lists all models currently pulled to your local environment.
 $ docker model list
 ```

-If no models have been pulled yet, you will see:
-
-```json
-{"object":"list","data":[]}
-```
-
-For better readability, format the output using `jq`:
+If no models have been pulled yet, you will see something similar to:

-```console
-$ docker model list | jq .
-```
-
-Expected formatted output:
-
-```json
-{
-  "object": "list",
-  "data": []
-}
+```text
+MODEL                                     PARAMETERS  QUANTIZATION    ARCHITECTURE  MODEL ID      CREATED       SIZE
+ignaciolopezluna020/gemma-3-it:4B-Q4_K_M  3.88 B      IQ2_XXS/Q4_K_M  gemma3        adea14bef2fe  55 years ago  2.31 GiB
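
Because this change switches the documented `docker model list` output from JSON to a plain-text table, scripts that previously piped the result through `jq` would need a different approach. The following is a minimal Python sketch of one way to read the table. It assumes columns are separated by two or more spaces, which is an assumption about the CLI's layout and not something the diff confirms. `parse_model_table` and `SAMPLE` are hypothetical names used only for illustration.

```python
import re


def parse_model_table(text):
    """Parse tabular `docker model list` output into a list of dicts.

    Assumption: columns are separated by runs of two or more spaces,
    so values such as "3.88 B" or "55 years ago" stay in one cell.
    """
    lines = [line.strip() for line in text.splitlines() if line.strip()]
    if not lines:
        return []
    splitter = re.compile(r"\s{2,}")
    header = splitter.split(lines[0])
    rows = []
    for line in lines[1:]:
        cells = splitter.split(line)
        # Pair each cell with its column name from the header row.
        rows.append(dict(zip(header, cells)))
    return rows


# Sample text mirroring the row shown in the diff, with column spacing assumed.
SAMPLE = """\
MODEL                                     PARAMETERS  QUANTIZATION    ARCHITECTURE  MODEL ID      CREATED       SIZE
ignaciolopezluna020/gemma-3-it:4B-Q4_K_M  3.88 B      IQ2_XXS/Q4_K_M  gemma3        adea14bef2fe  55 years ago  2.31 GiB
"""

models = parse_model_table(SAMPLE)
print(models[0]["MODEL"], models[0]["SIZE"])
```

To run this against a live installation, the sample string could be replaced with the standard output of `docker model list`, captured for example with `subprocess.run(["docker", "model", "list"], capture_output=True, text=True).stdout`. If the real column spacing differs from the assumption, the splitting pattern would need adjusting.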