
Commit 2cc416f

dmr: request inspector
1 parent 7fbe6e1 commit 2cc416f

File tree

1 file changed: +26 −1 lines changed


content/manuals/ai/model-runner/_index.md

Lines changed: 26 additions & 1 deletion
@@ -39,6 +39,7 @@ with AI models locally.
 - Package GGUF files as OCI Artifacts and publish them to any Container Registry
 - Run and interact with AI models directly from the command line or from the Docker Desktop GUI
 - Manage local models and display logs
+- Display prompt and response details
 
 ## Requirements

@@ -188,7 +189,7 @@ docker model pull hf.co/bartowski/Llama-3.2-1B-Instruct-GGUF
 {{< tabs group="release" >}}
 {{< tab name="From Docker Desktop">}}
 
-1. Select **Models** and select the **Local** tab
+1. Select **Models** and select the **Local** tab.
 1. Click the play button. The interactive chat screen opens.
 
 ![screencapture of the Local view](./images/dmr-run.png)
@@ -203,6 +204,8 @@ Use the [`docker model run` command](/reference/cli/docker/model/run/).
 
 ## Troubleshooting
 
+### Display the logs
+
 To troubleshoot potential issues, display the logs:
 
 {{< tabs group="release" >}}
@@ -220,6 +223,28 @@ Use the [`docker model logs` command](/reference/cli/docker/model/logs/).
 {{< /tab >}}
 {{< /tabs >}}
 
+### Inspect requests and responses
+
+Inspecting requests and responses helps you diagnose model-related issues.
+For example, you can evaluate context usage to verify that you stay within the model's context
+window, or display the full body of a request to check the parameters you are passing to your models
+when developing with a framework.
+
+In Docker Desktop, to inspect the requests and responses for each model:
+
+1. Select **Models** and select the **Requests** tab. This view displays all the requests to all models:
+   - The time the request was sent.
+   - The model name and version.
+   - The prompt/request.
+   - The context usage.
+   - The time it took for the response to be generated.
+2. Select one of the requests to display further details:
+   - In the **Overview** tab, view the token usage, response metadata and generation speed, and the actual prompt and response.
+   - In the **Request** and **Response** tabs, view the full JSON payload of the request and the response.
+
+> [!NOTE]
+> You can also display the requests for a specific model when you select a model and then select the **Requests** tab.
 
 ## Publish a model
 
 > [!NOTE]
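The **Request** and **Response** tabs added in this commit expose the full JSON payloads. As a rough sketch of what "context usage" means there, the snippet below computes the fraction of a context window consumed by one exchange, assuming an OpenAI-compatible response shape with a `usage` object (the field names and sample payload are illustrative assumptions, not taken from this commit):

```python
import json

# Hypothetical response body, shaped like an OpenAI-compatible chat
# completion payload; model name and token counts are made up.
response_json = """
{
  "model": "ai/llama3.2:1B-Q8_0",
  "choices": [{"message": {"role": "assistant", "content": "Hello!"}}],
  "usage": {"prompt_tokens": 42, "completion_tokens": 7, "total_tokens": 49}
}
"""

def context_usage(body: str, context_window: int) -> float:
    """Return the fraction of the context window consumed by a request."""
    usage = json.loads(body)["usage"]
    return usage["total_tokens"] / context_window

# Against an assumed 4096-token context window:
print(f"{context_usage(response_json, 4096):.1%}")  # → 1.2%
```

A value approaching 100% is the signal the new **Requests** tab surfaces visually: the conversation is about to overflow the model's context window.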
