This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Commit 47b77b3

Update multimodal.md to show server example
1 parent 8278aa2 commit 47b77b3

1 file changed, 37 additions and 4 deletions

docs/multimodal.md

Lines changed: 37 additions & 4 deletions
@@ -2,7 +2,7 @@

 Released on September 25th, 2024, **Llama3.2 11B Vision** is torchchat's first multimodal model.

-This page goes over the different commands you can run with LLama 3.2 11B Vision.
+This page goes over the different commands you can run with LLama 3.2 11B Vision.

 ## Model Access

@@ -44,7 +44,42 @@ python3 torchchat.py server llama3.2-11B

 In another terminal, query the server using `curl`. This query might take a few minutes to respond.

-**We are currently debugging the server integration and will have updated examples shortly.**
+<details>
+<summary>Example Query</summary>
+
+Setting `stream` to "true" in the request emits a response in chunks. If `stream` is unset or not "true", then the client will await the full response from the server.
+
+**Example Input + Output**
+
+```
+curl http://127.0.0.1:5000/v1/chat/completions \
+-H "Content-Type: application/json" \
+-d '{
+"model": "llama3.2",
+"messages": [
+{
+"role": "user",
+"content": [
+{
+"type": "text",
+"text": "What'\''s in this image?"
+},
+{
+"type": "image_url",
+"image_url": ""
+}
+]
+}
+],
+"max_tokens": 300
+}'
+```
+
+```
+{"id": "chatcmpl-cb7b39af-a22e-4f71-94a8-17753fa0d00c", "choices": [{"message": {"role": "assistant", "content": "The image depicts a simple black and white cartoon-style drawing of an animal face. It features a profile view, complete with two ears, expressive eyes, and a partial snout. The animal looks to the left, with its eye and mouth implied, suggesting that the drawn face might belong to a rabbit, dog, or pig. The graphic face has a bold black outline and a smaller, solid black nose. A small circle, forming part of the face, has a white background with two black quirkly short and long curved lines forming an outline of what was likely a mouth, complete with two teeth. The presence of the curve lines give the impression that the animal is smiling or speaking. Grey and black shadows behind the right ear and mouth suggest that this face is looking left and upwards. Given the prominent outline of the head and the outline of the nose, it appears that the depicted face is most likely from the side profile of a pig, although the ears make it seem like a dog and the shape of the nose makes it seem like a rabbit. Overall, it seems that this image, possibly part of a character illustration, is conveying a playful or expressive mood through its design and positioning."}, "finish_reason": "stop"}], "created": 1727487574, "model": "llama3.2", "system_fingerprint": "cpu_torch.float16", "object": "chat.completion"}%
+```
+
+</details>

 ## Browser
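
Note on the hunk above: the added `curl` example can also be expressed in Python, which some readers may find easier to script. The following is a minimal sketch using only the standard library, not code from torchchat. The helper names `build_payload`, `extract_reply`, and `ask` are hypothetical. The endpoint, model name, and field names are copied from the diff, and `image_url` is left for the caller to supply because the diff shows it as an empty string.

```python
import json
import urllib.request

# Address taken from the curl example; assumes the server is already running.
SERVER_URL = "http://127.0.0.1:5000/v1/chat/completions"


def build_payload(question, image_url, max_tokens=300):
    """Mirror the JSON body of the curl example (hypothetical helper)."""
    return {
        "model": "llama3.2",
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": question},
                    {"type": "image_url", "image_url": image_url},
                ],
            }
        ],
        "max_tokens": max_tokens,
    }


def extract_reply(response_body):
    """Return the assistant text from a non-streamed response body."""
    data = json.loads(response_body)
    return data["choices"][0]["message"]["content"]


def ask(question, image_url, url=SERVER_URL):
    """POST the payload and return the reply (requires a running server)."""
    request = urllib.request.Request(
        url,
        data=json.dumps(build_payload(question, image_url)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return extract_reply(response.read().decode("utf-8"))
```

Because `stream` is not set in this payload, the call blocks until the full response arrives, which per the doc text may take a few minutes.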

@@ -58,8 +93,6 @@ First, follow the steps in the Server section above to start a local server. The
 streamlit run torchchat/usages/browser.py
 ```

-**We are currently debugging the browser integration and will have updated examples shortly.**
-
 ---

 # Future Work
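
Note on the `stream` option from the server example: the doc text says that setting `stream` to "true" makes the server emit the response in chunks, but the diff does not show what a chunk looks like on the wire. The sketch below therefore rests on an assumption: that chunks follow the OpenAI-style server-sent-event convention (`data: {json}` lines carrying `choices[0].delta.content`, terminated by `data: [DONE]`). The helper name `iter_stream_text` is hypothetical, and the parsing should be adjusted if the server's actual output differs.

```python
import json


def iter_stream_text(lines):
    """Yield text fragments from an iterable of streamed lines.

    Assumption (not confirmed by the diff): each chunk is an OpenAI-style
    server-sent event, `data: {json}`, and the stream ends with `data: [DONE]`.
    """
    for raw in lines:
        line = raw.decode("utf-8") if isinstance(raw, bytes) else raw
        line = line.strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and anything unexpected
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text
```

To use it against a live server, one option is to add `"stream": "true"` to the request body and pass the response object returned by `urllib.request.urlopen` as `lines`, since that object can be iterated line by line.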
