Add inference snippets for image-text-to-text #927
Conversation
```js
	],
	max_tokens: 500,
})) {
	process.stdout.write(chunk.choices[0]?.delta?.content || "");
```
Suggested change:
```diff
- process.stdout.write(chunk.choices[0]?.delta?.content || "");
+ process.stdout.write(chunk.choices[0]?.delta?.content);
```
| "${model.id}", | ||
| token="${accessToken || "{API_TOKEN}"}", | ||
| ) | ||
| image_url = "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg" |
maybe it would be good to show an example with `PIL.Image.open` and use that image's base64 string representation, so that users get an example where they can load local images:
```python
from PIL import Image
import requests
from io import BytesIO
import base64

image_url = "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"
response = requests.get(image_url)
image = Image.open(BytesIO(response.content))

# Convert the image to a byte array in PNG format
buffered = BytesIO()
image.save(buffered, format="PNG")

# Encode this byte array to base64
img_base64 = base64.b64encode(buffered.getvalue())

# Print the base64 string
print(img_base64.decode())
```

Maybe the snippet would become too long. I will let you decide.
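For context, here's a minimal sketch (not part of the suggestion above) of how such a base64 string could be fed back into remote inference, assuming the OpenAI-style `image_url` content format used by chat completion endpoints; the file name and prompt are made up for illustration:

```python
import base64

# Hypothetical local file; any PNG/JPEG on disk works.
with open("statue_of_liberty.png", "rb") as f:
    img_base64 = base64.b64encode(f.read()).decode()

# Assumption: the endpoint accepts OpenAI-style data URIs for image content.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": f"data:image/png;base64,{img_base64}"}},
            {"type": "text", "text": "Describe this image in one sentence."},
        ],
    }
]
```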
hmm, I'd say this would complicate things a bit too much (no strong opinion though).
Note that this is for remote inference, not local usage.
> Note that this is for remote inference, not local usage.

Yes, I meant more like: remote inference using a local image file (otherwise, to use the snippet, the user needs to upload their image and get its URL).
Do you know if it's possible to have several snippets by returning a list, same as for code snippets?
> Do you know if it's possible to have several snippets by returning a list, same as for code snippets?

For inference snippets, that's not possible right now. So I suggest that:
- we merge this PR as it is, with only the image URL example
- we maybe unify on the moon side so that an inference snippet can be a list, like code snippets. If so, we can re-iterate and add an example with a local image
@coyotte508 @mishig25 thanks for the feedback. I addressed the comment above.
```ts
export const snippetConversationalWithImage = (model: ModelDataMinimal, accessToken: string): string =>
	`from huggingface_hub import InferenceClient

client = InferenceClient(
```
nit: in the playground, we use a snippet like this to match the OAI format/spec as closely as possible:
```python
from huggingface_hub import InferenceClient

client = InferenceClient(api_key="YOUR_HF_TOKEN")

messages = [
	{ "role": "user", "content": "Tell me a story" }
]

output = client.chat.completions.create(
	model="mistralai/Mistral-7B-Instruct-v0.3",
	messages=messages,
	stream=True,
	temperature=0.5,
	max_tokens=1024,
	top_p=0.7
)
```

The specific changes are:
- use `api_key` rather than `token`
- declare `model` inside `completions.create(` rather than `InferenceClient(`
I will let you decide
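Applied to the image snippet under discussion, that convention might look roughly like this; a sketch only, not the PR's final code, and the prompt is illustrative:

```python
from huggingface_hub import InferenceClient

client = InferenceClient(api_key="YOUR_HF_TOKEN")

image_url = "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": image_url}},
            {"type": "text", "text": "Describe this image in one sentence."},
        ],
    }
]

# `model` is declared here rather than in `InferenceClient(...)`.
output = client.chat.completions.create(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct",
    messages=messages,
    max_tokens=500,
)
print(output.choices[0].message.content)
```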
this comment maybe applies to the text conversational snippet as well
if we decide to change it, let's handle it in a subsequent PR
I'll use the same convention then. I've addressed it in e4c6cba for both the text-generation and image-text-to-text snippets.
Well well well, looks like it yes. Addressed in 0f8452c.
mishig25 left a comment
lgtm! again
Thanks! Sorry about the back and forth 😬
Triggered https://github.com/huggingface/huggingface.js/actions/runs/11107930206 so that we can get it in moon.
Follow-up to #927: the Python equivalent of https://github.com/huggingface/huggingface.js/blob/1bb5b31131c6990547087d91aebda2361e91dfad/packages/tasks/src/snippets/js.ts#L188. Because of this missing line, users do not see `python` among the options in the [inference snippet](https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct?inference_api=true). <img width="995" alt="image" src="https://github.com/user-attachments/assets/2237ca48-0b90-4c68-8beb-60ecdbbb0b86">

This PR adds inference snippets for `image-text-to-text` models, say meta-llama/Llama-3.2-11B-Vision-Instruct for example 😄 I've tested all three examples locally and they work as expected :)
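For reference, here is a streaming variant in the same spirit as the JS loop quoted earlier; this is a sketch under the conventions discussed in this thread, and the generated snippets in the PR are authoritative:

```python
from huggingface_hub import InferenceClient

client = InferenceClient(api_key="YOUR_HF_TOKEN")

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": "https://cdn.britannica.com/61/93061-050-99147DCE/Statue-of-Liberty-Island-New-York-Bay.jpg"}},
            {"type": "text", "text": "Describe this image in one sentence."},
        ],
    }
]

# Print tokens as they arrive, mirroring process.stdout.write in the JS snippet.
stream = client.chat.completions.create(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct",
    messages=messages,
    max_tokens=500,
    stream=True,
)
for chunk in stream:
    print(chunk.choices[0].delta.content or "", end="")
```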