Project idea #688
Hey @Radiant690, you could use a combination of computer vision and LLMs, or even a multi-modal LLM (also called a VLM, for Vision Language Model).

One approach is to have a computer vision model recognize what's in the image and then use its output to build the prompt for the LLM. For example, say you had a picture of a coke drink, you could go:

Picture -> Computer vision model -> Output: "coke drink" -> Input to LLM: "Is it safe for a pregnant woman to consume {coke_drink}?" -> Output

This could all be done through an interface built with Gradio: https://www.gradio.app/

See this example using the LLaVA model with an image/chat interface: https://llava.hliu.cc/ (made with Gradio).
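To make the Picture -> CV -> LLM chain concrete, here's a minimal sketch. The two model functions are placeholders (assumptions on my part); you'd swap in a real classifier (e.g. a fine-tuned torchvision model) and a real LLM API call. The Gradio wiring at the bottom is left as comments so the core logic runs standalone.

```python
def classify_food(image_path: str) -> str:
    """Placeholder computer-vision step: return a label for the image.

    In a real app this would run your trained model (e.g. a ResNet
    or CLIP classifier) on the image and return the top label.
    """
    return "coke drink"  # hardcoded stand-in for the model's prediction


def build_prompt(label: str, question: str) -> str:
    """Insert the CV label into the user's question for the LLM."""
    return f"A pregnant woman is asking about '{label}'. {question}"


def ask_llm(prompt: str) -> str:
    """Placeholder LLM step: replace with your LLM API of choice."""
    return f"(LLM answer for: {prompt})"


def pipeline(image_path: str, question: str) -> str:
    """CV output becomes part of the LLM prompt, as described above."""
    label = classify_food(image_path)
    return ask_llm(build_prompt(label, question))


# Gradio wiring (untested sketch; requires `pip install gradio`):
# import gradio as gr
# demo = gr.Interface(
#     fn=pipeline,
#     inputs=[gr.Image(type="filepath"), gr.Textbox(label="Your question")],
#     outputs="text",
# )
# demo.launch()
```

The key design point is that the CV model and the LLM never talk to each other directly; the CV label is just a string that gets templated into the prompt, so you can develop and test each half independently.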
I am working on my major project for my BTech and want to build an innovative product (a Gradio website) using computer vision and deep learning with LLM functionality. To make it more concrete: the idea is that a pregnant woman takes or uploads a photo of a food item or beverage. The LLM then provides a Q&A interface so the user can ask questions about that item. Example: a pregnant woman shopping for a coke drink feels unsure, so she opens our application, takes or uploads a photo, and in the later LLM segment asks questions like "What quantity of this item is safe?" or "What is an alternative to this item?"
I would love more information on how to connect the CV module with the LLM part.
I have taken both the PyTorch and LLM workshop projects, but lack some clarity.
Would highly appreciate some advice 😊