📷 AI Image Captioning App

A simple Streamlit app that uses the BLIP/BLIP2 models to generate captions for any uploaded image.

🚀 How It Works

1. Upload an image (jpg/png).
2. The AI model analyzes it.
3. You get a descriptive caption like:

“A cat wearing sunglasses while sitting on a couch.”
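The analysis and caption steps above can be sketched roughly as follows. This is a minimal illustration, not the app's actual code: the helper names load_blip and generate_caption are hypothetical, the base checkpoint is picked as a default only for the example, and max_new_tokens=30 is an assumed limit. The transformers classes and calls are the standard ones for BLIP captioning.

```python
# Hypothetical helpers for illustration; only the transformers calls are library API.

def load_blip(model_id="Salesforce/blip-image-captioning-base"):
    """Load the processor and model for a BLIP captioning checkpoint."""
    # Imported lazily so this module can be imported before transformers is needed.
    from transformers import BlipForConditionalGeneration, BlipProcessor

    processor = BlipProcessor.from_pretrained(model_id)
    model = BlipForConditionalGeneration.from_pretrained(model_id)
    return processor, model


def generate_caption(image, processor, model, max_new_tokens=30):
    """Return a caption string for a single RGB PIL image."""
    # Step 2: preprocess the image into PyTorch tensors for the model.
    inputs = processor(images=image, return_tensors="pt")
    # Generate caption token ids (max_new_tokens=30 is an assumed default).
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Step 3: decode the first (and only) sequence into plain text.
    return processor.decode(output_ids[0], skip_special_tokens=True).strip()
```

A typical call would load the pair once with `processor, model = load_blip()`, open the upload with `Image.open(path).convert("RGB")` from Pillow, and pass all three to `generate_caption`. Loading once and reusing the pair avoids reloading the weights for every image.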

🧠 Tech Stack

Python
Streamlit
Transformers (BLIP model from Salesforce)
Hugging Face

🛠️ Run Locally

pip install streamlit transformers torch torchvision Pillow
streamlit run ai_image_captioning_app.py

🧠 Model Comparison: BLIP Base vs Large vs BLIP2

We captioned the same image with three different models. Here is how their descriptions differ:

🏃 Input Image:

⚙️ Using BLIP Base Model

from transformers import BlipForConditionalGeneration
model = BlipForConditionalGeneration.from_pretrained("Salesforce/blip-image-captioning-base")

🔥 Using BLIP Large Model

from transformers import BlipForConditionalGeneration
model = BlipForConditionalGeneration.from_pretrained("Salesforce/blip-image-captioning-large")

🧠 Using BLIP2 + Flan-T5 Model

from transformers import Blip2Processor, Blip2ForConditionalGeneration
processor = Blip2Processor.from_pretrained("Salesforce/blip2-flan-t5-xl")
model = Blip2ForConditionalGeneration.from_pretrained("Salesforce/blip2-flan-t5-xl")
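To run a comparison like this yourself, one option is a small harness that loads each checkpoint with its matching classes and captions the same image with all of them. The sketch below is an assumption about how that could look, not the repository's code: MODEL_IDS, load_captioner and compare_captions are hypothetical names, max_new_tokens=30 is an assumed limit, and BLIP2 is called without a text prompt for simplicity.

```python
# Checkpoint names from the sections above, keyed by a display label.
MODEL_IDS = {
    "BLIP Base": "Salesforce/blip-image-captioning-base",
    "BLIP Large": "Salesforce/blip-image-captioning-large",
    "BLIP2 + Flan-T5": "Salesforce/blip2-flan-t5-xl",
}


def load_captioner(model_id):
    """Return (processor, model) using the classes that match the checkpoint."""
    # Imported lazily so the harness can be imported without loading transformers.
    from transformers import (
        Blip2ForConditionalGeneration,
        Blip2Processor,
        BlipForConditionalGeneration,
        BlipProcessor,
    )

    # BLIP2 checkpoints need the Blip2* classes; the others use the Blip* classes.
    if "blip2" in model_id:
        processor_cls, model_cls = Blip2Processor, Blip2ForConditionalGeneration
    else:
        processor_cls, model_cls = BlipProcessor, BlipForConditionalGeneration
    return processor_cls.from_pretrained(model_id), model_cls.from_pretrained(model_id)


def compare_captions(image, captioners, max_new_tokens=30):
    """Caption one image with every (processor, model) pair in `captioners`."""
    captions = {}
    for name, (processor, model) in captioners.items():
        inputs = processor(images=image, return_tensors="pt")
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
        captions[name] = processor.decode(output_ids[0], skip_special_tokens=True).strip()
    return captions
```

Building the input with `{name: load_captioner(mid) for name, mid in MODEL_IDS.items()}` loads all three models at once, so on a machine with limited memory it may be preferable to load and caption one model at a time.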

MaryemHadjWannes/ai-image-captioning
