Description
The backend must support receiving images from the frontend, processing them, and passing the extracted content to the LLM (Large Language Model). This enables users to upload images (e.g. screenshots) and receive relevant answers based on the content of those images.
Proposed Solution
HTTP multipart uploads combined with FastAPI's file-handling support should cover the receiving side; a rough endpoint sketch is included below.
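A minimal sketch of such an upload endpoint, assuming the /image-endpoint route from the checklist and local temporary file storage for now; the allowed content types and temp-file handling are assumptions, not an agreed design:

```python
import shutil
import tempfile
from pathlib import Path

from fastapi import FastAPI, File, HTTPException, UploadFile

app = FastAPI()

# Assumed whitelist of accepted image types.
ALLOWED_TYPES = {"image/png", "image/jpeg", "image/webp"}


@app.post("/image-endpoint")
async def upload_image(file: UploadFile = File(...)):
    if file.content_type not in ALLOWED_TYPES:
        raise HTTPException(status_code=415, detail="Unsupported image type")

    # Store the upload in a local temporary file for now (last checklist item);
    # swap this for real storage once the frontend integration is in place.
    suffix = Path(file.filename or "upload").suffix
    with tempfile.NamedTemporaryFile(delete=False, suffix=suffix) as tmp:
        shutil.copyfileobj(file.file, tmp)
        tmp_path = tmp.name

    return {"filename": file.filename, "stored_at": tmp_path}
```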
Task Checklist
- POST /image-endpoint in FastAPI
- Send the image to Azure OpenAI Vision or another image-to-text model (see the pipeline sketch after this list)
- Embed the generated image caption using the existing embedding services
- Store the embedding and metadata in ChromaDB with traceable content
- LLM responds based on both the image content and retrieved contextual documents
- Test with local temporary file storage before integrating with the frontend
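A hedged sketch of the caption-to-embedding-to-ChromaDB flow covered by the middle checklist items. The deployment names ("gpt-4o", "text-embedding-3-small"), the collection name, and the base64 data-URL approach are assumptions and should be adapted to the project's actual Azure OpenAI setup and embedding services:

```python
import base64
import uuid

import chromadb
from openai import AzureOpenAI

# Endpoint and API key are expected via AZURE_OPENAI_ENDPOINT / AZURE_OPENAI_API_KEY.
client = AzureOpenAI(api_version="2024-06-01")
chroma = chromadb.PersistentClient(path="./chroma")
collection = chroma.get_or_create_collection("image_captions")


def ingest_image(image_path: str, source: str) -> str:
    # 1. Ask a vision-capable model to describe the image.
    with open(image_path, "rb") as f:
        data_url = "data:image/png;base64," + base64.b64encode(f.read()).decode()
    caption = client.chat.completions.create(
        model="gpt-4o",  # assumed vision deployment name
        messages=[{
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe the content of this image."},
                {"type": "image_url", "image_url": {"url": data_url}},
            ],
        }],
    ).choices[0].message.content

    # 2. Embed the generated caption (assumed embedding deployment name).
    embedding = client.embeddings.create(
        model="text-embedding-3-small",
        input=caption,
    ).data[0].embedding

    # 3. Store embedding, caption, and metadata in ChromaDB so answers stay traceable.
    doc_id = str(uuid.uuid4())
    collection.add(
        ids=[doc_id],
        embeddings=[embedding],
        documents=[caption],
        metadatas=[{"source": source, "type": "image_caption"}],
    )
    return doc_id
```

At query time, the retriever would then surface these stored captions alongside the other contextual documents so the LLM can answer based on both.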