Skip to content

Add support for arbitary image keys for multi modal queries#74

Merged
KennyVaneetvelde merged 3 commits intoBrainBlend-AI:mainfrom
reinoldus:feature/support-arbitary-text-and-keys-for-multimodal
Feb 2, 2025
Merged

Add support for arbitary image keys for multi modal queries#74
KennyVaneetvelde merged 3 commits intoBrainBlend-AI:mainfrom
reinoldus:feature/support-arbitary-text-and-keys-for-multimodal

Conversation

@reinoldus
Copy link
Contributor

@reinoldus reinoldus commented Feb 2, 2025

Currently the only way to add images is to put them in the "images" key of the InputSchema to send them to the model.
Further in the current implementation we can only have 1 text description for the image, but for my use case the image is just a small part of a larger analysis, so I think it would be better to be able to add the full payload like in the normal text queries instead of silently swallowing the rest.

Not sure if I misunderstand something w.r.t multi-modality, but I don't see any other way for my use case.

Caveats

I had to edit one test for all the tests to pass, because before just the image description was sent to the model not the full dumped json like in the "text only" queries.

TODO:
Found the dev-guide, will fix the issues

@KennyVaneetvelde KennyVaneetvelde merged commit 5697bdf into BrainBlend-AI:main Feb 2, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants