Replies: 3 comments 3 replies
-
Hi @anakin87 (and @julian-risch), I just ran the cookbook notebook โ great job on implementing the multimodal features! Itโs clear a lot of thought went into making this both accessible and powerful! I would like to share some of my personal thoughts and impressions: Implementation
Ideas/Requests
If you have any questions about my feedback or want to discuss any of these points further, please let me know. Iโm genuinely excited to see where this goes! ๐ |
Beta Was this translation helpful? Give feedback.
-
UPDATE We released v0.11.0, including several new multimodal features, mostly focused on indexing and retrieval.
|
Beta Was this translation helpful? Give feedback.
-
All multimodality-related features are released with Haystack 2.16.0. Please use the discussion board from haystack for comments/suggestions ๐ |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Features
Here are the experimental features we've developed so far for multimodal text generation in Haystack. The experiment is ongoing, so expect more in the future.
ImageContent
dataclass, which represents the image content of a userChatMessage
.OpenAIChatGenerator
was extended to support multimodal messages.ChatPromptBuilder
was refactored to also work with string templates, making it easier to support multimodal use cases.Notebook
For an introduction to the new features, check out the ๐ Introduction to Multimodal Text Generation notebook.
It also contains an application using textual retrieval + multimodal generation, and a multimodal Agent.
What's next?
You can follow the progress of the Multimodal experiment in this GitHub issue.
We'll also take care of keeping this discussion up to date.
In the future, you can expect support for more LLM providers, improvements to multimodal indexing and retrieval pipelines, plus the exploration of other interesting directions.
July updates
We introduced several new features, related to indexing and retrieval.
Check out #302 (comment).
Beta Was this translation helpful? Give feedback.
All reactions