@mogith-pn
Contributor

Description

  • The ImageSummarizer tool fails on large PDFs, or PDFs containing many images, because it sends every image in a single predict batch.
  • Changed the ImageSummarizer class to run predictions in batches.
  • Added a metadata identifier so that queries can be filtered during multimodal search.
  • For example:
    res = search.query(filters=[{'input_types': ['text'], 'input_dataset_ids': ['d3'], 'metadata': {'image_summary': 'yes'}}], ranks=[{'text_raw': qstn}])
    In the search query above, the metadata added by the ImageSummarizer pipeline is used to separate image summary text from normal document chunks.
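
The batching change described above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: `iter_batches`, `predict_in_batches`, the `predict_fn` callable, and the default batch size of 32 are all hypothetical names and values chosen for the example, and `predict_fn` stands in for whatever model call the ImageSummarizer actually makes.

```python
from typing import Callable, Iterator, List, Sequence, TypeVar

T = TypeVar("T")
R = TypeVar("R")


def iter_batches(items: Sequence[T], batch_size: int) -> Iterator[Sequence[T]]:
    """Yield consecutive slices of `items`, each with at most `batch_size` elements."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def predict_in_batches(
    items: Sequence[T],
    predict_fn: Callable[[Sequence[T]], List[R]],  # hypothetical stand-in for the model call
    batch_size: int = 32,  # assumed value; the real per-request limit may differ
) -> List[R]:
    """Call `predict_fn` once per batch and concatenate the results in input order."""
    results: List[R] = []
    for batch in iter_batches(items, batch_size):
        results.extend(predict_fn(batch))
    return results
```

With this shape, a PDF that yields 70 images would produce three predict calls (32, 32, and 6 inputs) instead of one oversized request.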

https://clarifai.atlassian.net/browse/DEVX-870

Contributor

@sanjaychelliah sanjaychelliah left a comment

👍

@mogith-pn mogith-pn merged commit 876a848 into main Jan 8, 2025
10 checks passed

3 participants