File tree Expand file tree Collapse file tree 2 files changed +12
-4
lines changed
haystack_experimental/components Expand file tree Collapse file tree 2 files changed +12
-4
lines changed Original file line number Diff line number Diff line change @@ -87,9 +87,16 @@ def __init__(
8787 :param root_path: The root directory path where document files are located. If provided, file paths in
8888 document metadata will be resolved relative to this path. If None, file paths are treated as absolute paths.
8989 :param model:
90- The Sentence Transformers model to use for calculating embeddings. To be used with this component,
91- the model must be able to embed images and text into the same vector space.
92- Pass a local path or ID of the model on Hugging Face.
90+ The Sentence Transformers model to use for calculating embeddings. Pass a local path or ID of the model on
91+ Hugging Face. To be used with this component, the model must be able to embed images and text into the same
92+ vector space. Compatible models include:
93+ - "sentence-transformers/clip-ViT-B-32"
94+ - "sentence-transformers/clip-ViT-L-14"
95+ - "sentence-transformers/clip-ViT-B-16"
96+ - "sentence-transformers/clip-ViT-B-32-multilingual-v1"
97+ - "jinaai/jina-embeddings-v4"
98+ - "jinaai/jina-clip-v1"
99+ - "jinaai/jina-clip-v2".
93100 :param device:
94101 The device to use for loading the model.
95102 Overrides the default device.
Original file line number Diff line number Diff line change @@ -100,7 +100,8 @@ def __init__(
100100 Initialize the LLMDocumentContentExtractor component.
101101
102102 :param chat_generator: A ChatGenerator instance representing the LLM used to extract text. This generator must
103- support vision-based input and return a plain text response.
103+ support vision-based input and return a plain text response. Currently, the experimental versions of
104+ OpenAIChatGenerator and AmazonBedrockChatGenerator are supported.
104105 :param prompt: Instructional text provided to the LLM. It must not contain Jinja variables.
105106 The prompt should only contain instructions on how to extract the content of the image-based document.
106107 :param file_path_meta_field: The metadata field in the Document that contains the file path to the image or PDF.
You can’t perform that action at this time.
0 commit comments