Add AI Agent PDF Input Info #336

pinkeshmars · 2025-05-05T06:49:01Z

Description

Add AI Agent PDF Input Info

Linear ticket and magic word Fixes DEVR-898

Type of change

bolt-new-by-stackblitz · 2025-05-05T06:49:04Z

Run & review this pull request in StackBlitz Codeflow.

michael-mcroskey · 2025-05-05T07:55:45Z

Looks mostly good! One small point of clarification that @MaggieThomann will investigate (added her as a reviewer) — I believe some AI models require text parts in all messages (so can't just upload a photo for example, needs to include text). This has been a point of confusion for users so I think it's worth calling out. I'll let Maggie take a look.

Thanks! 🙏

MaggieThomann · 2025-05-05T08:14:15Z

docs/ff-integrations/ai/ai-agents.md


-Selecting multiple input types makes it easier for users to clearly communicate what they need. Instead of relying only on text descriptions, users can combine inputs—for example, uploading an image along with text to better illustrate their queries and help the agent provide more accurate responses.
+Selecting multiple input types makes it easier for users to clearly communicate what they need. Instead of relying only on text descriptions, users can combine inputs. For instance, in an AI Stylist agent, enabling both Text and Image allows users to either describe their outfits in words or upload clothing photos for personalized analysis.


Can you add a caveat here that users typically need to include something in the text prompt to the agent?

or upload clothing photos for personalized analysis.

^ Basically, it would technically not be sufficient to do this in the OpenAI or Google cases. They would also need to put something in the "Text input" field in the "Send Message" action. Otherwise the agent will complain. Anthropic functions differently though. Anthropic is OK with just accepting the image.

Actually correction -- all vendors require that Text is passed. Seeing this error from Anthropic now:

{ "error": { "details": { "message": "400 {\"type\":\"error\",\"error\":{\"type\":\"invalid_request_error\",\"message\":\"messages: text content blocks must be non-empty\"}}", "details": "Error: 400 {\"type\":\"error\",\"error\":{\"type\":\"invalid_request_error\",\"message\":\"messages: text content blocks must be non-empty\"}}" }, "message": "Error running assistant", "status": "INTERNAL" } }

Ok sorry for the back and forth 😅 @pinkeshmars

I'm going to make a change to the agent that will allow the user the ability to just send an image in their request by defaulting the text parameter that we send in the cloud function code to an empty string. This way, the API won't complain and the user can configure the agent with just image. So feel free to keep this language as is.

cool. Thanks for the clarification @MaggieThomann

pinkeshmars · 2025-05-06T17:27:14Z

Hi @MaggieThomann could you please approve this so that we can merge?

pinkeshmars · 2025-05-13T18:02:00Z

Hi @MaggieThomann if it looks good to you, could you please approve?

PoojaB26

I'm approving this since this is already released, but @MaggieThomann let us know if there were any additional comments.

Add AI Agent PDF Input Info

0378b08

pinkeshmars requested review from PoojaB26 and michael-mcroskey May 5, 2025 06:49

michael-mcroskey requested a review from MaggieThomann May 5, 2025 07:53

MaggieThomann reviewed May 5, 2025

View reviewed changes

Merge branch 'main' into feature/ai-agent-pdf-input

9ea65df

pinkeshmars added 2 commits May 6, 2025 22:59

Merge branch 'main' into feature/ai-agent-pdf-input

72c5122

Merge branch 'main' into feature/ai-agent-pdf-input

84c6e58

Merge branch 'main' into feature/ai-agent-pdf-input

11410e0

PoojaB26 approved these changes May 21, 2025

View reviewed changes

PoojaB26 merged commit f05cb25 into main May 21, 2025
1 check passed

PoojaB26 deleted the feature/ai-agent-pdf-input branch May 21, 2025 10:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add AI Agent PDF Input Info #336

Add AI Agent PDF Input Info #336

pinkeshmars commented May 5, 2025

Uh oh!

bolt-new-by-stackblitz bot commented May 5, 2025

Uh oh!

michael-mcroskey commented May 5, 2025

Uh oh!

MaggieThomann May 5, 2025 •

edited

Loading

Uh oh!

MaggieThomann May 5, 2025

Uh oh!

MaggieThomann May 5, 2025

Uh oh!

pinkeshmars May 5, 2025

Uh oh!

pinkeshmars commented May 6, 2025

Uh oh!

pinkeshmars commented May 13, 2025

Uh oh!

PoojaB26 left a comment

Uh oh!

Uh oh!

Uh oh!


		Selecting multiple input types makes it easier for users to clearly communicate what they need. Instead of relying only on text descriptions, users can combine inputs—for example, uploading an image along with text to better illustrate their queries and help the agent provide more accurate responses.
		Selecting multiple input types makes it easier for users to clearly communicate what they need. Instead of relying only on text descriptions, users can combine inputs. For instance, in an AI Stylist agent, enabling both Text and Image allows users to either describe their outfits in words or upload clothing photos for personalized analysis.

Add AI Agent PDF Input Info #336

Add AI Agent PDF Input Info #336

Conversation

pinkeshmars commented May 5, 2025

Description

Type of change

Uh oh!

bolt-new-by-stackblitz bot commented May 5, 2025

Uh oh!

michael-mcroskey commented May 5, 2025

Uh oh!

MaggieThomann May 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

MaggieThomann May 5, 2025

Choose a reason for hiding this comment

Uh oh!

MaggieThomann May 5, 2025

Choose a reason for hiding this comment

Uh oh!

pinkeshmars May 5, 2025

Choose a reason for hiding this comment

Uh oh!

pinkeshmars commented May 6, 2025

Uh oh!

pinkeshmars commented May 13, 2025

Uh oh!

PoojaB26 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

MaggieThomann May 5, 2025 •

edited

Loading