Skip to content

Commit f05cb25

Browse files
Add AI Agent PDF Input Info (#336)
Co-authored-by: Pooja Bhaumik <[email protected]>
1 parent a9df82f commit f05cb25

File tree

1 file changed

+7
-6
lines changed

1 file changed

+7
-6
lines changed

docs/ff-integrations/ai/ai-agents.md

Lines changed: 7 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -76,14 +76,15 @@ You can obtain your OpenAI API key from [**OpenAI API Keys**](https://platform.o
7676

7777
#### Request Options
7878

79-
Here, you specify the type of inputs users can send to the AI.
79+
Define the types of inputs users can send to the AI agent. You can enable one or more of the following options:
8080

81-
- **Text**: Allows users to send text-based messages.
82-
- **Image**: Enables image input, allowing the agent to analyze photos.
83-
- **Audio**: (Google Agent only) Allows to send audio messages or voice inputs.
84-
- **Video**: (Google Agent only) Allows users to send short video clips to analyze.
81+
- **Text**: Allows users to send written messages, questions, or prompts.
82+
- **Image**: Enables users to upload photos for the AI to analyze visual content, such as objects, styles, or scenes.
83+
- **PDF** (Anthropic and Google Agent only): Lets users submit PDF documents, allowing the AI to extract and interpret information from files like resumes, reports, or forms.
84+
- **Audio** (Google Agent only): Supports voice input, enabling users to record or upload audio clips for transcription, sentiment analysis, or voice-based commands.
85+
- **Video** (Google Agent only): Allows users to submit video files, enabling the AI to analyze visual elements.
8586

86-
Selecting multiple input types makes it easier for users to clearly communicate what they need. Instead of relying only on text descriptions, users can combine inputs—for example, uploading an image along with text to better illustrate their queries and help the agent provide more accurate responses.
87+
Selecting multiple input types makes it easier for users to clearly communicate what they need. Instead of relying only on text descriptions, users can combine inputs. For instance, in an AI Stylist agent, enabling both Text and Image allows users to either describe their outfits in words or upload clothing photos for personalized analysis.
8788

8889
#### Response Options
8990

0 commit comments

Comments
 (0)