You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/ff-integrations/ai/ai-agents.md
+7-6Lines changed: 7 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -76,14 +76,15 @@ You can obtain your OpenAI API key from [**OpenAI API Keys**](https://platform.o
76
76
77
77
#### Request Options
78
78
79
-
Here, you specify the type of inputs users can send to the AI.
79
+
Define the types of inputs users can send to the AI agent. You can enable one or more of the following options:
80
80
81
-
-**Text**: Allows users to send text-based messages.
82
-
-**Image**: Enables image input, allowing the agent to analyze photos.
83
-
-**Audio**: (Google Agent only) Allows to send audio messages or voice inputs.
84
-
-**Video**: (Google Agent only) Allows users to send short video clips to analyze.
81
+
-**Text**: Allows users to send written messages, questions, or prompts.
82
+
-**Image**: Enables users to upload photos for the AI to analyze visual content, such as objects, styles, or scenes.
83
+
-**PDF** (Anthropic and Google Agent only): Lets users submit PDF documents, allowing the AI to extract and interpret information from files like resumes, reports, or forms.
84
+
-**Audio** (Google Agent only): Supports voice input, enabling users to record or upload audio clips for transcription, sentiment analysis, or voice-based commands.
85
+
-**Video** (Google Agent only): Allows users to submit video files, enabling the AI to analyze visual elements.
85
86
86
-
Selecting multiple input types makes it easier for users to clearly communicate what they need. Instead of relying only on text descriptions, users can combine inputs—for example, uploading an image along with text to better illustrate their queries and help the agent provide more accurate responses.
87
+
Selecting multiple input types makes it easier for users to clearly communicate what they need. Instead of relying only on text descriptions, users can combine inputs. For instance, in an AI Stylist agent, enabling both Text and Image allows users to either describe their outfits in words or upload clothing photos for personalized analysis.
0 commit comments