Merge branch 'main' into DDOC-1453-iframe-for-chromium

anastasiaberyoza · web-flow · commit 2233d36c8989 · 2025-11-14T11:28:27.000+01:00
diff --git a/content/guides/box-ai/ai-tutorials/extract-metadata-structured.md b/content/guides/box-ai/ai-tutorials/extract-metadata-structured.md
@@ -20,6 +20,30 @@ and get the result in the form of key-value pairs.
 As input, you can either create a structure using the `fields` parameter, or use an already defined metadata template.
 To learn more about creating templates, see [Creating metadata templates in the Admin Console][templates-console] or use the [metadata template API][templates-api].
 
+## Supported file formats
+
+The endpoint supports the following file formats:
+
+- PDF
+- TIFF
+- PNG
+- JPEG
+
+Box AI automatically applies optical character recognition (OCR) when processing image files (TIFF, PNG, JPEG) and scanned documents. This eliminates the need to convert images to PDF before extraction, saving time and simplifying your integration.
+
+## Supported languages
+
+Box AI can extract metadata from documents in the following languages:
+<!--alex ignore-->
+- English
+- Japanese
+- Chinese
+- Korean
+<!--alex enable-->
+- Cyrillic-based languages (such as Russian, Ukrainian, Bulgarian, and Serbian)
+
+No additional configuration is required to use different languages or image formats. Box AI automatically detects the language and applies OCR when needed.
+
 ## Before you start
 
 Make sure you followed the steps listed in [getting started with Box AI][prereq] to create a platform app and authenticate.
diff --git a/content/guides/box-ai/ai-tutorials/extract-metadata.md b/content/guides/box-ai/ai-tutorials/extract-metadata.md
@@ -14,9 +14,13 @@ alias_paths:
 
 # Extract metadata from file (freeform)
 
-Box AI API allows you to query a document or image and extract metadata based on a provided prompt.
+Box AI API allows you to query a document and extract metadata based on a provided prompt.
 **Freeform** means that the prompt can include a stringified version of formats such as JSON or XML, or even plain text.
 
+<Message type="notice">
+The **Extract metadata (freeform)** endpoint doesn't support OCR. To extract metadata from image files (TIFF, PNG, JPEG) or documents in languages other than English, use the [Extract metadata (structured)][structured-endpoint] endpoint.
+</Message>
+
 ## Before you start
 
 Make sure you followed the steps listed in [getting started with Box AI][prereq] to create a platform app and authenticate.
@@ -151,4 +155,5 @@ The response includes the `fields` present in the file, along with their values:
 [agent]: e://get_ai_agent_default
 [model-param]: r://ai_agent_text_gen#param_basic_gen_model
 [prompt-param]: r://ai_agent_text_gen#param_basic_gen_prompt_template
-[overrides]: g://box-ai/ai-agents/ai-agent-overrides
+[overrides]: g://box-ai/ai-agents/ai-agent-overrides
+[structured-endpoint]: g://box-ai/ai-tutorials/extract-metadata-structured