Add video segmentation into field extraction notebook (#94)

chienyuanchang · web-flow · commit 9bd11c548295 · 2025-08-19T05:16:27.000-07:00
diff --git a/analyzer_templates/marketing_video_segmenation_auto.json b/analyzer_templates/marketing_video_segmenation_auto.json
@@ -0,0 +1,33 @@
+{
+  "description": "Sample marketing video analyzer",
+  "baseAnalyzerId": "prebuilt-videoAnalyzer",
+  "config": {
+    "returnDetails": true,
+    "segmentationMode": "auto"
+  },
+  "fieldSchema": {
+    "fields": {
+      "Segments": {
+        "type": "array",
+        "items": {
+          "type": "object",
+          "properties": {
+            "Description": {
+              "type": "string",
+              "description": "Detailed summary of the video segment, focusing on product characteristics, lighting, and color palette."
+            },
+            "Sentiment": {
+              "type": "string",
+              "method": "classify",
+              "enum": [
+                "Positive",
+                "Neutral",
+                "Negative"
+              ]
+            }
+          }
+        }
+      }
+    }
+  }
+}
diff --git a/analyzer_templates/marketing_video_segmenation_custom.json b/analyzer_templates/marketing_video_segmenation_custom.json
@@ -0,0 +1,34 @@
+{
+  "description": "Sample marketing video analyzer",
+  "baseAnalyzerId": "prebuilt-videoAnalyzer",
+  "config": {
+    "returnDetails": true,
+    "segmentationMode": "custom",
+    "segmentationDefinition": "Segment the video at each clear narrative or visual transition that introduces a new marketing message, speaker, or brand moment. Segments should begin when there is a change in speaker, a shift in visual theme (e.g., logos, product shots, data center views, simulation footage, aircraft scenes), or the introduction of a new key message (e.g., quality of data, scale of infrastructure, customer benefit, real-world aviation use). Each segment should capture one distinct marketing idea or value point, ending when the focus transitions to the next theme."
+  },
+  "fieldSchema": {
+    "fields": {
+      "Segments": {
+        "type": "array",
+        "items": {
+          "type": "object",
+          "properties": {
+            "Description": {
+              "type": "string",
+              "description": "Detailed summary of the video segment, focusing on product characteristics, lighting, and color palette."
+            },
+            "Sentiment": {
+              "type": "string",
+              "method": "classify",
+              "enum": [
+                "Positive",
+                "Neutral",
+                "Negative"
+              ]
+            }
+          }
+        }
+      }
+    }
+  }
+}
diff --git a/notebooks/field_extraction.ipynb b/notebooks/field_extraction.ipynb
@@ -149,8 +149,6 @@
    "metadata": {},
    "outputs": [],
    "source": [
-    "import json\n",
-    "\n",
     "analyzer_template_path = '../analyzer_templates/invoice.json'\n",
     "with open(analyzer_template_path, 'r') as f:\n",
     "    template_content = json.load(f)\n",
@@ -649,7 +647,28 @@
     "\n",
     "Let's analyze a marketing video to extract descriptions, sentiment, and key insights that could be valuable for content understanding and marketing analytics.\n",
     "\n",
-    "Marketing video analytics template:"
+    "Content Understanding offers three ways to slice a video, letting you get the output you need for whole videos or short clips. You can use these options by setting the `segmentationMode` property on a custom analyzer.\n",
+    "- Whole-video – `\"segmentationMode\": \"noSegmentation\"` The service treats the entire video file as a single segment and extracts metadata across its full duration.  \n",
+    "  Example:\n",
+    "    - Compliance checks that look for specific brand-safety issues anywhere in an ad\n",
+    "    - full-length descriptive summaries\n",
+    "- Automatic segmentation – `\"segmentationMode\": \"auto\"` The service analyzes the timeline and breaks it up for you. Groups successive shots into coherent scenes, capped at one minute each.  \n",
+    "  Example:\n",
+    "    - Create storyboards from a show\n",
+    "    - Inserting mid-roll ads at logical pauses.\n",
+    "- Custom segmentation – `\"segmentationMode\": \"custom\"` You describe the logic in natural language and the model creates segments to match. Set `segmentationDefinition` with a string describing how you'd like the video to be segmented. Custom allows segments of varying length from seconds to minutes depending on the prompt.  \n",
+    "  Example:\n",
+    "    - Break a news broadcast up into stories."
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### 6-1 Analyze without Segmentation\n",
+    "\n",
+    "In this example, we analyze a marketing video without segmentation.\n",
+    "- Please set `segmentationMode` to `noSegmentation` in the analyzer schema `config` to process the entire video as one segment."
    ]
   },
   {
@@ -695,7 +714,150 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "Marketing video analysis result:"
+    "Marketing video analysis result\n",
+    "- The result is generated from the content of the entire video."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "print(json.dumps(result_json, indent=2))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Clean up marketing video analyzer\n",
+    "\n",
+    "Note: In production environments, you would typically keep analyzers for reuse rather than deleting them"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "client.delete_analyzer(video_analyzer_id)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### 6-2 Analyze With Automatic Segmentation\n",
+    "\n",
+    "In this example, we use automatic segmentation for marketing video analytics.  \n",
+    "- Please set `segmentationMode` to `auto` in the analyzer schema `config` to enable automatic segmentation."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "analyzer_template_path = '../analyzer_templates/marketing_video_segmenation_auto.json'\n",
+    "with open(analyzer_template_path, 'r') as f:\n",
+    "    template_content = json.load(f)\n",
+    "    print(json.dumps(template_content, indent=2))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Create and run marketing video analyzer"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "sample_file_path = '../data/FlightSimulator.mp4'\n",
+    "video_analyzer_id = \"marketing-video-analytics-\" + str(uuid.uuid4())\n",
+    "\n",
+    "print(f\"Creating marketing video analyzer: {video_analyzer_id}\")\n",
+    "response = client.begin_create_analyzer(video_analyzer_id, analyzer_template_path=analyzer_template_path)\n",
+    "result = client.poll_result(response)\n",
+    "print(\"✅ Marketing video analyzer created successfully!\")\n",
+    "\n",
+    "print(f\"Analyzing marketing video: {sample_file_path}\")\n",
+    "print(\"⏳ Note: Video analysis may take significantly longer than document analysis...\")\n",
+    "response = client.begin_analyze(video_analyzer_id, file_location=sample_file_path)\n",
+    "result_json = client.poll_result(response)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Marketing video analysis result\n",
+    "- The output includes automatically segmented clips with descriptions in the markdown content.  \n",
+    "- The analyzer generates the fields defined in the schema separately for each segment."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "print(json.dumps(result_json, indent=2))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Clean up marketing video analyzer\n",
+    "\n",
+    "Note: In production environments, you would typically keep analyzers for reuse rather than deleting them"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "client.delete_analyzer(video_analyzer_id)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### 6-3 Analyze With Custom Segmentation\n",
+    "\n",
+    "In this example, we use custom segmentation for marketing video analytics.  \n",
+    "- Please set `segmentationMode` to `custom`.  \n",
+    "- Provide a `segmentationDefinition` string describing how you would like the video to be segmented."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "analyzer_template_path = '../analyzer_templates/marketing_video_segmenation_custom.json'\n",
+    "with open(analyzer_template_path, 'r') as f:\n",
+    "    template_content = json.load(f)\n",
+    "    print(json.dumps(template_content, indent=2))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Create and run marketing video analyzer"
    ]
   },
   {
@@ -704,7 +866,36 @@
    "metadata": {},
    "outputs": [],
    "source": [
+    "sample_file_path = '../data/FlightSimulator.mp4'\n",
+    "video_analyzer_id = \"marketing-video-analytics-\" + str(uuid.uuid4())\n",
     "\n",
+    "print(f\"Creating marketing video analyzer: {video_analyzer_id}\")\n",
+    "response = client.begin_create_analyzer(video_analyzer_id, analyzer_template_path=analyzer_template_path)\n",
+    "result = client.poll_result(response)\n",
+    "print(\"✅ Marketing video analyzer created successfully!\")\n",
+    "\n",
+    "print(f\"Analyzing marketing video: {sample_file_path}\")\n",
+    "print(\"⏳ Note: Video analysis may take significantly longer than document analysis...\")\n",
+    "response = client.begin_analyze(video_analyzer_id, file_location=sample_file_path)\n",
+    "result_json = client.poll_result(response)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Marketing video analysis result\n",
+    "- The video is segmented according to your custom definition, with segment descriptions included in the markdown content.  \n",
+    "- The segmentation may differ from automatic segmentation results.  \n",
+    "- The analyzer generates the fields defined in the schema separately for each segment."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
     "print(json.dumps(result_json, indent=2))"
    ]
   },