docs: Reorder classification methods to prioritize multimodal page-level as default

Bob Strahan · Bob Strahan · commit e9455ed99be0 · 2025-09-11T22:44:38.000Z
diff --git a/docs/classification.md b/docs/classification.md
@@ -20,66 +20,7 @@ The solution supports multiple classification approaches that vary by pattern:
 
 Pattern 2 offers two main classification approaches, configured through different templates:
 
-#### Text-Based Holistic Classification (Default)
-
-- Analyzes entire document packets to identify logical boundaries
-- Identifies distinct document segments within multi-page documents
-- Determines document type for each segment
-- Better suited for multi-document packets where context spans multiple pages
-- Deployed when you select the default pattern-2 configuration during stack deployment or update
-
-The default configuration in `config_library/pattern-2/default/config.yaml` implements this approach with a task prompt that instructs the model to:
-
-1. Read through the entire document package to understand its contents
-2. Identify page ranges that form complete, distinct documents
-3. Match each document segment to one of the defined document types
-4. Record the start and end pages for each identified segment
-
-Example configuration:
-
-```yaml
-classification:
-  classificationMethod: textbasedHolisticClassification
-  model: us.amazon.nova-pro-v1:0
-  task_prompt: >-
-    <task-description>
-    You are a document classification system. Your task is to analyze a document package 
-    containing multiple pages and identify distinct document segments, classifying each 
-    segment according to the predefined document types provided below.
-    </task-description>
-
-    <document-types>
-    {CLASS_NAMES_AND_DESCRIPTIONS}
-    </document-types>
-
-    <document-boundary-rules>
-    Rules for determining document boundaries:
-    - Content continuity: Pages with continuing paragraphs, numbered sections, or ongoing narratives belong to the same document
-    - Visual consistency: Similar layouts, headers, footers, and styling indicate pages belong together
-    - Logical structure: Documents typically have clear beginning, middle, and end sections
-    - New document indicators: Title pages, cover sheets, or significantly different subject matter signal a new document
-    </document-boundary-rules>
-
-    <<CACHEPOINT>>
-
-    <document-text>
-    {DOCUMENT_TEXT}
-    </document-text>
-  ```
-
-## Limitations of Text-Based Holistic Classification
-
-Despite its strengths in handling full-document context, this method has several limitations:
-
-**Context & Model Constraints:**: 
-- Long documents can exceed the context window of smaller models, resulting in request failure.
-- Lengthy inputs may dilute the model’s focus, leading to inaccurate or inconsistent classifications.
-- Requires high-context models such as Amazon Nova Premier, which supports up to 1 million tokens. Smaller models are not suitable for this method.
-- For more details on supported models and their context limits, refer to the [Amazon Bedrock Supported Models documentation](https://docs.aws.amazon.com/bedrock/latest/userguide/models-supported.html).
-
-**Scalability Challenges**: Not ideal for very large or visually complex document sets. In such cases, the Multi-Modal Page-Level Classification method is more appropriate.
-
-#### MultiModal Page-Level Classification with Sequence Segmentation
+#### MultiModal Page-Level Classification with Sequence Segmentation (default)
 
 - Classifies each page independently using both text and image data
 - **Uses sequence segmentation with BIO-like tagging for document boundary detection**
@@ -141,6 +82,64 @@ The boundary detection is automatically included in the classification results.
   }
 }
 ```
+#### Text-Based Holistic Classification
+
+- Analyzes entire document packets to identify logical boundaries
+- Identifies distinct document segments within multi-page documents
+- Determines document type for each segment
+- Better suited for multi-document packets where context spans multiple pages
+- Deployed when you select the default pattern-2 configuration during stack deployment or update
+
+The default configuration in `config_library/pattern-2/default/config.yaml` implements this approach with a task prompt that instructs the model to:
+
+1. Read through the entire document package to understand its contents
+2. Identify page ranges that form complete, distinct documents
+3. Match each document segment to one of the defined document types
+4. Record the start and end pages for each identified segment
+
+Example configuration:
+
+```yaml
+classification:
+  classificationMethod: textbasedHolisticClassification
+  model: us.amazon.nova-pro-v1:0
+  task_prompt: >-
+    <task-description>
+    You are a document classification system. Your task is to analyze a document package 
+    containing multiple pages and identify distinct document segments, classifying each 
+    segment according to the predefined document types provided below.
+    </task-description>
+
+    <document-types>
+    {CLASS_NAMES_AND_DESCRIPTIONS}
+    </document-types>
+
+    <document-boundary-rules>
+    Rules for determining document boundaries:
+    - Content continuity: Pages with continuing paragraphs, numbered sections, or ongoing narratives belong to the same document
+    - Visual consistency: Similar layouts, headers, footers, and styling indicate pages belong together
+    - Logical structure: Documents typically have clear beginning, middle, and end sections
+    - New document indicators: Title pages, cover sheets, or significantly different subject matter signal a new document
+    </document-boundary-rules>
+
+    <<CACHEPOINT>>
+
+    <document-text>
+    {DOCUMENT_TEXT}
+    </document-text>
+  ```
+
+## Limitations of Text-Based Holistic Classification
+
+Despite its strengths in handling full-document context, this method has several limitations:
+
+**Context & Model Constraints:**: 
+- Long documents can exceed the context window of smaller models, resulting in request failure.
+- Lengthy inputs may dilute the model’s focus, leading to inaccurate or inconsistent classifications.
+- Requires high-context models such as Amazon Nova Premier, which supports up to 1 million tokens. Smaller models are not suitable for this method.
+- For more details on supported models and their context limits, refer to the [Amazon Bedrock Supported Models documentation](https://docs.aws.amazon.com/bedrock/latest/userguide/models-supported.html).
+
+**Scalability Challenges**: Not ideal for very large or visually complex document sets. In such cases, the Multi-Modal Page-Level Classification method is more appropriate.
 
 ### Pattern 3: UDOP-Based Classification