Platform: Platinum/VLM strategy (#319)

Paul-Cornell · web-flow · commit e7e0cfbae95e · 2024-11-13T07:56:49.000-08:00
diff --git a/faq/faq.mdx b/faq/faq.mdx
@@ -46,23 +46,6 @@ When you log in to the Serverless API dashboard, you can access your API keys by
 Under the `Actions` column, click the `Copy` icon to copy the key or an example code snippet to process the documents
 using the Unstructured REST API, or the [Unstructured Ingest CLI](/ingestion/overview#unstructured-ingest-cli), or the [Unstructured Python SDK](https://github.com/Unstructured-IO/unstructured-python-client) or [Unstructured JavaScript/TypeScript SDK](https://github.com/Unstructured-IO/unstructured-js-client).
 
-### What is the new Unstructured API pricing structure?
-
-We offer a clear and straightforward pay-per-page pricing model, giving you full control and predictability over your
-document preprocessing costs. We have two different document processing strategies:
-
-- **Fast Strategy**: $1 per 1000 pages processed
-
-    Designed for low-latency use cases, Fast Strategy is for file types other than PDF or images, where we can capitalize on the structure of the document to exact and classify the text. This strategy uses rules-based parsers to deliver fast and cost-effective processing to render natural language in structured JSON.
-
-- **Hi-Res Strategy**: $10 per 1000 pages processed
-
-    Best for complex file types like PDF and JPEG or documents with images, forms, and tables. This strategy uses AI models to understand document layouts and render their contents in structured JSON.
-
-import SharedPagesBilling from '/snippets/general-shared-text/pages-billing.mdx';
-
-<SharedPagesBilling />
-
 ### How can I check my balance and make the payment?
 
 Your usage information can be found on your API dashboard. Click the `Usage` link on the side navigation. You can view your usage by date, including detailed information, such as the number of requests, total pages processed by fast and hi-res strategy, and total cost.
diff --git a/platform/partitioning.mdx b/platform/partitioning.mdx
@@ -19,26 +19,12 @@ To choose one of these strategies, select one of the **Partition Strategy** opti
 
 <Note>You can change a workflow's predefined strategy only through [Custom](/platform/workflows#create-a-custom-workflow) workflow settings.</Note>
 
-- **Auto**: This strategy leaves the choice of using **High Res** or **Fast** to Unstructured to determine on a file-by-file basis as it goes along. 
-  Unstructured will use **High Res** if it can determine that the current file under analysis is an image file or a PDF file with embedded images or tables. 
-  Otherwise, Unstructured will use **Fast** on the current file. 
-  You should choose this strategy if you know that all of the files are a combination of:
-
-  - At least one image file; or at least one PDF file with embedded images or tables in it; and any number of other kinds of files.
-
-  Choosing **Auto** can be an effective choice with a reasonable balance of speed, cost, and quality 
-  when you have a mixture of these types of files.
-
-- **Fast**: This strategy is rule-based. It is faster and cheaper than **High Res** but might provide lower-quality resolution. 
-  You should choose this strategy if you know that:
-
-  - You have only PDF files, and you know that none of them have embedded images or tables in them, or 
-  - You have no PDF files or image files at all.
-
-- **Hi-res**: This strategy uses an image-to-text model for inference. It is slower and costlier than **Fast** but can provide 
-  higher-quality resolution. You should choose this strategy if you know that:
-
-  - All of the files are only image files, or
-  - All of the files are only PDF files, and they have embedded images or tables in them, or
-  - All of the files are a combination of only these two kinds of files.
+- **Fast**: This strategy is ideal for simple, text-based documents.
+- **Hi-Res**: This strategy is best for PDFs, images, and complex file types.
+- **VLM**: For your most challenging documents, including scanned and handwritten content, use this strategy, which leverages vision 
+  language models (VLMs). During processing, files that are not PDFs or images are processed by using the **Hi-Res** strategy and are charged 
+  at the **Hi-Res** rate instead.
+- **Auto**: This strategy examines each file before processing it. If the file is an image, or if the file is a PDF and at least one embedded table 
+  or image is found in it, **Hi-Res** is used to process that file and charged at the **Hi-Res** rate for that file. Otherwise, **Fast** is used and charged at the 
+  **Fast** rate for that file.
 
diff --git a/platform/workflows.mdx b/platform/workflows.mdx
@@ -52,8 +52,10 @@ To create an automatic workflow:
 7. Click **Continue**.
 8. In the **Optimize for** section, select the option to choose one of these predefined workflow settings groups:
 
-   - **Basic** is a good choice if you have text-only documents that have no images or tables in them.
-   - **Advanced** is a good choice if you have complex documents that have images or tables or both in them.
+   - **Basic** Ideal for simple, text-only documents.
+   - **Advanced** Best for PDFs, images, and complex file types.
+   - **Platinum** For your most challenging documents, including scanned and handwritten content. It uses vision language models (VLMs). 
+     During processing, files that are not PDFs or images are processed by using the **Advanced** strategy and are charged at the **Advanced** rate instead.
 
 9. The **Reprocess all** box applies only to the Amazon S3 and Azure Blob Storage source connectors:
 
@@ -78,7 +80,7 @@ To create an automatic workflow:
 
 There are two ways to create a custom workflow:
 
-- Through [Build it with me > Custom](#build-it-with-me-custom). This option enables you to fine-tune the kinds of settings that are in **Basic** and **Advanced**.
+- Through [Build it with me > Custom](#build-it-with-me-custom). This option enables you to fine-tune the kinds of settings that are in **Basic**, **Advanced**, and **Platinum**.
 - Through [Build it myself](#build-it-myself). This option offers a visual workflow designer with even more fine-tuning than the **Custom** option.  
 
 #### Build it with me - Custom
@@ -106,9 +108,13 @@ There are two ways to create a custom workflow:
 
 9. In the **Strategy** area, choose one of the following:
 
-    - **Fast**: This strategy uses traditional NLP extraction techniques to quickly pull in all text elements. This strategy is not good for image-based file types or files with images or tables in them.
-    - **Hi-res**: This strategy uses the document layout to gain additional information about document elements. This strategy is good for image-based file types and files with images or tables in them. This strategy is recommended if your use case is highly sensitive to correct classification for document elements. 
-    - **Auto**: This strategy chooses the partitioning strategy on a file-by-file basis, depending on detected document characteristics.
+    - **Fast**: Ideal for simple, text-only documents.
+    - **Hi-Res**: Best for PDFs, images, and complex file types.
+    - **VLM**: For your most challenging documents, including scanned and handwritten content. It uses vision language models (VLMs). 
+      During processing, files that are not PDFs or images are processed by using the **Hi-Res** strategy and are charged at the **Hi-Res** rate instead.
+    - **Auto**: This strategy examines each file before processing it. If the file is an image, or if the file is a PDF and at least one embedded table 
+      or image is found in it, **Hi-Res** is used to process that file and charged at the **Hi-Res** rate for that file. Otherwise, **Fast** is used and charged at the 
+      **Fast** rate for that file.
 
     [Learn more](/platform/partitioning).
 
@@ -259,9 +265,13 @@ There are two ways to create a custom workflow:
     <Accordion title="Partitioner node">
         For **Partition Strategy**, choose one of the following:
 
-        - **Auto**: This strategy chooses the partitioning strategy on a file-by-file basis, depending on detected document characteristics.    
-        - **Fast**: This strategy uses traditional NLP extraction techniques to quickly pull in all text elements. This strategy is not good for image-based file types or files with images or tables in them.
-        - **Hi-res**: This strategy uses the document layout to gain additional information about document elements. This strategy is good for image-based file types and files with images or tables in them. This strategy is recommended if your use case is highly sensitive to correct classification for document elements. 
+        - **Fast**: Ideal for simple, text-only documents.  
+        - **Hi-Res**: Best for PDFs, images, and complex file types.
+        - **VLM**: For your most challenging documents, including scanned and handwritten content. It uses vision language models (VLMs). 
+          During processing, files that are not PDFs or images are processed by using the **Hi-Res** strategy and are charged at the **Hi-Res** rate instead.
+        - **Auto**: This strategy examines each file before processing it. If the file is an image, or if the file is a PDF and at least one embedded table 
+          or image is found in it, **Hi-Res** is used to process that file and charged at the **Hi-Res** rate for that file. Otherwise, **Fast** is used and charged at the 
+          **Fast** rate for that file.
 
         [Learn more](/platform/partitioning).
     </Accordion>
diff --git a/snippets/quickstarts/platform.mdx b/snippets/quickstarts/platform.mdx
@@ -74,8 +74,10 @@ allowfullscreen
         7. Click **Continue**.
         8. In the **Optimize for** section, select the option to choose one of these predefined workflow settings groups:
 
-            - **Basic** is a good choice if you have text-only documents that have no images or tables in them.
-            - **Advanced** is a good choice if you have complex documents that have images or tables or both in them.
+            - **Basic**: Ideal for simple, text-only documents.
+            - **Advanced**: Best for PDFs, images, and complex file types.
+            - **Platinum**: For your most challenging documents, including scanned and handwritten content. It uses vision language models (VLMs). 
+              During processing, files that are not PDFs or images are processed by using the **Advanced** strategy and are charged at the **Advanced** rate instead.
 
         9. The **Reprocess all** box applies only to the Amazon S3 and Azure Blob Storage source connectors: