|
45 | 45 | "| o3 | Deep tool‑using agent | High‑stakes, multi‑step reasoning | Latency & price | Cost/latency → o4‑mini |\n", |
46 | 46 | "| o4‑mini | Cheap, fast reasoning | High‑volume \"good‑enough\" logic | Depth ceiling vs o3 | Accuracy critical → o3 |\n", |
47 | 47 | "\n", |
48 | | - "# *(Full price and utility table → [Section 6.1](#appendices))*\n", |
| 48 | + "*(Full price and utility table → [Section 6.1](#appendices))*\n", |
49 | 49 | "\n", |
50 | 50 | "## 2.2 Model Evolution at a Glance\n", |
51 | 51 | "\n", |
|
61 | 61 | "\n", |
62 | 62 | "### OpenAI Model Evolution \n", |
63 | 63 | "\n", |
64 | | - |
65 | 64 | "\n", |
66 | | - |
67 | 65 | "\n", |
68 | 66 | "### Key Characteristics\n", |
69 | 67 | "\n", |
|
81 | 79 | "\n", |
82 | 80 | "## 3A. Use Case: Long-Context RAG for Legal Q&A\n", |
83 | 81 | "\n", |
84 | | - |
85 | 82 | "\n", |
86 | | - |
87 | 83 | "## 🗂️ TL;DR Matrix\n", |
88 | 84 | "\n", |
89 | 85 | "This table summarizes the core technology choices and their rationale for **this specific Long-Context Agentic RAG implementation**.\n", |
|
137 | 133 | "id": "db9bad1b", |
138 | 134 | "metadata": {}, |
139 | 135 | "source": [ |
140 | | - |
141 | 136 | "\n", |
142 | | - |
143 | 137 | "\n", |
144 | 138 | "\n", |
145 | 139 | "## Agentic RAG System: Model Usage\n", |
|
1821 | 1815 | "================================================================================\n", |
1822 | 1816 | "\n", |
1823 | 1817 | "## 3B. Use Case: AI Co-Scientist for Pharma R&D\n", |
1824 | | - |
1825 | 1818 | "\n", |
1826 | | - |
1827 | 1819 | "\n", |
1828 | 1820 | "This section details how to build an AI system that functions as a \"co-scientist\" to accelerate experimental design in pharmaceutical R&D, focusing on optimizing a drug synthesis process under specific constraints.\n", |
1829 | 1821 | "\n", |
|
1863 | 1855 | "\n", |
1864 | 1856 | "The system employs a multi-agent architecture that emulates a high-performing scientific team. Different AI components, acting in specialized roles (such as ideation, critique, and learning from outcomes), collaborate using various models and tools to execute the workflow.\n", |
1865 | 1857 | "\n", |
1866 | | - |
1867 | 1858 | "\n", |
1868 | | - |
1869 | 1859 | "\n", |
1870 | 1860 | "### 2.1. **Scientist Input & Constraints:** \n", |
1871 | 1861 | "The process starts with the scientist defining the goal, target compound, and constraints." |
|
2473 | 2463 | "\n", |
2474 | 2464 | "## 3C. Use Case: Insurance Claim Processing\n", |
2475 | 2465 | "\n", |
2476 | | - |
2477 | 2466 | "\n", |
2478 | | - |
2479 | 2467 | "\n", |
2480 | 2468 | "Many businesses are faced with the task of digitizing hand filled forms. In this section, we will demonstrate how OpenAI can be used to digitize and validate a hand filled insurance form. While this is a common problem for insurance, the same techniques can be applied to a variety of other industries and forms, for example tax forms, invoices, and more.\n", |
2481 | 2469 | "\n", |
|
2505 | 2493 | "\n", |
2506 | 2494 | "The high level basic architecture of the solution is shown below.\n", |
2507 | 2495 | "\n", |
2508 | | - |
2509 | 2496 | "\n", |
2510 | | - |
2511 | 2497 | "\n", |
2512 | 2498 | "This task is complex and requires a wide variety of model capabilities, including vision, function calling, reasoning, and structured output. While `o3` is capable of doing all of these at once, we found during experimentation that `o4-mini` alone was not sufficient to achieve the necessary performance. Due to the higher relative costs of `o3`, we instead opted for a two-stage approach.\n", |
2513 | 2499 | "\n", |
|
2517 | 2503 | "\n", |
2518 | 2504 | "To demonstrate concretely how this works, let's look at a sample image of an insurance form.\n", |
2519 | 2505 | "\n", |
2520 | | - |
2521 | 2506 | "\n", |
2522 | | - |
2523 | 2507 | "\n", |
2524 | 2508 | "While the form itself is fairly straightforward, there is missing data and ambiguous information that will be difficult for a traditional OCR system to fill out correctly. First, notice that the zip code and county have been omitted. Second, the email address of the user is ambiguous \\-- it could be `[email protected]` or `[email protected]`. In the following sections, we will walk through how a well-designed solution can handle these ambiguities and return the correct form results.\n", |
2525 | 2509 | "\n", |
|
3202 | 3186 | "\n", |
3203 | 3187 | "## Adaptation Decision Tree\n", |
3204 | 3188 | "\n", |
3205 | | - |
3206 | 3189 | "\n", |
3207 | | - |
3208 | | - |
3209 | 3190 | "\n", |
3210 | 3191 | "## Communicating Model Selection to Non-Technical Stakeholders\n", |
3211 | 3192 | "\n", |
|
3305 | 3286 | "\n", |
3306 | 3287 | "## Contributors\n", |
3307 | 3288 | "\n", |
3308 | | - " This cookbook serves as a joint collaboration effort between OpenAI and [Tribe AI](https://www.tribe.ai/)\n", |
| 3289 | + " This cookbook serves as a joint collaboration effort between OpenAI and [Tribe AI](https://www.tribe.ai/)\n", |
3309 | 3290 | "- [Kashyap Coimbatore Murali](https://www.linkedin.com/in/kashyap-murali/)\n", |
3310 | 3291 | "- [Nate Harada](https://www.linkedin.com/in/nate-harada/) \n", |
3311 | 3292 | "- [Sai Prashanth Soundararaj](https://www.linkedin.com/in/saiprashanths/)\n", |
3312 | 3293 | "- [Shikhar Kwatra](https://www.linkedin.com/in/shikharkwatra/)" |
3313 | | - |
3314 | 3294 | ] |
3315 | 3295 | } |
3316 | 3296 | ], |
|
0 commit comments