{ "Model Family": null, "Model": "o3", "Method": "Direct + Few-Shot", "Successes": 51.0, "Failures": 5.0, "Abstentions": 44.0, "Break-Even Price": "$126.41 \u00b1 24.54" },

  Original table:
  OpenAI GPT-4.1

Model    Method               Successes  Failures  Abstentions  Break-Even Price
GPT-4.1  Few-Shot             87         8         5            $247.99 ± 341.76
GPT-4.1  Direct + Few-Shot    47         0         53           $143.10 ± 22.49
GPT-4.1  Parsed + Few-Shot    38         1         61           $164.70 ± 21.98
GPT-4.1  Few-Shot + Few-Shot  81         5         14           $40.08 ± 15.87
o3       Few-Shot             81         13        6            $60.26 ± 58.93
o3       Direct + Few-Shot    51         5         44           $126.41 ± 24.54
o3       Parsed + Few-Shot    68         7         25           $75.11 ± 22.46
o3       Few-Shot + Few-Shot  74         8         18           $58.13 ± 20.91

The "Model Family" field should be "OpenAI GPT-4.1" rather than null.
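
One way to apply this correction is with a small script. The following is a minimal sketch, assuming each record is a plain JSON object shaped like the o3 record in this file. `fill_model_family` is a hypothetical helper introduced here for illustration, not something the file defines.

```python
import json

# Record as it appears in the file, with the family not yet filled in.
record = {
    "Model Family": None,
    "Model": "o3",
    "Method": "Direct + Few-Shot",
    "Successes": 51.0,
    "Failures": 5.0,
    "Abstentions": 44.0,
    "Break-Even Price": "$126.41 \u00b1 24.54",
}


def fill_model_family(rec, family="OpenAI GPT-4.1"):
    """Return a copy of rec with a null "Model Family" replaced by family.

    Hypothetical helper: the default family is the value the note asks for.
    """
    fixed = dict(rec)  # copy so the input record is left untouched
    if fixed.get("Model Family") is None:
        fixed["Model Family"] = family
    return fixed


fixed_record = fill_model_family(record)

# json.dumps escapes non-ASCII by default, so the plus-minus sign is
# written as \u00b1, matching the style of the record in the file.
print(json.dumps(fixed_record))
```

The helper only fills in a missing value, so a record that already names a family is left unchanged.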