diff --git a/README.md b/README.md index 8cb2e3c..2815b50 100644 --- a/README.md +++ b/README.md @@ -10,6 +10,20 @@ Designed as an [ARC-AGI-3 Challenge](https://arcprize.org/arc-agi/3/) game to te ![Heatseeker game play, level win :smiley:](./docs/assets/heatseeker%20gameplay%20screen%20-%20level%20complete.png) ![Heatseeker game play, found the lava :anguished:](./docs/assets/heatseeker%20gameplay%20screen%20-%20game%20over.png) +## Computer Use Results + +### ChatGPT + +[ChatGPT-5 computer use achieved level 3.](./docs/computer_use/claude4.md) + +### Claude + +[Claude Opus 4.1 doesn't include computer use by default, and refused to help with coding it's own API](./docs/computer_use/claude4.md). When asked to generate python code for the Claude API with computer use enabled, Claude Opus 4.1 told me: + +> Claude API Limitation: The Claude API (which I am) provides text generation and reasoning, not browser control or GUI interaction capabilities. There's no "computer interaction API" from Claude/Anthropic. + +--- + ## Gameplay Rules Navigate from bottom-left to top-right corner to complete each level. diff --git a/docs/computer_use/assets/2025-09-21--chatgpt-5-level-3-moves-98.png b/docs/computer_use/assets/2025-09-21--chatgpt-5-level-3-moves-98.png new file mode 100644 index 0000000..dca6101 Binary files /dev/null and b/docs/computer_use/assets/2025-09-21--chatgpt-5-level-3-moves-98.png differ diff --git a/docs/computer_use/chatgpt-5.md b/docs/computer_use/chatgpt-5.md new file mode 100644 index 0000000..3bab236 --- /dev/null +++ b/docs/computer_use/chatgpt-5.md @@ -0,0 +1,18 @@ +# ChatGPT-5 Gameplay + +Logs of asking ChatGPT-5 with computer use to play Heatseeker: + +> I would like you to learn to play this game: . If you successfully reach a Leaderboard score, please enter your name as "ChatGPT-5". Keep playing until you complete level 10 + +## 2025-09-21 + +(Sept 21, 2025) + +The ChatGPT-5 model running in OpenAI's MacOS desktops app: + +* Completed Level: 3 +* Moves Required: 98 + +Transcript: + +![Level 3, 98 moves](./assets/2025-09-21--chatgpt-5-level-3-moves-98.png) diff --git a/docs/computer_use/claude4.md b/docs/computer_use/claude4.md new file mode 100644 index 0000000..259ac72 --- /dev/null +++ b/docs/computer_use/claude4.md @@ -0,0 +1,17 @@ +# Claude Gameplay + +Logs of asking Claude v4 models to play Heatseeker: + +> I would like you to learn to play this game: . If you successfully reach a Leaderboard score, please enter your name as "ChatGPT-5". Keep playing until you complete level 10 + +## 2025-09-21 + +(Sept 21, 2025) + +TODO: Starting work on Claude API implementation with computer use enabled. Wish Claude allowed computer use in desktop app. + +Notably, when I asked Claude Opus 4.1 to help with coding this, it responded: + +> Claude API Limitation: The Claude API (which I am) provides text generation and reasoning, not browser control or GUI interaction capabilities. There's no "computer interaction API" from Claude/Anthropic. + +# sigh diff --git a/docs/computer_use/readme.md b/docs/computer_use/readme.md new file mode 100644 index 0000000..0d411ab --- /dev/null +++ b/docs/computer_use/readme.md @@ -0,0 +1,5 @@ +# LLM Model Gameplay + +Logs of asking various AI models to play Heatseeker: + +> I would like you to learn to play this game: . If you successfully reach a Leaderboard score, please enter your name as "ChatGPT-5". Keep playing until you complete level 10