- **Description:** We use the OpenAI Speech-to-Text API to transcribe the user's voice into a command. We then send this command, together with a screenshot, to the Vision model. Finally, we use the Text-to-Speech API to turn the response text into an audio file, which Unity plays back to speak the response. The user can select different speakers, models, and speeds. For the command, we can add additional instructions for the model, as well as select an image, image & text, or text-only mode. The whole loop takes anywhere from `2-6 seconds`, depending on the internet connection.
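
The three API calls in this loop can be sketched in a few lines. Below is a minimal Python version using the official `openai` SDK (the actual project runs inside Unity/C#; the file paths, model names, and voice here are placeholder assumptions):

```python
# Minimal sketch of the speech -> vision -> speech loop via the openai SDK.
# Model names, file paths, and the voice are illustrative assumptions.
import base64
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# 1. Speech to text: transcribe the recorded microphone clip into a command.
with open("command.wav", "rb") as audio:
    command = client.audio.transcriptions.create(
        model="whisper-1", file=audio
    ).text

# 2. Vision: send the command together with a screenshot to the vision model.
with open("screenshot.png", "rb") as img:
    image_b64 = base64.b64encode(img.read()).decode()

response = client.chat.completions.create(
    model="gpt-4o",  # assumed vision-capable model
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": command},
            {"type": "image_url",
             "image_url": {"url": f"data:image/png;base64,{image_b64}"}},
        ],
    }],
)
answer = response.choices[0].message.content

# 3. Text to speech: turn the answer into an audio file for playback.
# The voice and speed are user-selectable, mirroring the options above.
speech = client.audio.speech.create(
    model="tts-1", voice="alloy", speed=1.0, input=answer
)
speech.write_to_file("answer.mp3")
```

The `2-6 seconds` of end-to-end latency comes from chaining these three network round trips, so a slow connection lengthens each step.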