Commit ef5921f

Update more headers in README.md
1 parent 725103c commit ef5921f

1 file changed: +4 -4 lines changed

README.md

Lines changed: 4 additions & 4 deletions
@@ -11,20 +11,20 @@
 <img src="https://github.com/OthersideAI/self-operating-computer/blob/main/readme/self-operating-computer.png" width="750" style="margin: 10px;"/>
 </div>
 
-### Key Features
+## Key Features
 - **Compatibility**: Designed for various multimodal models.
 - **Integration**: Currently integrated with **GPT-4v** as the default model.
 - **Future Plans**: Support for additional models.
 - **Accessibility**: Voice control thanks to [Whisper](https://github.com/mallorbc/whisper_mic) & [younesbram](https://github.com/younesbram)
 
 
-### Current Challenges
+## Current Challenges
 > **Note:** GPT-4V's error rate in estimating XY mouse click locations is currently quite high. This framework aims to track the progress of multimodal models over time, aspiring to achieve human-level performance in computer operation.
 
-### Ongoing Development
+## Ongoing Development
 At [HyperwriteAI](https://www.hyperwriteai.com/), we are developing Agent-1-Vision a multimodal model with more accurate click location predictions.
 
-### Agent-1-Vision Model API Access
+## Agent-1-Vision Model API Access
 We will soon be offering API access to our Agent-1-Vision model.
 
 If you're interested in gaining access to this API, sign up [here](https://othersideai.typeform.com/to/FszaJ1k8?typeform-source=www.hyperwriteai.com).
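
The heading change in this diff (four `###` headings promoted to `##`) could be reproduced or checked with a short script. The following is only a sketch, not part of the commit: the function name `promote_headings` and the `PROMOTED` set are hypothetical, and the titles are copied from the diff above.

```python
import re

# Headings this commit moved from level 3 to level 2 (copied from the diff).
PROMOTED = {
    "Key Features",
    "Current Challenges",
    "Ongoing Development",
    "Agent-1-Vision Model API Access",
}


def promote_headings(markdown: str, titles=PROMOTED) -> str:
    """Rewrite '### Title' as '## Title', but only for the listed titles."""
    out = []
    for line in markdown.splitlines():
        # Match exactly three '#' followed by whitespace and a title.
        match = re.fullmatch(r"###\s+(.*?)\s*", line)
        if match and match.group(1) in titles:
            line = "## " + match.group(1)
        out.append(line)
    # Note: splitlines() drops a trailing newline; re-add one if needed.
    return "\n".join(out)


if __name__ == "__main__":
    sample = "### Key Features\n- **Compatibility**: ...\n### Something Else"
    print(promote_headings(sample))
```

Restricting the rewrite to an explicit set of titles leaves any other level-3 headings in the file untouched, which matches the narrow scope of this commit.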
