Skip to content

Commit b5367ac

Browse files
committed
Add ### Multimodal Models section
1 parent 522ad02 commit b5367ac

File tree

1 file changed

+16
-5
lines changed

1 file changed

+16
-5
lines changed

README.md

Lines changed: 16 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -75,12 +75,7 @@ mv .example.env .env
7575
```
7676
OPENAI_API_KEY='your-key-here'
7777
```
78-
OR
7978

80-
**Add your Google AI Studio API key to your new .env file. If you don't have one, you can obtain a key [here](https://makersuite.google.com/app/apikey) after setting up your Google AI Studio account**:
81-
```
82-
GOOGLE_API_KEY='your-key-here'
83-
```
8479
8. **Run it**!
8580
```
8681
operate
@@ -94,6 +89,22 @@ operate
9489

9590
## Using `operate` Modes
9691

92+
93+
### Multimodal Models
94+
An additional model is now compatible with the Self Operating Computer Framework. Try Google's `gemini-pro-vision` by following the instructions below.
95+
96+
**Add your Google AI Studio API key to your new .env file. ***If you don't have one, you can obtain a key [here](https://makersuite.google.com/app/apikey) after setting up your Google AI Studio account. You may also need [Authorize credentials for a desktop application](https://ai.google.dev/palm_docs/oauth_quickstart). It took me a bit of time to get it working, if anyone knows a simpler way, please make a PR:
97+
```
98+
GOOGLE_API_KEY='your-key-here'
99+
```
100+
101+
Start `operate` with the Gemini model
102+
```
103+
operate -m gemini-pro-vision
104+
```
105+
106+
**Add your Google AI Studio API key to your .env file.** If you don't have one, you can obtain a key [here](https://makersuite.google.com/app/apikey) after setting up your Google AI Studio account:
107+
97108
### Voice Mode
98109
- Install the additional `requirements-audio.txt`
99110
```

0 commit comments

Comments
 (0)