A native macOS menu bar app for voice-to-text. Hold fn to record, release to transcribe and type.
- Hold fn key to record voice, release to transcribe and insert text
- Multiple cloud providers — Groq and OpenAI Whisper APIs for high accuracy
- On-device fallback — Apple Speech (no API key or internet needed)
- Works in any app — types transcribed text into the focused application
- Self-serve — bring your own API key (Groq free tier, or OpenAI)
- Lightweight — runs in menu bar, no dock icon
Important: AudioType previously ran transcription 100% locally using whisper.cpp. We found the local model quality insufficient for reliable daily use, so we switched to cloud-based Whisper APIs which provide significantly better accuracy and speed. An on-device Apple Speech fallback is available if you prefer no cloud usage.
What this means:
- When using a cloud engine, audio recordings are sent to the provider's servers for transcription
- An internet connection is required for cloud transcription (not needed for Apple Speech)
- Your API keys are stored locally in the macOS Keychain
- No audio is saved to disk locally — it is recorded in memory, sent to the cloud provider, and discarded
- See Groq's data policy or OpenAI's data policy for how they handle your data
If you prefer 100% offline transcription with no data leaving your machine, you can use AudioType v1.1.1 — the last release that runs transcription entirely on-device using a local OpenAI Whisper model via whisper.cpp. No internet or API key required. Note that local transcription accuracy is lower than the cloud version.
- macOS 14.0 (Sonoma) or later
- Apple Silicon or Intel Mac
- Internet connection (for cloud engines; not needed for Apple Speech)
- A cloud API key (optional — app works without one using Apple Speech):
- Free Groq API key, or
- OpenAI API key
AudioType works out of the box using Apple's on-device speech recognition. For higher accuracy, configure a cloud provider:
- Go to console.groq.com/keys
- Create an account or sign in
- Generate a new API key
- Copy the key — you'll paste it into AudioType on first launch
Groq's free tier is generous enough for typical dictation use. See Groq's rate limits for current details.
- Go to platform.openai.com/api-keys
- Create an account or sign in
- Generate a new API key
- Copy the key — you'll paste it into AudioType Settings
- Download the latest
.dmgfrom Releases - Open the DMG and drag AudioType to Applications
- First launch — Right-click the app and select "Open" (required for unsigned apps)
- Click "Open" in the dialog to confirm
Note: Since this app is not notarized, macOS will block it on first launch. You can also bypass this via Terminal:
xattr -cr /Applications/AudioType.app
# Clone the repository
git clone https://github.com/PatelUtkarsh/audio-type.git
cd audio-type
# Build and create app bundle
make app
# Run the app
open AudioType.appOn first launch, AudioType will ask you to:
- Grant Microphone access — to record your voice
- Grant Accessibility access — to type text into other apps
- Grant Speech Recognition — for on-device Apple Speech
- Enter a Groq API key (optional) — for cloud transcription
You can skip the API key step to use Apple Speech. Additional cloud providers (OpenAI) can be configured later in Settings.
| Permission | Purpose |
|---|---|
| Microphone | Record voice for transcription |
| Accessibility | Detect fn key and type text into apps |
| Speech Recognition | On-device Apple Speech transcription |
| Internet | Send audio to cloud provider (Groq or OpenAI) |
- Launch AudioType — appears in menu bar with a waveform icon
- Hold fn key — starts recording (overlay shows waveform)
- Release fn key — sends audio to the active engine and types the result
- Click menu bar icon — access Settings or Quit
- Engine Selection:
Auto(default) — uses Groq if configured, then OpenAI, then Apple SpeechGroq Whisper— always use Groq (requires API key)OpenAI Whisper— always use OpenAI (requires API key)Apple Speech— always use on-device recognition
- Groq API Key — add or update your Groq key
- OpenAI API Key — add or update your OpenAI key
- Model Selection:
- Groq:
Whisper Large V3 Turbo(default, faster) orWhisper Large V3(most accurate) - OpenAI:
GPT-4o Mini Transcribe(default, balanced),GPT-4o Transcribe(best), orWhisper V2(cheapest)
- Groq:
- Language — auto-detect or choose from 25+ languages
fn key held -> Record audio -> Release fn key
|
v
Encode audio as WAV
|
v
EngineResolver picks engine
(Groq / OpenAI / Apple Speech)
|
v
Text post-processing
(capitalization, corrections)
|
v
Simulate keyboard typing
into focused app
- Swift — native macOS app
- Groq API — cloud speech-to-text (Whisper Large V3)
- OpenAI API — cloud speech-to-text (GPT-4o Transcribe / Whisper)
- Apple Speech — on-device speech-to-text (SFSpeechRecognizer)
- AVAudioEngine — low-latency audio capture
- CGEvent — global hotkey detection and keyboard simulation
- macOS Keychain — secure API key storage
- Check Accessibility permission in System Settings > Privacy & Security > Accessibility
- Try removing and re-adding AudioType from the list
- Check Microphone permission in System Settings > Privacy & Security > Microphone
- Ensure your microphone is working in other apps
- Check your internet connection (for cloud engines)
- Verify your API key is valid in Settings
- If you see "Rate limited", wait a moment and try again
- Check Groq status or OpenAI status for service issues
- Open Settings from the menu bar icon and enter your API key
- Get a free Groq key at console.groq.com/keys
- Or use Apple Speech (no key required) by setting engine to Auto or Apple Speech
Groq offers a free tier that is generous enough for typical dictation use. For current limits and pricing, see Groq's rate limits and pricing.
OpenAI uses pay-as-you-go pricing. See OpenAI's pricing for current rates.
MIT