Expo STT Blog

A real-time speech-to-text mobile application built with Expo and React Native. The app runs its AI model on-device, so transcription works without an internet connection.

Features

🎙️ Real-time Speech Transcription: Live voice-to-text conversion using Whisper Tiny EN model
📱 Cross-platform: Works on iOS and Android
🔒 Privacy-focused: On-device processing - no data sent to servers
⚡ Low latency: Optimized for real-time performance with 100 ms audio chunks
🎛️ Audio optimizations: Configured for optimal speech recording quality

Technologies Used

Expo (~53.0.22) - Development platform
React Native (0.79.6) - Mobile framework
react-native-audio-api (^0.7.1) - Audio recording and processing
react-native-executorch (^0.5.1) - On-device AI model execution
Whisper Tiny EN - Lightweight speech recognition model

Prerequisites

Node.js (18 or higher, as required by React Native 0.79)
Expo CLI
iOS Simulator or Android Emulator (or physical device)
Xcode (for iOS development)
Android Studio (for Android development)

Installation

Clone the repository:

git clone https://github.com/software-mansion-labs/expo-stt-blog.git
cd expo-stt-blog

Install dependencies:

npm install

Start the development server:

npm start

Run on your preferred platform:

# iOS
npm run ios

# Android
npm run android

Permissions

The app requires microphone permissions for speech recording:

iOS

Microphone access is automatically requested with the message: "This app requires microphone access for real-time speech transcription."

Android

RECORD_AUDIO - Required for audio recording
MODIFY_AUDIO_SETTINGS - Required for optimal audio configuration
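
For reference, permissions like these are usually declared in the Expo config. The following is a minimal sketch of what the relevant part might look like, written as a TypeScript object in the same shape as app.json. The `ios.infoPlist` and `android.permissions` keys are standard Expo config fields, but the constant name `config` is chosen here for illustration, and the repository's actual app.json remains the source of truth.

```typescript
// Hypothetical sketch of the permission-related part of the Expo config.
// It mirrors the shape of app.json; the real file in the repository may differ.
const config = {
  expo: {
    ios: {
      infoPlist: {
        // Message shown by iOS in the microphone permission prompt.
        NSMicrophoneUsageDescription:
          "This app requires microphone access for real-time speech transcription.",
      },
    },
    android: {
      // Android permissions listed in the section above.
      permissions: [
        "android.permission.RECORD_AUDIO",
        "android.permission.MODIFY_AUDIO_SETTINGS",
      ],
    },
  },
};
// In an app.config.ts file, this object would be the default export.
```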

Usage

Launch the app
Grant microphone permissions when prompted
Tap "Start Recording" to begin real-time transcription
Speak into your device's microphone
Watch as your speech is converted to text in real-time
Tap "Stop Recording" to end the session

Technical Details

Audio Configuration

Sample Rate: 16 kHz (optimized for Whisper model)
Buffer Size: 1600 samples (100 ms chunks at 16 kHz)
iOS Audio Session: Configured for speech with playAndRecord category and spokenAudio mode
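
The relationship between sample rate, chunk duration, and buffer size can be checked with a few lines of arithmetic. The sketch below is illustrative only: `samplesPerChunk` and `splitIntoChunks` are hypothetical helpers, not functions from the app or from react-native-audio-api.

```typescript
// Values taken from the Audio Configuration above.
const SAMPLE_RATE_HZ = 16000;
const CHUNK_DURATION_MS = 100;

// Samples in one chunk: rate (samples per second) times duration (seconds).
function samplesPerChunk(sampleRateHz: number, chunkMs: number): number {
  return Math.round((sampleRateHz * chunkMs) / 1000);
}

// Split a mono PCM buffer into fixed-size chunks. A shorter final chunk
// is kept so that no audio is dropped. (Hypothetical helper.)
function splitIntoChunks(samples: Float32Array, chunkSize: number): Float32Array[] {
  if (chunkSize <= 0) throw new RangeError("chunkSize must be positive");
  const chunks: Float32Array[] = [];
  for (let start = 0; start < samples.length; start += chunkSize) {
    // subarray clamps the end index to the buffer length.
    chunks.push(samples.subarray(start, start + chunkSize));
  }
  return chunks;
}

const bufferSize = samplesPerChunk(SAMPLE_RATE_HZ, CHUNK_DURATION_MS); // 1600
const oneSecondOfAudio = new Float32Array(SAMPLE_RATE_HZ);
const chunks = splitIntoChunks(oneSecondOfAudio, bufferSize); // 10 chunks
```

One second of 16 kHz audio therefore yields ten chunks of 1600 samples each, which is where the 100 ms figure comes from.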

Model Information

Uses Whisper Tiny EN model for English speech recognition
On-device processing ensures privacy and works offline
Supports both committed (finalized) and non-committed (provisional) transcriptions
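
To illustrate how committed and non-committed text can be combined for display, here is a minimal sketch. It assumes each update carries only the newly committed text plus the current provisional text. The `TranscriptionUpdate` shape and the `applyUpdate` and `displayText` helpers are hypothetical and are not the react-native-executorch API.

```typescript
// Hypothetical shape of one streaming update (an assumption for this sketch):
// `committed` is newly finalized text, `nonCommitted` may still be revised.
interface TranscriptionUpdate {
  committed: string;
  nonCommitted: string;
}

interface TranscriptState {
  finalized: string;   // accumulated committed text, never rewritten
  provisional: string; // latest non-committed text, replaced on every update
}

// Append newly committed text and replace the provisional tail.
function applyUpdate(state: TranscriptState, update: TranscriptionUpdate): TranscriptState {
  const addition = update.committed.trim();
  const finalized = addition
    ? (state.finalized ? `${state.finalized} ${addition}` : addition)
    : state.finalized;
  return { finalized, provisional: update.nonCommitted.trim() };
}

// Text to render: finalized text followed by the provisional tail.
function displayText(state: TranscriptState): string {
  return [state.finalized, state.provisional].filter(Boolean).join(" ");
}

let state: TranscriptState = { finalized: "", provisional: "" };
state = applyUpdate(state, { committed: "", nonCommitted: "hello wor" });
const first = displayText(state); // "hello wor"
state = applyUpdate(state, { committed: "hello world", nonCommitted: "how are" });
const second = displayText(state); // "hello world how are"
```

Keeping the two parts separate lets a UI style the provisional tail differently (for example, in a lighter color) until the model finalizes it.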

Project Structure

expo-stt-blog/
├── App.tsx                 # Main application component
├── app.json               # Expo configuration
├── package.json           # Dependencies and scripts
├── assets/               # App icons and images
├── android/              # Android-specific files
├── ios/                  # iOS-specific files
└── node_modules/         # Dependencies

License

This project is licensed under the MIT License - see the LICENSE file for details.

Performance Tips

Use the app in a quiet environment for best transcription accuracy
Keep the device close to your mouth when speaking
Speak clearly and at a moderate pace
Ensure your device has enough free storage and memory to load and run the model on-device

Acknowledgments

Expo - Development platform
React Native Audio API - Audio processing
React Native Executorch - On-device AI
OpenAI Whisper - Speech recognition model
