AI Transcribe

An audio transcriber application that captures audio and transforms it into text using OpenAI's Whisper API.

Detailed Write up

There is a two-series Medium articles explaining everything in this project:

Feel free to read those articles before exploring the codes to get more context about this project.

Overview

AI Transcribe is an application focused on transcribing audio into text and insights. It captures audio through your device's microphone, processes it, and returns accurate transcriptions within seconds. The project demonstrates how to build a complete audio transcription system with a modern UI using Next.js. See demo video below to see it in action.

Screen.Recording.2025-04-26.at.00.41.11.mp4

Key Features

Real-time Audio Capture: Records audio through your device's microphone
Audio Processing: Chunks and processes audio for optimal transcription performance
OpenAI Whisper Integration: Utilizes the powerful Whisper API for accurate speech-to-text conversion
Modern UI: Built with a clean, responsive interface using Tailwind CSS and Shadcn UI

Tech Stack

Frontend: Next.js 15, React 19, Tailwind CSS
UI Components: Shadcn UI components
API Integration: OpenAI API, Next.js Server Action
State Management: React Hooks
Styling: Tailwind CSS with class-variance-authority

Getting Started

Prerequisites

Node.js 18.17 or higher
An OpenAI API key

Environment Setup

Clone the repository:

git clone https://github.com/ifindev/ai-transcribe.git
cd ai-transcribe

Install dependencies:
```
npm install
```
Create a .env.local file in the root directory with the following variables:
```
OPENAI_API_KEY=your-openai-api-key
```
Start the development server:
```
npm run dev
```
Open http://localhost:3000 in your browser to use the application.

Using the Application

Click on the record button to start capturing audio
Speak clearly into your microphone
The application will process your speech and display the transcription in real-time
You can pause, resume, or stop the recording at any time
Review your transcription history in the recordings list

Project Structure

src/app: Next.js app router pages and layouts
src/components: Reusable UI components
src/hooks: Custom React hooks for audio recording and processing
src/modules: Feature-specific modules (workspace, etc.)
src/services: Service layer for external API integrations
src/actions: Server actions for API requests
src/models: TypeScript type definitions
src/utils: Utility functions
src/libs: Third party library instantioation

Future Enhancements

Support for multiple languages
Speaker identification
Searchable transcription history
Export options (PDF, Word, etc.)
Automatic summarization using AI

Troubleshooting

Microphone Access Issues

Make sure to grant microphone access permission when prompted by your browser. If you accidentally denied it, you may need to reset permissions in your browser settings.

Transcription Quality Issues

For best results:

Use a high-quality microphone
Minimize background noise
Speak clearly and at a moderate pace
Position the microphone close to the speaker

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 88 Commits
.cursor/rules		.cursor/rules
.vscode		.vscode
public		public
src		src
.cursorrules		.cursorrules
.gitignore		.gitignore
.prettierignore		.prettierignore
.prettierrc		.prettierrc
LICENSE		LICENSE
README.md		README.md
components.json		components.json
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AI Transcribe

Detailed Write up

Overview

Key Features

Tech Stack

Getting Started

Prerequisites

Environment Setup

Using the Application

Project Structure

Future Enhancements

Troubleshooting

Microphone Access Issues

Transcription Quality Issues

License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

ifindev/ai-transcribe

Folders and files

Latest commit

History

Repository files navigation

AI Transcribe

Detailed Write up

Overview

Key Features

Tech Stack

Getting Started

Prerequisites

Environment Setup

Using the Application

Project Structure

Future Enhancements

Troubleshooting

Microphone Access Issues

Transcription Quality Issues

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages