cd backend
npm run devThe server will run at http://localhost:3001
cd frontend
npm startThe app will open at http://localhost:3000
- Sign up at https://platform.openai.com/
- Go to API Keys: https://platform.openai.com/api-keys
- Create a new key
- Add it to the backend/.env file:
OPENAI_API_KEY=sk-your-real-key-here- Open http://localhost:3000
- Drag and drop an audio file into the upload area or click “Choose File”
- Supported formats: MP3, WAV, OGG, FLAC, M4A, WEBM
- Maximum file size: 25MB
- Click “Transcribe Audio”
- Wait for the result and copy the text
Make sure the backend runs on port 3001 and the frontend on 3000.
# Clear npm cache
npm cache clean --force
# Reinstall dependencies
rm -rf node_modules package-lock.json
npm install- Check your API key in the .env file
- Make sure your account has credits
- Check quotas and limits
- Verify the file format (MP3, WAV, OGG, etc.)
- Ensure the file size does not exceed 25MB
- Check permissions for the uploads folder
- AssemblyAI:
npm install assemblyai- Azure Speech Services:
npm install microsoft-cognitiveservices-speech-sdk- Google Cloud Speech:
npm install @google-cloud/speechpip install openai-whisperCreate a file whisper_local.py:
import whisper
import sys
model = whisper.load_model("base")
result = model.transcribe(sys.argv[1], language="hy")
print(result["text"])