This project is currently not working as intended and is not being developed further. Timings are not quite correct, and the model sometimes gets stuck.
Audio transcription using Mistral AI's Voxtral API. Supports long audio files and provides a REST API for job-based processing.
- Transcribes audio/video files using Mistral AI
- Handles long files by splitting into segments
- REST API with job queue
- Outputs SRT subtitles or JSON format
- Docker support
```bash
git clone <repository>
uv sync
```
```bash
docker build -t transcription-api .
docker run -d -p 8888:8888 -e MISTRAL_API_KEY=your_key transcription-api
```
Start the server:
```bash
MISTRAL_API_KEY=your_key uv run python web_server.py
```
```http
POST /transcription/job
Content-Type: application/json

{
  "path": "/path/to/audio.mp3",
  "language": "en",
  "format": "json",
  "priority": 1000
}
```
Response:
```json
{
  "job_id": "uuid-string",
  "status": "QUEUED"
}
```
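For example, a job can be submitted from Python with nothing but the standard library. This is a minimal sketch: the base URL assumes the default port 8888 from the Docker example above, and the file path is a placeholder.

```python
# Submit a transcription job (sketch; BASE_URL assumes the default port 8888).
import json
import urllib.request

BASE_URL = "http://localhost:8888"

payload = {
    "path": "/path/to/audio.mp3",  # placeholder path, replace with a real file
    "language": "en",
    "format": "json",
    "priority": 1000,
}

request = urllib.request.Request(
    f"{BASE_URL}/transcription/job",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
    method="POST",
)

with urllib.request.urlopen(request) as response:
    job = json.load(response)

print(job["job_id"], job["status"])  # e.g. "uuid-string QUEUED"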
```http
GET /transcription/job/{job_id}
```
Response (completed):
```json
{
  "job_id": "uuid-string",
  "status": "COMPLETED",
  "result": {
    "text": "Full transcription...",
    "segments": [
      {
        "text": "Segment text",
        "start": 0.0,
        "end": 5.2,
        "words": [...]
      }
    ]
  }
}
```
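A simple way to consume this endpoint is to poll until the job reports `COMPLETED`. The sketch below assumes the same base URL as above and uses a placeholder `job_id`; status values other than `QUEUED` and `COMPLETED` are not documented here, so it just gives up after a fixed number of attempts.

```python
# Poll a job until it completes, then print its segments (sketch).
import json
import time
import urllib.request

BASE_URL = "http://localhost:8888"
job_id = "uuid-string"  # replace with the job_id returned on submission

job = None
for _ in range(120):  # poll for up to ~10 minutes
    with urllib.request.urlopen(f"{BASE_URL}/transcription/job/{job_id}") as response:
        job = json.load(response)
    if job["status"] == "COMPLETED":
        break
    time.sleep(5)  # poll interval; tune to taste

if job and job["status"] == "COMPLETED":
    for segment in job["result"]["segments"]:
        print(f"[{segment['start']:.1f}s - {segment['end']:.1f}s] {segment['text']}")
```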
```http
GET /transcription/jobs
GET /stats
```
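The exact response shapes of these two endpoints are not documented above, so the sketch below (same base-URL assumption) simply pretty-prints whatever the server returns.

```python
# Inspect the job list and server statistics (sketch).
import json
import urllib.request

BASE_URL = "http://localhost:8888"

with urllib.request.urlopen(f"{BASE_URL}/transcription/jobs") as response:
    print(json.dumps(json.load(response), indent=2))  # all known jobs

with urllib.request.urlopen(f"{BASE_URL}/stats") as response:
    print(json.dumps(json.load(response), indent=2))  # server statistics
```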
- Supported formats: `json`, `srt`
- Supported models: `voxtral-small-2507`, `voxtral-small-latest`
- Supported languages: All 67 Mistral AI language codes
- Priority: 1-499 (low), 500-9999 (high); see the example request after this list
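As a sketch combining these options, and using only the request fields documented above, a low-priority job requesting SRT output could be submitted with a body like:

```json
{
  "path": "/path/to/audio.mp3",
  "language": "en",
  "format": "srt",
  "priority": 100
}
```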
- Python 3.9+
- Mistral AI API key
- FFmpeg (for audio processing)