Speech Recognition Evaluation System

A web-based system for evaluating speech recognition model performance and managing speech datasets.

Features

Evaluation

Upload and process evaluation JSON files containing transcription results
Compare Character Error Rate (CER) metrics between different models
Visualize performance metrics with interactive charts
View detailed transcription results in a tabulated format
Support for multiple model comparisons
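
For readers unfamiliar with the metric, CER is conventionally the character-level edit distance between a reference and a predicted transcription, divided by the reference length. The sketch below illustrates that definition only; the function names `edit_distance` and `cer` are hypothetical, and the project's own CER computation may differ (for example in how whitespace or punctuation is normalized).

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference character
                curr[j - 1] + 1,     # insert a hypothesis character
                prev[j - 1] + cost,  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def cer(reference: str, hypothesis: str) -> float:
    """Character Error Rate: edit distance divided by reference length."""
    if not reference:
        raise ValueError("reference must be non-empty")
    return edit_distance(reference, hypothesis) / len(reference)
```

Because insertions are counted, CER can exceed 1.0 when the prediction is much longer than the reference.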

Dataset Management

Manage and organize speech datasets
View dataset statistics and information
Process raw and processed audio files
Track dataset versions and modifications
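
As an illustration of the kind of dataset statistics mentioned above, the following sketch counts WAV files under a directory and sums their durations using only the standard library. It assumes uncompressed PCM `.wav` files; `wav_duration_seconds` and `dataset_stats` are hypothetical names, not functions from this project.

```python
import wave
from pathlib import Path


def wav_duration_seconds(path: Path) -> float:
    """Duration of a PCM WAV file, from its frame count and sample rate."""
    with wave.open(str(path), "rb") as wf:
        return wf.getnframes() / wf.getframerate()


def dataset_stats(root: Path) -> dict:
    """Count WAV files under root (recursively) and summarize durations."""
    durations = [wav_duration_seconds(p) for p in sorted(root.rglob("*.wav"))]
    total = sum(durations)
    return {
        "num_files": len(durations),
        "total_seconds": total,
        "mean_seconds": total / len(durations) if durations else 0.0,
    }
```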

Technical Stack

Backend: FastAPI (Python)
Frontend: Bootstrap 5, Chart.js
Database: SQLite (for temporary data storage)
File Processing: JSON, Audio file handling
Internationalization: Multi-language support (English/Korean)

Installation

Clone the repository:

git clone [repository-url]
cd speech-recognition-evaluation

Create and activate a virtual environment (optional but recommended):

python -m venv venv
source venv/bin/activate  # Linux/Mac
# or
.\venv\Scripts\activate  # Windows

Install dependencies:

pip install -r requirements.txt

Usage

Start the server:

uvicorn app.main:app --reload

Access the web interface:

http://localhost:8000

Navigate to the evaluation page:

http://localhost:8000/evaluation

Upload your evaluation JSON file with the following structure:

{
  "transcriptions": {
    "model-name": [
      {
        "audio_filepath": "path/to/audio.wav",
        "pred_sentence": "predicted transcription"
      }
    ]
  },
  "CER metric": {
    "model-name": 0.123
  }
}
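
Before uploading, it can help to check a file against this structure locally. The sketch below is one way to do that with the standard library; `load_evaluation` and `rank_models` are hypothetical helpers written for this example, and the checks reflect only the keys shown above, not whatever validation the server actually performs.

```python
import json


def load_evaluation(text: str) -> dict:
    """Parse an evaluation JSON string and check the keys shown above."""
    data = json.loads(text)
    if not isinstance(data, dict):
        raise ValueError("top-level JSON value must be an object")
    for key in ("transcriptions", "CER metric"):
        if not isinstance(data.get(key), dict):
            raise ValueError(f"missing or invalid top-level key: {key!r}")
    for model, items in data["transcriptions"].items():
        if not isinstance(items, list):
            raise ValueError(f"transcriptions for {model!r} must be a list")
        for index, item in enumerate(items):
            for field in ("audio_filepath", "pred_sentence"):
                if not isinstance(item, dict) or field not in item:
                    raise ValueError(f"{model!r} entry {index} lacks {field!r}")
    return data


def rank_models(data: dict) -> list:
    """Return (model, CER) pairs sorted from lowest to highest CER."""
    return sorted(data["CER metric"].items(), key=lambda pair: pair[1])
```

A typical use would be `rank_models(load_evaluation(open(path, encoding="utf-8").read()))`, where `path` points at your own file.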

Example Image

Project Structure

app/
├── main.py              # FastAPI application entry point
├── config.py            # Configuration settings
├── models/             
│   ├── database.py      # Database models
│   └── evaluation.py    # Evaluation data models
├── routers/
│   ├── evaluation.py    # Evaluation routes
│   ├── dataset.py       # Dataset management routes
│   └── main_routes.py   # Main page routes
├── services/
│   └── evaluation_service.py  # Evaluation processing logic
├── templates/           # HTML templates
└── utils/
    └── i18n.py         # Internationalization support

Features in Detail

Evaluation System

Model Comparison: Compare multiple speech recognition models side by side
Metric Visualization: Interactive charts showing CER metrics
Detailed Results: View complete transcription results in a searchable table
Error Analysis: Identify and analyze transcription errors
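
One simple way to surface transcription errors is to align a reference with a prediction and list the spans that differ. The sketch below uses `difflib.SequenceMatcher` from the standard library. It assumes a reference transcription is available, which the JSON structure shown in Usage does not include, and `char_errors` is a hypothetical name. The alignment `difflib` produces is readable but not guaranteed to be the minimal edit script, so treat it as a diagnostic aid rather than a CER computation.

```python
from difflib import SequenceMatcher


def char_errors(reference: str, hypothesis: str) -> list:
    """List (operation, reference_span, hypothesis_span) for each mismatch."""
    # autojunk=False disables difflib's heuristic that ignores very frequent
    # characters in long sequences.
    matcher = SequenceMatcher(None, reference, hypothesis, autojunk=False)
    return [
        (tag, reference[i1:i2], hypothesis[j1:j2])
        for tag, i1, i2, j1, j2 in matcher.get_opcodes()
        if tag != "equal"
    ]
```

The operation is one of `replace`, `delete`, or `insert`, following `difflib`'s opcode names.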

User Interface

Clean and intuitive web interface
Responsive design for various screen sizes
Interactive data visualization
Drag-and-drop file upload support

Language Support

English (Default)
Korean (한국어)

Language can be switched using the selector in the navigation bar.
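
A language selector of this kind is often backed by a per-language lookup table with a fallback to the default language. The sketch below shows that pattern in isolation; the table contents, key names, and the `translate` function are all assumptions made for illustration and are not taken from `app/utils/i18n.py`.

```python
# Hypothetical translation tables; keys and strings are illustrative only.
TRANSLATIONS = {
    "en": {"nav.evaluation": "Evaluation", "nav.dataset": "Dataset Management"},
    "ko": {"nav.evaluation": "평가"},
}
DEFAULT_LANG = "en"


def translate(key: str, lang: str = DEFAULT_LANG) -> str:
    """Look up key in lang, then in the default language, then return the key."""
    table = TRANSLATIONS.get(lang, TRANSLATIONS[DEFAULT_LANG])
    return table.get(key, TRANSLATIONS[DEFAULT_LANG].get(key, key))
```

Falling back to English, and finally to the key itself, keeps a page renderable even when a translation is missing.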

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.


hwanython/Speech-Recognition-Evaluation-System
