Nezha-LLM

Made for personal use

A local LLM/voice-to-LLM application that transcribes audio using Whisper and generates responses with a local LLM model.

Project Structure

Nezha-LLM/
├── api/          # FastAPI server (REST endpoints)
├── asr/          # Automatic Speech Recognition (faster-whisper)
├── llm/          # Local LLM service (Qwen)
├── ui/           # Web frontend
├── tests/        # Test files
└── models/       # Local model files

Requirements

fastapi>=0.123.0
uvicorn>=0.38.0
python-multipart>=0.0.20
pydantic-settings>=2.12.0
faster-whisper>=1.2.1
transformers>=4.57.0
torch>=2.9.0
sounddevice>=0.5.3
numpy>=2.3.0
scipy>=1.16.0

Installation

# Clone the repository
git clone https://github.com/yxxTries/Nezha-LLM.git
cd Nezha-LLM

# Create virtual environment
python -m venv .venv
.venv\Scripts\activate  # Windows
# or
source .venv/bin/activate  # Linux/Mac

# Install dependencies
pip install -r requirements.txt

Running

Setup Instructions: API Server + UI

launch start.bat to initialize the API server, then use webserver to open ui.

Once running, access the interactive docs at:

Swagger UI: http://localhost:8000/docs
ReDoc: http://localhost:8000/redoc

API Endpoints

POST /api/llm - Send text to LLM
POST /api/asr-llm - Upload audio for transcription + LLM response

Privacy

No user text or LLM responses are persisted to disk or any database.
Uploaded audio for /api/asr-llm is written to a temporary file only for transcription and is deleted immediately after processing.
The web UI does not use localStorage or sessionStorage; chat messages exist only in memory and disappear on refresh.

Built by Amil

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nezha-LLM

Project Structure

Requirements

Installation

Running

Setup Instructions: API Server + UI

API Endpoints

Privacy

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
.github		.github
api		api
asr		asr
audiosample		audiosample
llm		llm
tests		tests
ui		ui
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
start.bat		start.bat

Folders and files

Latest commit

History

Repository files navigation

Nezha-LLM

Project Structure

Requirements

Installation

Running

Setup Instructions: API Server + UI

API Endpoints

Privacy

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages