docs(qwen3_tts): clarify voice cloning vs speaker synthesis usage rules by mm65x · Pull Request #582 · Blaizzy/mlx-audio

mm65x · 2026-03-15T18:20:07Z

Description

this updates the qwen3_tts README to explicitly clarify the API contract for voice cloning vs speaker synthesis to prevent user confusion, based on the discussion in #557.

closes #557

specifically, it clarifies that:

voice cloning is strictly for the Base model variants.
users should not supply a voice argument alongside ref_audio and ref_text to avoid routing/configuration conflicts.
ref_text must be a literal transcript string, unlike ref_audio which accepts a file path.

Changes in the codebase

updated mlx_audio/tts/models/qwen3_tts/README.md

Checklist

Documentation updated
Issue referenced - closes Qwen3 CustomVoice ignores clone refs at /v1/audio/speech and falls back to speaker-only path #557

…o fix convert script

mm65x added 2 commits March 15, 2026 18:14

fix(fish_speech): prevent fast_ path and embeddings from quantizing t…

055fd43

…o fix convert script

docs(qwen3_tts): clarify voice cloning vs speaker synthesis usage rules

c44f5ae

mm65x closed this Mar 15, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docs(qwen3_tts): clarify voice cloning vs speaker synthesis usage rules#582

docs(qwen3_tts): clarify voice cloning vs speaker synthesis usage rules#582
mm65x wants to merge 2 commits intoBlaizzy:mainfrom
mm65x:docs-qwen3-tts

mm65x commented Mar 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

mm65x commented Mar 15, 2026

Description

Changes in the codebase

Checklist

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant