Skip to content

docs(qwen3_tts): clarify voice cloning vs speaker synthesis usage rules#582

Closed
mm65x wants to merge 2 commits intoBlaizzy:mainfrom
mm65x:docs-qwen3-tts
Closed

docs(qwen3_tts): clarify voice cloning vs speaker synthesis usage rules#582
mm65x wants to merge 2 commits intoBlaizzy:mainfrom
mm65x:docs-qwen3-tts

Conversation

@mm65x
Copy link
Copy Markdown
Contributor

@mm65x mm65x commented Mar 15, 2026

Description

this updates the qwen3_tts README to explicitly clarify the API contract for voice cloning vs speaker synthesis to prevent user confusion, based on the discussion in #557.

closes #557

specifically, it clarifies that:

  1. voice cloning is strictly for the Base model variants.
  2. users should not supply a voice argument alongside ref_audio and ref_text to avoid routing/configuration conflicts.
  3. ref_text must be a literal transcript string, unlike ref_audio which accepts a file path.

Changes in the codebase

  • updated mlx_audio/tts/models/qwen3_tts/README.md

Checklist

@mm65x mm65x closed this Mar 15, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Qwen3 CustomVoice ignores clone refs at /v1/audio/speech and falls back to speaker-only path

1 participant