## 🔥 Feature: External API for Audiobook Generation by sergenes · Pull Request #19 · Blaizzy/mlx-audio

sergenes · 2025-03-08T00:27:20Z

Summary

This PR introduces generate_audio(), a new function to allow external scripts to generate speech audio using mlx_audio for audiobook projects.

Changes

✅ Exposes generate_audio() for direct function calls outside the library.
✅ Supports multiple audio formats (wav, flac, etc.).
✅ Allows external scripts to generate audiobook chapters with custom voices, speed, and language codes.
✅ Includes a verbose flag to control logging output.

--
✅ Added from_cli flag: a simple way to handle CLI vs. script execution differences. This ensures that CLI-generated files get the _000 suffix as expected, while script-based calls save files without a suffix (e.g., audiobook_chapter1.wav).

Usage Example

External projects can now generate audiobooks like this:

from mlx_audio.tts.generate import generate_audio

generate_audio(
    text="Once upon a time...",
    file_path="audiobook_intro",
    audio_format="flac"
)

Why This PR?
This change makes it easier to integrate mlx_audio into audiobook generation projects, showcasing its potential for text-to-speech applications.

I'm going to use it for my experiments here:
https://github.com/sergenes/runandread-audiobook

But I'm pretty sure it will be useful to others as well!

🔥🔥🔥 I love this repo! Thank you for your work! 🚀👏

…g. for audiobook projects

mlx_audio/tts/generate.py

Blaizzy · 2025-03-08T17:53:31Z

Thanks for the great contribution @sergenes!

I left one comment.

…ication

Blaizzy · 2025-03-15T19:47:20Z

Made some small changes.

Could you please run before we merge:

pre-commit run --all

sergenes · 2025-03-16T03:26:59Z

Made some small changes.

Could you please run before we merge:

pre-commit run --all

Done!

sergenes · 2025-03-16T03:27:59Z

Let me know if you'd like me to rebase the branch!

Blaizzy · 2025-03-16T08:40:19Z

Yes, that would be great.

Please do rebase :)

sergenes · 2025-03-16T23:10:06Z

Yes, that would be great.

Please do rebase :)

I was busy with the kids over the weekend. I'll try to find some time at the beginning of next week!

Blaizzy · 2025-03-16T23:12:09Z

No worries, ping me when you do :)

sergenes · 2025-03-17T01:37:19Z

No worries, ping me when you do :)

Done!
Re-rebased with the main branch and restored all features.

I tested both options a bit. Please merge to avoid needing to rebase again! :)

sergenes · 2025-03-17T23:25:52Z

Hmm, it looks like merging is blocked because you pushed to this branch once. What should we do?

sergenes · 2025-03-17T23:26:18Z

I can make a new Pull Request..

sergenes · 2025-03-17T23:30:47Z

New PR: #42

Blaizzy · 2025-03-17T23:37:51Z

@sergenes no need for a new PR, I made the necessary changes and it's ready to be merged :)

sergenes · 2025-03-17T23:38:15Z

thank you!

allow external scripts to generate speech audio using mlx_audio, e.…

f44bada

…g. for audiobook projects

Blaizzy reviewed Mar 8, 2025

View reviewed changes

mlx_audio/tts/generate.py Outdated Show resolved Hide resolved

Refactor TTS Generation: Unified CLI & Script Logic, Eliminating Dupl…

282255a

…ication

Blaizzy self-requested a review March 15, 2025 19:32

Merge branch 'main' into external_script_support

7a0f38c

reformat with pre-commit run --all

c3f9171

Blaizzy and others added 4 commits March 17, 2025 02:17

Merge branch 'main' into external_script_support

c8cb4e9

rebasing with the main

2616529

Re-rebased with the main branch and restored all features.

3060ac3

added sample_rate to join audio

45a509e

Blaizzy added 2 commits March 17, 2025 23:52

Merge branch 'main' into external_script_support

2ad5aa1

Update README.md

883e601

Blaizzy approved these changes Mar 17, 2025

View reviewed changes

format

0be031e

fix formatting

ed16426

Blaizzy added 2 commits March 18, 2025 00:38

add os

b6957bd

update doc string

b79965a

Blaizzy merged commit 47dc0bb into Blaizzy:main Mar 17, 2025
1 check passed

Uh oh!

Conversation

sergenes commented Mar 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Changes

Usage Example

Uh oh!

Uh oh!

Blaizzy commented Mar 8, 2025

Uh oh!

Blaizzy commented Mar 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sergenes commented Mar 16, 2025

Uh oh!

sergenes commented Mar 16, 2025

Uh oh!

Blaizzy commented Mar 16, 2025

Uh oh!

sergenes commented Mar 16, 2025

Uh oh!

Blaizzy commented Mar 16, 2025

Uh oh!

sergenes commented Mar 17, 2025

Uh oh!

sergenes commented Mar 17, 2025

Uh oh!

sergenes commented Mar 17, 2025

Uh oh!

sergenes commented Mar 17, 2025

Uh oh!

Blaizzy commented Mar 17, 2025

Uh oh!

sergenes commented Mar 17, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sergenes commented Mar 8, 2025 •

edited

Loading

Blaizzy commented Mar 15, 2025 •

edited

Loading