Skip to content

Conversation

@google-labs-jules
Copy link

This commit addresses two items:

  1. A bugfix for filename sanitization in process_audio.py. The previous implementation did not correctly handle non-ASCII characters (e.g., from Whisper transcriptions), which could create filenames that are incompatible with some filesystems or tooling. The sanitization function now ensures all filenames are ASCII-safe.

  2. Adds a HOW_README.md file with detailed instructions for setting up the environment and using the CLI tool, as requested by the user. This includes the necessary one-time manual step to make ffmpeg accessible.

This commit addresses two items:

1.  A bugfix for filename sanitization in `process_audio.py`. The previous implementation did not correctly handle non-ASCII characters (e.g., from Whisper transcriptions), which could create filenames that are incompatible with some filesystems or tooling. The sanitization function now ensures all filenames are ASCII-safe.

2.  Adds a `HOW_README.md` file with detailed instructions for setting up the environment and using the CLI tool, as requested by the user. This includes the necessary one-time manual step to make `ffmpeg` accessible.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

0 participants