-
Notifications
You must be signed in to change notification settings - Fork 394
Open
Description
For cases like #73, we should pre-process the text (apply text normalization) to make the job easier for the text to speech.
See e.g. https://developer.nvidia.com/blog/text-normalization-and-inverse-text-normalization-with-nvidia-nemo
Complicated examples:
- 2/3 -> two thirds
- $100 -> hundred dollars (or "one hundred dollars")
- 50g -> fifty grams
- Sunset Blvd -> Sunset Boulevard
The model might learn some of these from the training data, but we can make its job easier.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels