Commit 676a368
[DRAFT][TTS] Magpietts Simple API and loading audiocodec from Huggingface (#15172)
* Modularize magpie inference code, move inference code from scripts to example
Signed-off-by: subhankar-ghosh <[email protected]>
* Modify magpie CI with inference changes
Signed-off-by: subhankar-ghosh <[email protected]>
* Renaming magpietts inference scripts from magpie to magpietts
Signed-off-by: subhankar-ghosh <[email protected]>
* infer_batch returns dataclass object
Signed-off-by: subhankar-ghosh <[email protected]>
* Fixed context embedding without context encoder
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Remove unnecessary configurations
Removed multiple long manifest configurations from evalset_config.py.
Signed-off-by: Subhankar Ghosh <[email protected]>
* Removing unused imports
Signed-off-by: subhankar-ghosh <[email protected]>
* Copilot suggested changes
Signed-off-by: subhankar-ghosh <[email protected]>
* Copilot suggested changes
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* do_tts method, load audiocodec from huggingface
Signed-off-by: subhankar-ghosh <[email protected]>
* Move inference helper modules from examples to tts collection
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Review changes
Signed-off-by: subhankar-ghosh <[email protected]>
* Changes suggested in compute_mean_with_confidence_interval
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Linting issue
Signed-off-by: subhankar-ghosh <[email protected]>
* do_tts method, load audiocodec from huggingface
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* register_tokenizer_artifacts to store tokenizer files in .nemo file
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Modularize magpie inference code, move inference code from scripts to example
Signed-off-by: subhankar-ghosh <[email protected]>
* Renaming magpietts inference scripts from magpie to magpietts
Signed-off-by: subhankar-ghosh <[email protected]>
* Removing unused imports
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Remove unnecessary configurations
Removed multiple long manifest configurations from evalset_config.py.
Signed-off-by: Subhankar Ghosh <[email protected]>
* Copilot suggested changes
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Move inference helper modules from examples to tts collection
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Changes suggested in compute_mean_with_confidence_interval
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* do_tts method, load audiocodec from huggingface
Signed-off-by: subhankar-ghosh <[email protected]>
* register_tokenizer_artifacts to store tokenizer files in .nemo file
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* rebase with main issues
Signed-off-by: subhankar-ghosh <[email protected]>
* changed datasets to json input, moved json file to examples/tts
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Remove unwanted dataconfig.
Signed-off-by: subhankar-ghosh <[email protected]>
* optional utmos import, text_normalization cache and check, test updated
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Update nemo/collections/tts/models/magpietts.py
Co-authored-by: Copilot <[email protected]>
Signed-off-by: Subhankar Ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* Update nemo/collections/tts/models/magpietts.py
Co-authored-by: Copilot <[email protected]>
Signed-off-by: Subhankar Ghosh <[email protected]>
* Linting errors
Signed-off-by: subhankar-ghosh <[email protected]>
* Refactored prepare_context_tensors, removed dummy context audio/text from do_tts
Signed-off-by: subhankar-ghosh <[email protected]>
* Apply isort and black reformatting
Signed-off-by: subhankar-ghosh <[email protected]>
* remove utmos, make dataset path required
Signed-off-by: subhankar-ghosh <[email protected]>
* remove unused imports
Signed-off-by: subhankar-ghosh <[email protected]>
* Enable loading MagpieTTS from HF
Signed-off-by: subhankar-ghosh <[email protected]>
* Support speaker index in do_tts api
Signed-off-by: subhankar-ghosh <[email protected]>
---------
Signed-off-by: subhankar-ghosh <[email protected]>
Signed-off-by: subhankar-ghosh <[email protected]>
Signed-off-by: Subhankar Ghosh <[email protected]>
Signed-off-by: Subhankar Ghosh <[email protected]>
Co-authored-by: subhankar-ghosh <[email protected]>
Co-authored-by: Copilot <[email protected]>1 parent 61aa919 commit 676a368
File tree
8 files changed
+731
-248
lines changed- examples/tts
- nemo/collections/tts
- models
- modules/magpietts_inference
- requirements
- tests/functional_tests
8 files changed
+731
-248
lines changedFile renamed without changes.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
21 | 21 | | |
22 | 22 | | |
23 | 23 | | |
24 | | - | |
| 24 | + | |
25 | 25 | | |
26 | 26 | | |
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
30 | 30 | | |
31 | 31 | | |
32 | | - | |
| 32 | + | |
33 | 33 | | |
34 | 34 | | |
35 | 35 | | |
| |||
65 | 65 | | |
66 | 66 | | |
67 | 67 | | |
68 | | - | |
69 | 68 | | |
70 | 69 | | |
71 | | - | |
72 | | - | |
73 | | - | |
74 | | - | |
75 | | - | |
76 | 70 | | |
77 | 71 | | |
78 | 72 | | |
| |||
127 | 121 | | |
128 | 122 | | |
129 | 123 | | |
130 | | - | |
| 124 | + | |
131 | 125 | | |
132 | 126 | | |
133 | 127 | | |
| |||
142 | 136 | | |
143 | 137 | | |
144 | 138 | | |
145 | | - | |
| 139 | + | |
146 | 140 | | |
147 | 141 | | |
148 | 142 | | |
| |||
176 | 170 | | |
177 | 171 | | |
178 | 172 | | |
179 | | - | |
| 173 | + | |
180 | 174 | | |
181 | 175 | | |
182 | 176 | | |
| |||
193 | 187 | | |
194 | 188 | | |
195 | 189 | | |
196 | | - | |
197 | | - | |
198 | | - | |
199 | | - | |
200 | 190 | | |
201 | 191 | | |
202 | 192 | | |
| |||
232 | 222 | | |
233 | 223 | | |
234 | 224 | | |
235 | | - | |
| 225 | + | |
236 | 226 | | |
237 | 227 | | |
238 | 228 | | |
| |||
377 | 367 | | |
378 | 368 | | |
379 | 369 | | |
380 | | - | |
| 370 | + | |
381 | 371 | | |
382 | 372 | | |
383 | | - | |
| 373 | + | |
384 | 374 | | |
385 | 375 | | |
386 | 376 | | |
| |||
487 | 477 | | |
488 | 478 | | |
489 | 479 | | |
490 | | - | |
491 | | - | |
492 | | - | |
| 480 | + | |
| 481 | + | |
493 | 482 | | |
494 | | - | |
| 483 | + | |
495 | 484 | | |
496 | 485 | | |
497 | 486 | | |
| |||
559 | 548 | | |
560 | 549 | | |
561 | 550 | | |
562 | | - | |
| 551 | + | |
563 | 552 | | |
564 | 553 | | |
565 | 554 | | |
| |||
584 | 573 | | |
585 | 574 | | |
586 | 575 | | |
587 | | - | |
| 576 | + | |
588 | 577 | | |
589 | 578 | | |
590 | 579 | | |
| |||
0 commit comments