RefineGAN #232
Replies: 2 comments 2 replies
-
Using RefineGAN isn't a good idea unless you use it with a pretrained model. Without one, you would have to train your model from absolute zero, which requires far more training samples and far more computational power (or months of training time). Just stick to HiFi-GAN and check what went wrong. Is your training material good enough? Did you train it properly? I recently learned that the batch size matters a lot for quality, not only for speed.
-
Thanks for your response. I've been using a batch size of 32. Maybe that is the problem. You have 4 hours of training data? That's quite a lot! How are the cloning and accent at that amount? Perhaps I should also increase my training data... Is it generally better to split the WAV files (training data) into smaller chunks and remove silence?
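For anyone wondering what "splitting into chunks and removing silence" looks like in practice, here is a rough sketch using only the Python standard library. It is an illustration, not how this program preprocesses audio: the function names, the 3-second chunk length, and the RMS threshold of 500 are all assumptions made up for the example, and it assumes mono 16-bit PCM WAV input.

```python
import math
import sys
import wave
from array import array


def rms(samples):
    """Root-mean-square level of a sequence of 16-bit PCM samples."""
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def split_and_drop_silence(samples, sample_rate, chunk_seconds=3.0, silence_rms=500.0):
    """Cut samples into fixed-length chunks and keep only the non-silent ones.

    chunk_seconds and silence_rms are illustrative defaults (assumptions),
    not values recommended by any RVC tool.
    """
    chunk_len = int(sample_rate * chunk_seconds)
    if chunk_len <= 0:
        raise ValueError("chunk length must be positive")
    kept = []
    for start in range(0, len(samples), chunk_len):
        chunk = samples[start:start + chunk_len]
        # A chunk whose overall level is below the threshold counts as silence.
        if rms(chunk) >= silence_rms:
            kept.append(chunk)
    return kept


def read_mono_16bit_wav(path):
    """Load a mono 16-bit PCM WAV file as (samples, sample_rate)."""
    with wave.open(path, "rb") as wf:
        if wf.getnchannels() != 1 or wf.getsampwidth() != 2:
            raise ValueError("expected mono 16-bit PCM")
        samples = array("h")
        samples.frombytes(wf.readframes(wf.getnframes()))
        if sys.byteorder == "big":
            samples.byteswap()  # WAV data is little-endian
        return samples, wf.getframerate()
```

Something like `split_and_drop_silence(*read_mono_16bit_wav("my_voice.wav"))` (hypothetical file name) would then return the chunks worth keeping. A fixed-length cut can split words in half, so a real slicer would more likely cut at the quiet points themselves.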
-
Greetings! First, thank you for this wonderful program you made. It seems the original RVC is no longer maintained, and so far I think this is the best alternative. However, I have a problem. First, my setup: I have 12 minutes of audio of my voice in 3 WAV files. When I use HiFi-GAN and train for about 100-300 epochs, it works, but the voice isn't really like my voice (any tips to improve this?). So I decided to use RefineGAN instead (highest audio fidelity), with the pretrained model type set to None (I can't use Default). Using the same WAV files, I trained for around 250 epochs. Training succeeds, but when I test the voice, it sounds whispery and robotic. Perhaps I'm doing something wrong? Or instead of None as the pretrained model, perhaps I should use Custom? But how do I set that up? Any help would be greatly appreciated! Thanks in advance. Warm regards!