Is there any way to improve the quality of generated music? #134
-
Curruntly, the music quality is not so good. |
Beta Was this translation helpful? Give feedback.
Replies: 6 comments 5 replies
-
I don’t know if a larger model or longer inference time can improve the quality of music. I guess the current model is just used to verify the feasibility of generation and editing, and a higher quality model may come later. |
Beta Was this translation helpful? Give feedback.
-
You can see more discussions and issues here. The community has given us many useful prompts and parameter - tuning techniques. |
Beta Was this translation helpful? Give feedback.
-
Here is a reply on a closed issue here, figure maybe people may find it useful. It utilizes ComfyUI:
Also bunch of samples made from variations of the workflow as it develops. |
Beta Was this translation helpful? Give feedback.
-
I initially thought the same but l then started to play around with settings (there's no right or wrong just experiment) I have found success in 27 steps with 45 second songs this way you get some output quick and then can play with other settings until you get what you want. Its also worth using something like QWEN or other LLM to help with describing the instruments finally try it without vocals and just have the instruments then add the vocals after. |
Beta Was this translation helpful? Give feedback.
-
The sampler/schedule combination used with this model makes a huge difference in output quality. Using Euler_Ancestral / Simple at around 50+ steps dramatically improves overall cohesion and sound quality compared to all of the other combinations I have tried. If you generate a 30 second sample with that combination and then try the sampler settings you were using before, I think you'll be able to hear the difference almost immediately. |
Beta Was this translation helpful? Give feedback.
-
I wonder if one could use some music restoration tool. If an image generation is not that good, an upscaler can often conceal some of the artifacts. Maybe some GAN thought to restore old recordings might be able to improve on the audio quality of a finished track. |
Beta Was this translation helpful? Give feedback.
The sampler/schedule combination used with this model makes a huge difference in output quality. Using Euler_Ancestral / Simple at around 50+ steps dramatically improves overall cohesion and sound quality compared to all of the other combinations I have tried. If you generate a 30 second sample with that combination and then try the sampler settings you were using before, I think you'll be able to hear the difference almost immediately.