Looking for Guidance on a Custom Solution #23

caymanwjeffers · 2023-06-15T21:44:31Z

caymanwjeffers
Jun 15, 2023

Is it possible to take an image of a pre-existing mel spectogram with a little bit of noise added, and then feed that to the diffusion model at far fewer de-noising steps? Or is it possible to give it an existing audio-prior as a starting point instead of relying on the raw model to do all the work?

I am aware that this would require editing of the existing implementation I am just wondering if it's possible at all or my understanding of this is flawed.

I am essentially looking for a way to create coherent variations of a sound instead of generating them from scratch. Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Looking for Guidance on a Custom Solution #23

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Looking for Guidance on a Custom Solution #23

Uh oh!

Uh oh!

caymanwjeffers Jun 15, 2023

Replies: 0 comments

caymanwjeffers
Jun 15, 2023