When a ApplyImpulseResponse gets applied, it is sometimes quite strong. To the point you can't really hear the original audio.
The silliest thing I can think of is doing some kind of min|max_snr_db argument pair, sampling that target SNR, and then taking the dry (original) and wet (convolved) signals and adding them together in such a way the SNR is satisfied.
What do you think @iver56 ?