Commit 3c710ed

feat: add mapping context features, improve output block
1 parent 4590e0c, commit 3c710ed

4 files changed: +195 -111 lines changed


README.md

Lines changed: 2 additions & 0 deletions
@@ -139,6 +139,7 @@ unet = UNet1d(
     kernel_multiplier_downsample=2,
     use_nearest_upsample=False,
     use_skip_scale=True,
+    use_context_time=True,
 )
 
 x = torch.randn(3, 1, 2 ** 16)
@@ -236,6 +237,7 @@ y_long = composer(y, keep_start=True) # [1, 1, 98304]
 - [x] Add autoencoder bottleneck option for quantization.
 - [x] Add option to provide context tokens (resnet cross attention).
 - [x] Add conditional model with classifier-free guidance.
+- [x] Add option to provide context features mapping.
 
 ## Appreciation

audio_diffusion_pytorch/model.py

Lines changed: 1 addition & 0 deletions
@@ -219,6 +219,7 @@ def get_default_model_kwargs():
         kernel_multiplier_downsample=2,
         use_nearest_upsample=False,
         use_skip_scale=True,
+        use_context_time=True,
         diffusion_sigma_distribution=LogNormalDistribution(mean=-3.0, std=1.0),
         diffusion_sigma_data=0.1,
         diffusion_dynamic_threshold=0.0,
