
Multi-resolution dataset for SD1/SDXL #2269

Open

woct0rdho wants to merge 1 commit into kohya-ss:main from woct0rdho:multi-reso-sd

Conversation

@woct0rdho
Contributor

I think multi-resolution training is something we should encourage people to do more. I'm still using SDXL as a lightweight model when I need to upscale images to 4K.

In sd-scripts, multi-resolution datasets are already documented: by creating multiple `[[datasets]]` entries with different resolutions and the same `image_dir`, the same images are trained at several sizes (see the config sketch below). This is already enabled for all newer models (Anima, Flux, Hunyuan, Lumina, SD3), but not for SD1/SDXL. This PR enables it.
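For reference, a minimal sketch of such a config, following the usual sd-scripts dataset TOML layout; the paths, resolutions, and batch sizes here are placeholders:

```toml
# Two datasets share the same image_dir but use different resolutions,
# so each image is bucketed and cached at both sizes.
[general]
caption_extension = ".txt"

[[datasets]]
resolution = 1024
batch_size = 4

  [[datasets.subsets]]
  image_dir = "/path/to/images"

[[datasets]]
resolution = 768
batch_size = 8

  [[datasets.subsets]]
  image_dir = "/path/to/images"
```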

However, this is a breaking change for people who have already cached a lot of latents; they may need a script to migrate the cache.

@kohya-ss
Owner

Thank you, this is great!

Sorry, despite what the documentation says, I think SD1/SDXL currently doesn't handle caching correctly when the image directory is the same, even if the datasets are different.

For existing caches, it would be a good idea to prepare a migration script. Alternatively, as a temporary measure, we could fall back to key names without resolution suffixes.

I'll review and merge this soon, probably tomorrow.

@woct0rdho
Contributor Author

woct0rdho commented Feb 17, 2026

Falling back to key names without resolution suffixes is not always safe. For example, if a user trains at multiple resolutions (768, 1024, 1280) without re-caching the latents after this PR, and we fall back to the old keys, then all three datasets will load the same latents.

Currently I don't do any fallback, so when the user starts training after this PR, all latents will be cached again. The only downside is that the old latents remain in the same npz files. If the user runs out of disk space, they can simply delete the old npz files and cache the latents again.

I guess people who have already cached TBs of latents should know how to write such a migration script themselves...
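For the disk-space case, a throwaway sketch along these lines could prune the stale entries in place, assuming the new cache keys carry a `_{W}x{H}` resolution suffix and the pre-PR keys don't (the actual key scheme is whatever the cache code writes):

```python
import re
import numpy as np

# Keys ending in a "_{W}x{H}" resolution suffix are assumed to be the
# new-style cache entries; everything else is treated as stale.
SUFFIXED = re.compile(r"_\d+x\d+$")

def prune_unsuffixed(npz_path: str) -> None:
    # Load every array, keep only the suffixed ones, and rewrite the file.
    with np.load(npz_path) as npz:
        kept = {k: npz[k] for k in npz.files if SUFFIXED.search(k)}
    np.savez(npz_path, **kept)
```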

@kohya-ss
Owner

Hmm, that certainly could be a problem...

One idea might be to add a guard to the fallback: if the shape of the previously saved latent we would fall back to doesn't match the target resolution, raise an error. I think this would prevent unintended fallbacks.
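For illustration, a minimal sketch of such a guard; the `latents` key and the (..., H/8, W/8) layout are assumptions, while the factor of 8 is SD/SDXL's VAE downscale:

```python
import numpy as np

def checked_fallback(npz_path: str, width: int, height: int) -> np.ndarray:
    # Only accept the old unsuffixed cache entry if its spatial shape
    # matches the target bucket resolution (the VAE downscales by 8).
    expected = (height // 8, width // 8)
    with np.load(npz_path) as npz:
        latents = npz["latents"]  # note: this loads the full array
    if latents.shape[-2:] != expected:
        raise ValueError(
            f"cached latent shape {latents.shape} does not match "
            f"{expected} for resolution {width}x{height}; refusing fallback"
        )
    return latents
```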

@woct0rdho
Contributor Author

woct0rdho commented Feb 17, 2026

If we check the array shape using npz[key].shape, it will load the array data (rather than just the metadata) when checking the cache before training, which is fine for GBs of cache but not so fine for TBs of cache.

It's possible to read only the metadata, but that requires numpy's private API (see the sketch below). Do you think we should implement this? (BTW, reading metadata is easy in safetensors.)
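For reference, a sketch of both approaches; the key names are placeholders, and numpy.lib.format is the semi-private module in question:

```python
import zipfile
from numpy.lib import format as npy_format
from safetensors import safe_open

def npz_shape(npz_path: str, key: str):
    # An .npz is a zip of .npy members; parse only the header of one
    # member to get its shape without decompressing the array data.
    with zipfile.ZipFile(npz_path) as zf, zf.open(key + ".npy") as fp:
        version = npy_format.read_magic(fp)
        if version == (1, 0):
            shape, _, _ = npy_format.read_array_header_1_0(fp)
        else:
            shape, _, _ = npy_format.read_array_header_2_0(fp)
    return shape

def safetensors_shape(st_path: str, key: str):
    # safetensors keeps shapes in its JSON header, so nothing below
    # touches the tensor data itself.
    with safe_open(st_path, framework="np") as f:
        return f.get_slice(key).get_shape()
```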

@kohya-ss
Owner

Thank you, I didn't realize that fallbacks would also need to be considered when checking the cache.

It might be a good idea to release this PR at the same time as the safetensors-format cache feature, together with a script for migrating the cache (adding the resolution suffix and converting to safetensors).
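Something along these lines, perhaps; a sketch only, since the suffix scheme, key names, and the safetensors cache layout are all assumptions until that feature is settled:

```python
import re
import numpy as np
from safetensors.numpy import save_file

def migrate_cache(npz_path: str, st_path: str, width: int, height: int) -> None:
    # Add a "_{W}x{H}" resolution suffix to each old unsuffixed key and
    # write the result as a safetensors file. The suffix must match
    # whatever the new cache code actually expects.
    tensors = {}
    with np.load(npz_path) as npz:
        for key in npz.files:
            new_key = key if re.search(r"_\d+x\d+$", key) else f"{key}_{width}x{height}"
            tensors[new_key] = np.ascontiguousarray(npz[key])
    save_file(tensors, st_path)
```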

