Conversation

wbruna
Contributor

@wbruna wbruna commented Jul 3, 2025

It turns out that a LoRA graph size limit of 15360 is still not enough for some SDXL LoRAs; for instance, a quantized SDXL model with the DMD2 LoRA.

Koboldcpp has been using 20480 for some time, so this value should be enough.

Hopefully fixes #648, #684 and #688.

@leejet
Owner

leejet commented Jul 3, 2025

Could you give an example of a LoRA?

@wbruna
Contributor Author

wbruna commented Jul 3, 2025

I was able to reproduce it with the DMD2 acceleration LoRA (https://huggingface.co/tianweiy/DMD2/blob/main/dmd2_sdxl_4step_lora_fp16.safetensors), but only with a quantized model (Q5_0 or Q8_0); all fp16 models I've tried worked fine with 15360 as the limit.

@leejet
Owner

leejet commented Jul 5, 2025

I think it would be better to make the LoRA graph size variable rather than raising the fixed limit again. I made the corresponding change in the latest master commit.
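For readers following along, here is a minimal sketch of what a variable graph size could look like, assuming the size is derived from the number of tensors in the loaded LoRA. The function name, the table type, and both constants are hypothetical and chosen for illustration only; they are not taken from the actual commit.

```cpp
#include <cstddef>
#include <map>
#include <string>

// Hypothetical constants for illustration; not taken from the actual commit.
constexpr std::size_t kBaseGraphSize  = 10240;  // assumed fixed headroom for the graph
constexpr std::size_t kNodesPerTensor = 10;     // assumed node budget per LoRA tensor

// Stand-in for a loader's name -> tensor table; only the entry count matters here.
using LoraTensorTable = std::map<std::string, int>;

// Scale the graph size with the number of LoRA tensors
// instead of relying on one fixed compile-time limit.
std::size_t estimate_lora_graph_size(const LoraTensorTable& lora_tensors) {
    return kBaseGraphSize + lora_tensors.size() * kNodesPerTensor;
}
```

Calling `estimate_lora_graph_size` on an empty table returns the base size, and each additional tensor adds a fixed budget. A per-tensor budget like this is an upper-bound heuristic, so it may allocate more than a given LoRA strictly needs.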

@wbruna
Contributor Author

wbruna commented Jul 5, 2025

I ran a few tests with 1ce1c1a; the resulting graph size (33880) was a bit of an overkill for the DMD2 LoRA, but it worked fine. Thanks!

@wbruna wbruna closed this Jul 5, 2025
@wbruna wbruna deleted the fix_sdxl_lora_again branch July 5, 2025 15:27
Development

Successfully merging this pull request may close these issues.

PhotoMaker crash with quantized model