google/gemma-2-27b-it QLORA #1743
-

Hi everyone,

I tried QLoRA fine-tuning gemma-2-27b with ChatML and flash-attention yesterday, and the resulting model seems confused, even though the loss went down and the run looked smooth overall. Could someone please share tips and tricks for fine-tuning this model?

-

Hey, sorry for the late response. Since it's QLoRA, the impact on the weights is not as large as with LoRA or full fine-tuning. Have you tried those other options?
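For reference, a minimal QLoRA setup for this model might look like the sketch below. The thread doesn't show the actual training stack or config, so this assumes transformers + peft + bitsandbytes; the LoRA hyperparameters are illustrative, and the ChatML data formatting and the training loop are omitted.

```python
# Minimal QLoRA sketch for gemma-2-27b-it (assumed stack: transformers + peft + bitsandbytes).
# Hyperparameters are illustrative only, not the config used in this thread.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_id = "google/gemma-2-27b-it"

# 4-bit NF4 quantization of the frozen base weights -- this is what makes it "Q"LoRA.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    torch_dtype=torch.bfloat16,
    # Gemma 2's attention soft-capping was not supported by flash-attention-2 at release,
    # so "eager" is commonly recommended for fine-tuning; swap in "flash_attention_2"
    # only if your library versions support soft-capping with it.
    attn_implementation="eager",
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

# Small trainable LoRA adapters on the attention and MLP projections of Gemma 2.
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    bias="none",
    task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```

One thing worth checking in a setup like this is the attention implementation: incoherent outputs after an otherwise smooth run can come from the data formatting or the attention backend rather than the adapter itself.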
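To try plain (non-quantized) LoRA as suggested above, the main change under the same assumed stack is to load the base model without the 4-bit config, so the adapters train against full-precision frozen weights. Note that a 27B base in bf16 needs roughly 55 GB of GPU memory for the weights alone.

```python
# Plain LoRA variant (a sketch, same assumed transformers + peft stack):
# no BitsAndBytesConfig, base weights kept in bf16, everything else unchanged.
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained(
    "google/gemma-2-27b-it",
    torch_dtype=torch.bfloat16,   # no 4-bit quantization -> ordinary LoRA rather than QLoRA
    attn_implementation="eager",
    device_map="auto",
)
model = get_peft_model(model, LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05, bias="none", task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
))
```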