lora: Fix LoRA K/V gradient flow with gradient-connected kv cache retrieval #43

makaveli10 · 2025-10-22T08:24:13Z

Add get_k_lora() and get_v_lora() methods that use concatenation instead of ggml_view_4d to maintain gradient connectivity during training. This ensures LoRA K/V parameters receive proper gradients while preserving causal attention behavior.

…rieval Add get_k_lora() and get_v_lora() methods that use concatenation instead of ggml_view_4d to maintain gradient connectivity during training. This ensures LoRA K/V parameters receive proper gradients while preserving causal attention behavior.

github-actions bot added the examples label Oct 22, 2025

makaveli10 force-pushed the vineet/lora-kv-cache branch from 9c3b4a8 to e52e5d7 Compare October 22, 2025 08:27

makaveli10 force-pushed the vineet/lora-kv-cache branch from e52e5d7 to 7f3cae5 Compare October 22, 2025 08:29

makaveli10 mentioned this pull request Oct 24, 2025

Add Instruction Fine-tuning Support for LoRA with Assistant-Only Loss #46

Merged

olyasir approved these changes Oct 24, 2025

View reviewed changes

olyasir merged commit 9f133bb into tetherto:temp-latest-finetuning Oct 24, 2025
38 of 47 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

lora: Fix LoRA K/V gradient flow with gradient-connected kv cache retrieval #43

lora: Fix LoRA K/V gradient flow with gradient-connected kv cache retrieval #43

Uh oh!

makaveli10 commented Oct 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

lora: Fix LoRA K/V gradient flow with gradient-connected kv cache retrieval #43

lora: Fix LoRA K/V gradient flow with gradient-connected kv cache retrieval #43

Uh oh!

Conversation

makaveli10 commented Oct 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants