Pull requests: hiyouga/LlamaFactory

Pull requests list

Add workflow for building ROCm image
#10330 opened Mar 30, 2026 by ErikJiang
fix: pin 12 unpinned action(s)
#10325 opened Mar 26, 2026 by dagecko
[WIP] Support huggingface/kernels
#10319 opened Mar 25, 2026 by zheliuyu Draft
2 tasks
fix: add qwen3_5_moe to MoE configuration in moe.py (label: invalid)
#10307 opened Mar 21, 2026 by majiayu000
feat: clearer train_result metrics log through calculate_tps function
#10288 opened Mar 17, 2026 by UmeanNever
1 of 2 tasks
[V1]support resume training from checkpoint
#10280 opened Mar 13, 2026 by frozenleaves
feat: add LightOnOCR-2 integration for LoRA/QLoRA fine-tuning
#10192 opened Feb 16, 2026 by johnlockejrr
2 tasks
Fix memory leak on MPS by explicitly clearing cache in trainer step
#10190 opened Feb 14, 2026 by asebaq
1 of 2 tasks
[v1] Add hyperparams and training docs
#10188 opened Feb 13, 2026 by frozenleaves
[deps] Add libibverbs for RDMA support
#10185 opened Feb 12, 2026 by RossCZ
1 of 2 tasks
Feature: experimental fine-tuning comparison
#10172 opened Feb 6, 2026 by caterina0718
[feat] Add DeepSpeed ZeRO-3 LoRA checkpoint save support
#10124 opened Jan 22, 2026 by kimberlykang
2 tasks done
[model] support NVIDIA's Audio-Flamingo-3 audio model
#9740 opened Jan 9, 2026 by vovanphuc
4 tasks done
Add entropy logging for SFT training path
#9717 opened Jan 5, 2026 by pankd
Support loss_mask in dataset to control loss calculation for specific turns (label: solved)
#9630 opened Dec 18, 2025 by CjangCjengh
2 tasks
Add hf_infer script for inference using HuggingFace backend (label: pending)
#9370 opened Oct 29, 2025 by WinterShiver
1 of 2 tasks
support pre-tokenized parquet datasets (label: pending)
#9351 opened Oct 25, 2025 by AbdulmalikDS
2 of 3 tasks
Implement LoRA for MoE with support for LoRA injection for nn.parameters (label: pending)
#9337 opened Oct 23, 2025 by Ziheng-Zhang-AUS
2 tasks done