Add initial_weights support for fine-tuning by sirmarcel · Pull Request #7 · lab-cosmo/lorem-jax

sirmarcel · 2026-03-16T16:17:26Z

Summary

Adds initial_weights setting to settings.yaml — loads model weights from a previous run's checkpoint (.msgpack) while starting optimizer, step counter, and data iterator fresh
Recursive merge_params supports partial architecture matches (new layers keep random init)
Records initial_weights path in saved config for reproducibility
Fine-tuning example (my_experiment_finetune/) and tox smoke test
Unit tests for merge_params

Test plan

uvx tox -e tests — all 22 tests pass
uvx ruff format . && uvx ruff check --fix . — clean
Fine-tuning example runs end-to-end (DATASETS=.. lorem-train from my_experiment_finetune/)

🤖 Generated with Claude Code

…ghts Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

PicoCentauri

LGTM!

Add initial_weights support for fine-tuning from pretrained model wei…

63363a5

…ghts Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

PicoCentauri approved these changes Mar 17, 2026

View reviewed changes

sirmarcel merged commit 35e1889 into main Mar 20, 2026
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add initial_weights support for fine-tuning#7

Add initial_weights support for fine-tuning#7
sirmarcel merged 1 commit intomainfrom
feature/initial-weights

sirmarcel commented Mar 16, 2026

Uh oh!

PicoCentauri left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sirmarcel commented Mar 16, 2026

Summary

Test plan

Uh oh!

PicoCentauri left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants