Collaborator

@clessig clessig commented Jan 14, 2026

Description

Adapt plot_train to the new, more complex structure of the loss terms.

Issue Number

Closes #1606

Checklist before asking for review

  • I have performed a self-review of my code
  • My changes comply with basic sanity checks:
    • I have fixed formatting issues with ./scripts/actions.sh lint
    • I have run unit tests with ./scripts/actions.sh unit-test
    • I have documented my code and I have updated the docstrings.
    • I have added unit tests, if relevant
  • I have tried my changes with data and code:
    • I have run the integration tests with ./scripts/actions.sh integration-test
    • (bigger changes) I have run a full training and I have written in the comment the run_id(s): launch-slurm.py --time 60
    • (bigger changes and experiments) I have shared a HedgeDoc in the GitHub issue with all the configurations and runs for these experiments
  • I have informed and aligned with people impacted by my change:
    • for config changes: the Mattermost channels and/or a design doc
    • for changes of dependencies: the Mattermost software development channel

…culator and various other details cleaned up
…of config is passed to LRScheduler, which leads to major simplifications
clessig and others added 27 commits January 11, 2026 19:18
…hub.com:ecmwf/WeatherGenerator into clessig/develop/fix_config_1534
…allows to specify number of samples;

Added copyright statement
…yiung the number of samples).

Fixed some minor subtle problems in validate() to fully distinguish validation and testing.
* Add option to modify teacher TODO fix ema update

* Fix EMA under teacher and student model differences

* Attempt to revert newline

* Raise error if teacher has weights not in student

* Clessig/sophiex/dev/teacher overrides (#1585)

* Simplified error message

* Added support for target_and_aux configs

* Fix bug that validation EMA params are not used

* Removing unused/superfluous function

* Removed debug statement

* Changed config so that target_aux params are specified as dict at the appropriate place

---------

Co-authored-by: Christian Lessig <[email protected]>
* Adapt code for batchsize > 1

* Fixed comment

---------

Co-authored-by: Christian Lessig <[email protected]>
@github-actions bot added the "model" label (Related to model training or definition, not generic infra) on Jan 14, 2026
Labels

model: Related to model training or definition (not generic infra)

Development

Successfully merging this pull request may close these issues.

plot_train is broken

4 participants