Skip to content

Defaulting detect_anomaly to True (in Trainer) #20708

@romeokienzler

Description

@romeokienzler

Description & Motivation

Currently, if detect_anomaly is set to False (default), if losses become NaN or Inf, training continues.
Defaulting detect_anomaly to True (in Trainer) will raise an RTE if losses become NaN or Inf as further training doesn't make sense in this case

Pitch

Will save a lot of unnecessary debug and training time (fail fast)

Alternatives

No response

Additional context

No response

cc @lantiga @Borda

Metadata

Metadata

Assignees

No one assigned

    Labels

    featureIs an improvement or enhancementneeds triageWaiting to be triaged by maintainers

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions