Why does PyTorch Lightning not return the traceback information? #21038

@leeruibin

Bug description

When I use PyTorch Lightning to train a model with trainer.fit(), my code raises an error that is probably caused by a mismatch in feature dimensions.

However, although training fails, I cannot find where the error occurs. Why does Lightning not return the traceback information to help locate the error?

I only get a message like "The size of tensor a (0) must match the size of tensor b (16) at non-singleton dimension 1".

This message is not useful on its own because I have no idea where the error happens.
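For reference, here is a minimal sketch of how the full traceback could be captured by hand while the cause is investigated. It uses only the standard library. `fit_stand_in` and `compute_loss_stand_in` are hypothetical placeholders for `trainer.fit(model)` and the failing tensor operation; they are not part of my actual code or of Lightning, and this does not explain why the traceback is missing in my run.

```python
import traceback


def compute_loss_stand_in():
    # Hypothetical stand-in for the failing tensor operation. It raises the
    # same message that was reported, without needing torch installed.
    raise RuntimeError(
        "The size of tensor a (0) must match the size of tensor b (16) "
        "at non-singleton dimension 1"
    )


def fit_stand_in():
    # Hypothetical stand-in for trainer.fit(model).
    compute_loss_stand_in()


def run_and_capture(fn):
    """Call fn and return the formatted traceback if it raises, else None."""
    try:
        fn()
    except Exception:
        # format_exc() returns the whole stack as a string, including the
        # file, line number, and function name of every frame.
        return traceback.format_exc()
    return None


captured = run_and_capture(fit_stand_in)
print(captured)
```

With a wrapper like this around the real training call, the printed text should name the frame that raised the error, which is the information I am missing.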

What version are you seeing the problem on?

v2.5

Reproduced in studio

No response

How to reproduce the bug

Error messages and logs

# Error messages and logs here please

Environment

Current environment
#- PyTorch Lightning Version (e.g., 2.5.0):
#- PyTorch Version (e.g., 2.5):
#- Python version (e.g., 3.12):
#- OS (e.g., Linux):
#- CUDA/cuDNN version:
#- GPU models and configuration:
#- How you installed Lightning(`conda`, `pip`, source):

More info

No response

Labels: bug, needs triage, ver: 2.5.x