I got `AssertionError: Mask is silently ignored due to the use of a custom kernel` when training GPT-2 with `examples/pretrain_gpt.sh`.
This line leads to the assertion error:
    assert mask is None, "Mask is silently ignored due to the use of a custom kernel"
Is this assertion necessary? And is it even correct?