Skip to content

Commit 0f6420c

Browse files
committed
Hack my way into fix attn_mask
1 parent 35aec3c commit 0f6420c

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

pretrain_shared_t5_with_mlm.py

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -35,6 +35,9 @@ def model_provider(pre_process=True, post_process=True):
3535
# TODO @thomasw21: fix this for PP > 1 (the issue is that you're passing two values that require grad)
3636
assert mpu.get_pipeline_model_parallel_world_size() == 1, "PP > 1 is not supported yet"
3737

38+
# TODO @thomasw21 hack to bypass a specific check
39+
args.attn_mask = None
40+
3841
model = SharedT5ModelPipe(
3942
num_tokentypes=0,
4043
parallel_output=True

0 commit comments

Comments
 (0)