Default max_seq_length to 128 for ExecuTorch export #1170

swolchok · 2024-09-20T22:19:16Z

Stack from ghstack (oldest at bottom):

With the current default behavior, performance for e.g. stories110Mwithout custom SDPA is bad because the QKV tensors are long (8192 in the last dim). Limiting the max sequence length remedies this.

[ghstack-poisoned]

pytorch-bot · 2024-09-20T22:19:20Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1170

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit c7efb72 with merge base 2cf4016 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

swolchok · 2024-09-24T02:05:16Z

looks like we need to fix torchchat not respecting max_seq_length from PTEs in order to land this

[ghstack-poisoned]

Update

1945ebe

[ghstack-poisoned]

swolchok mentioned this pull request Sep 20, 2024

parallelize ExecuTorch build #1168

Merged

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Sep 20, 2024

This was referenced Sep 20, 2024

add ConvertToLinear, disable custom SDPA for bfloat16 #1169

Merged

export.py: fix custom SDPA type conversion logic & re-enable for bfloat16 #1171

Merged

swolchok requested review from Jack-Khuu, desertfire and kimishpatel September 20, 2024 22:21

Jack-Khuu approved these changes Sep 21, 2024

View reviewed changes

desertfire approved these changes Sep 23, 2024

View reviewed changes

swolchok added 2 commits September 23, 2024 14:34

Update

18f9c40

[ghstack-poisoned]

Update

577865e

[ghstack-poisoned]

swolchok mentioned this pull request Sep 23, 2024

Update ExecuTorch pin to pick up bfloat16 fixes #1181

Merged

Update

2c1e97f

[ghstack-poisoned]

swolchok mentioned this pull request Sep 24, 2024

Use default max_seq_length of 128 when loading ExecuTorch models #1184

Merged

Update

c7efb72

[ghstack-poisoned]

swolchok mentioned this pull request Sep 24, 2024

Also use default max_seq_length of 128 for ExecuTorch native runner #1186

Merged

swolchok merged commit c7efb72 into gh/swolchok/3/base Sep 24, 2024
51 checks passed

swolchok deleted the gh/swolchok/3/head branch September 24, 2024 18:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Default max_seq_length to 128 for ExecuTorch export #1170

Default max_seq_length to 128 for ExecuTorch export #1170

Uh oh!

swolchok commented Sep 20, 2024 •

edited

Loading

Uh oh!

pytorch-bot bot commented Sep 20, 2024 •

edited

Loading

Uh oh!

swolchok commented Sep 24, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Default max_seq_length to 128 for ExecuTorch export #1170

Default max_seq_length to 128 for ExecuTorch export #1170

Uh oh!

Conversation

swolchok commented Sep 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Sep 20, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1170

✅ No Failures

Uh oh!

swolchok commented Sep 24, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

swolchok commented Sep 20, 2024 •

edited

Loading

pytorch-bot bot commented Sep 20, 2024 •

edited

Loading