apply output layer pruning #5426

iseeyuan · 2024-09-17T17:17:12Z

Summary:
Apply output layer pruning if we are using a model trained with a large output vocabulary to use as a classification task to output only smaller set of vocabulary. The output interface is ensured to be the same as unpruned model.

e.g., if the last linear layer has 2048 x 128k shape, and we trained the model to output only 20 output vocab, then we can prune away the last layer to have a shape of 2048 x 20. But we still expand the 1,20 output shape to 1,128k so that the app consuming the model outputs don't need to change.

Differential Revision: D62143905

pytorch-bot · 2024-09-17T17:17:17Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5426

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 20d7986 with merge base ad95e46 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-09-17T17:17:36Z

This pull request was exported from Phabricator. Differential Revision: D62143905

iseeyuan

LGTM. Thanks for putting it in the source transform! Please make sure both OSS and internal tests pass.

facebook-github-bot · 2024-09-17T18:53:05Z

This pull request was exported from Phabricator. Differential Revision: D62143905

Summary: Pull Request resolved: pytorch#5426 Apply output layer pruning if we are using a model trained with a large output vocabulary to use as a classification task to output only smaller set of vocabulary. The output interface is ensured to be the same as unpruned model. e.g., if the last linear layer has 2048 x 128k shape, and we trained the model to output only 20 output vocab, then we can prune away the last layer to have a shape of 2048 x 20. But we still expand the 1,20 output shape to 1,128k so that the app consuming the model outputs don't need to change. Reviewed By: iseeyuan Differential Revision: D62143905

facebook-github-bot · 2024-09-17T21:58:34Z

This pull request was exported from Phabricator. Differential Revision: D62143905

Summary: Pull Request resolved: pytorch#5426 Apply output layer pruning if we are using a model trained with a large output vocabulary to use as a classification task to output only smaller set of vocabulary. The output interface is ensured to be the same as unpruned model. e.g., if the last linear layer has 2048 x 128k shape, and we trained the model to output only 20 output vocab, then we can prune away the last layer to have a shape of 2048 x 20. But we still expand the 1,20 output shape to 1,128k so that the app consuming the model outputs don't need to change. Reviewed By: tarun292, iseeyuan Differential Revision: D62143905

facebook-github-bot · 2024-09-18T18:08:53Z

This pull request was exported from Phabricator. Differential Revision: D62143905

facebook-github-bot · 2024-09-18T20:14:06Z

This pull request has been merged in 2afcd96.

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Sep 17, 2024

facebook-github-bot added the fb-exported label Sep 17, 2024

iseeyuan commented Sep 17, 2024

View reviewed changes

iseeyuan force-pushed the export-D62143905 branch from 3695692 to 20ea3af Compare September 17, 2024 18:53

tarun292 approved these changes Sep 17, 2024

View reviewed changes

iseeyuan force-pushed the export-D62143905 branch from 20ea3af to dd6f355 Compare September 17, 2024 21:58

iseeyuan force-pushed the export-D62143905 branch from dd6f355 to 20d7986 Compare September 18, 2024 18:08

facebook-github-bot closed this in 2afcd96 Sep 18, 2024

facebook-github-bot added the Merged label Sep 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

apply output layer pruning #5426

apply output layer pruning #5426

Uh oh!

iseeyuan commented Sep 17, 2024

Uh oh!

pytorch-bot bot commented Sep 17, 2024 •

edited

Loading

Uh oh!

facebook-github-bot commented Sep 17, 2024

Uh oh!

iseeyuan left a comment

Uh oh!

facebook-github-bot commented Sep 17, 2024

Uh oh!

facebook-github-bot commented Sep 17, 2024

Uh oh!

facebook-github-bot commented Sep 18, 2024

Uh oh!

facebook-github-bot commented Sep 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

apply output layer pruning #5426

apply output layer pruning #5426

Uh oh!

Conversation

iseeyuan commented Sep 17, 2024

Uh oh!

pytorch-bot bot commented Sep 17, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5426

✅ No Failures

Uh oh!

facebook-github-bot commented Sep 17, 2024

Uh oh!

iseeyuan left a comment

Choose a reason for hiding this comment

Uh oh!

facebook-github-bot commented Sep 17, 2024

Uh oh!

facebook-github-bot commented Sep 17, 2024

Uh oh!

facebook-github-bot commented Sep 18, 2024

Uh oh!

facebook-github-bot commented Sep 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pytorch-bot bot commented Sep 17, 2024 •

edited

Loading