remove model.model in ckpt loading #1176

Gasoonjia · 2024-09-22T09:15:09Z

This PR introduces a hook for model checkpoint remapping, so that checkpoint loading no longer has to go through model.model. This makes the loading code clearer.
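For illustration only, a minimal sketch of what such a remapping hook might look like. It uses the same argument list as the `_load_model_state_dict` method in the diff, which matches the signature PyTorch passes to hooks registered with `nn.Module._register_load_state_dict_pre_hook`. The function name `remap_checkpoint_keys`, the `inner_name` parameter, and the direction of the remapping (adding a `model.` prefix to keys that lack it) are assumptions made for this sketch, not the PR's actual code. It is written in plain Python so it runs without torch.

```python
def remap_checkpoint_keys(state_dict, prefix, local_metadata=None, strict=True,
                          missing_keys=None, unexpected_keys=None, error_msgs=None,
                          inner_name="model"):
    """Rename checkpoint keys in place before they are loaded.

    Hypothetical sketch: keys saved from the inner module (no "model." prefix)
    are renamed so they match a wrapper that stores the module as `self.model`.
    """
    inner_prefix = f"{prefix}{inner_name}."
    # Iterate over a snapshot of the keys because the dict is mutated in the loop.
    for key in list(state_dict.keys()):
        if key.startswith(prefix) and not key.startswith(inner_prefix):
            # Assumption: an un-prefixed key belongs to the wrapped inner module.
            state_dict[inner_prefix + key[len(prefix):]] = state_dict.pop(key)


# Toy checkpoint with one un-prefixed key and one already-prefixed key.
checkpoint = {"tok_embeddings.weight": [0.1], "model.output.weight": [0.2]}
remap_checkpoint_keys(checkpoint, prefix="")
print(sorted(checkpoint))
```

With a hook along these lines registered on the wrapper, callers could call `load_state_dict` on the wrapper itself instead of reaching into `model.model`.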

pytorch-bot · 2024-09-22T09:15:12Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1176

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 Cancelled Job, 1 Unrelated Failure

As of commit 96b51be with merge base 72d2d20:

CANCELLED JOB - The following job was cancelled. Please retry:

pull / test-tinystories-executorch (16-core-ubuntu) (gh)
##[error]The operation was canceled.

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / test-tinystories-executorch (macos-14-xlarge) (gh) (detected as infra flaky with no log or failing log classifier)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Jack-Khuu · 2024-09-22T09:52:22Z

Feels much cleaner; thanks for adding the hook.

Jack-Khuu · 2024-09-22T09:47:46Z

torchchat/model.py

        return params

+    def _load_model_state_dict(self, state_dict, prefix, local_metadata, strict, missing_keys, unexpected_keys, error_msgs):
+        # 修改 state dict 中的键值


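Multilingual comments (the added code comment is in Chinese; in English it reads "modify the key values in the state dict")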

Jack-Khuu · 2024-09-22T09:51:17Z

torchchat/model.py

                params[key] = patterns[value]
        return params

+    def _load_model_state_dict(self, state_dict, prefix, local_metadata, strict, missing_keys, unexpected_keys, error_msgs):


Comment on what the hook does

Jack-Khuu · 2024-11-06T21:17:31Z

Closing stale PR, feel free to reopen

remove model.model in ckpt loading

3f2cf36

facebook-github-bot added the CLA Signed label Sep 22, 2024

revert tune version back

28c02bd

Gasoonjia requested review from Jack-Khuu and vmpuri September 22, 2024 09:17

update gguf model loading

bd8ff07

Jack-Khuu approved these changes Sep 22, 2024

Gasoonjia added 5 commits September 22, 2024 14:07

expose internal model attribute:

d18da59

avoid inf recursive

b9d1621

get around gguf issue

6831cc6

added doc string for _load_model_state_dict

bcb414f

Merge branch 'main' into rm-modle.model

96b51be

Jack-Khuu closed this Nov 6, 2024

4 participants
