Implement the AO API in torchchat quantization handlers and unify logic. #1291

mikekgfb · 2024-10-10T20:39:17Z

Implement the AO API in torchchat quantization handlers and unify logic.

1 - implement .quantize() for TC quantization handlers and support args to make consistent with AO
2 - remove special handling for various combinations of parameters and use validate_args before calling with **q_kwargs
3 - remove check probing whether we successfully loaded a8wx and install an error-reporting handler if loading failed which will be called as quant handler and issue an error
4 - unify both tc and ao quantization handler dicts with shared calling logic
5 - provide informational message when quantizer option not supported (via introspection)

Implement the AO API in torchchat quantization handlers and unify logic. 1 - implement .quantize() for TC quantization handlers and support args to make consistent with AO 2 - remove special handling for various combinations of parameters and use validate_args before calling with **q_kwargs 3 - remove check probing whether we successfully loaded a8wx and install an error-reporting handler if loading failed which will be called as quant handler and issue an error 4 - unify both tc and ao quantization handler dicts with shared calling logic

Added comment, and a missing self parameter

pytorch-bot · 2024-10-10T20:39:20Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1291

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 6926b9c with merge base 95ebcb8 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Fix typo (func -> q.__init__)

Fix arg order (args with default after args w/o default)

larryliu0820 · 2024-10-14T20:58:35Z

torchchat/utils/quantize.py

    torchao_experimental_quant_api_spec.loader.exec_module(torchao_experimental_quant_api)
    from torchao_experimental_quant_api import Int8DynActIntxWeightQuantizer
-    ao_quantizer_class_dict["linear:a8wxdq"] = Int8DynActIntxWeightQuantizer
+    quantizer_class_dict["linear:a8wxdq"] = Int8DynActIntxWeightQuantizer


Any reason we can't put this inside of the quantizer_class_dict?

It seems cleaner to try import Int8DynActIntxWeightQuantizer first and fallback to ErrorHandler, then assign this handler to linear:a8wxdq in quantizer_class_dict.

It seems cleaner to try import Int8DynActIntxWeightQuantizer first and fallback to ErrorHandler, then assign this handler to linear:a8wxdq in quantizer_class_dict.

This is what the code is doing now. Try to import, and if the import fails, set up the error handler.

Or were you thinking to do a wrapper that imports the class Int8DynActIntxWeightQuantizer and then calls the error if it fails, and the imported method if the import succeeds? I assumed that all the conditional import will go away soonish since we should know what version of AO we pin, and whether it has the new class. (And that enablement is there, it doesn't disappear again, so that we can just do a simple import in the future.)

.... Because init can't return an alternate class, the wrapper would have to redispatch all methods internally, which isn't a big deal per se, but may add readability concerns? I'm happy to go either way, ideally as a follow-on.

LMK what you think the best long-term trajectory is for this functionality, and we'll align the code to that. (I think the current version is preferable to the previous state, b/c we don't have to special case in the dispatch loop)

I think if this can go away soon I don't have a strong opinion and can live with this.

larryliu0820

Thanks for the cleanup! Just one comment and please make sure CI jobs are passing

Fixed 2 typos.

mikekgfb · 2024-10-14T21:27:28Z

Thanks for the cleanup! Just one comment and please make sure CI jobs are passing

Fix default args

mikekgfb · 2024-10-14T21:49:32Z

Thanks for the cleanup! Just one comment and please make sure CI jobs are passing

My pleasure. Sorry about that error, should be fixed now.

larryliu0820

CI is passing

mikekgfb added 2 commits October 8, 2024 17:21

Typo / Docs

4378b26

Added comment, and a missing self parameter

facebook-github-bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 10, 2024

mikekgfb added 2 commits October 10, 2024 14:46

Merge branch 'main' into patch-2

4cbd6b6

Update quantize.py

be06ff4

Fix typo (func -> q.__init__)

Jack-Khuu requested review from Jack-Khuu, jerryzh168, larryliu0820 and vmpuri October 12, 2024 00:18

Merge branch 'main' into patch-2

0b62eb7

Jack-Khuu added the Quantization Issues related to Quantization or torchao label Oct 12, 2024

Update quantize.py

bb3c3cd

Fix arg order (args with default after args w/o default)

larryliu0820 reviewed Oct 14, 2024

View reviewed changes

larryliu0820 suggested changes Oct 14, 2024

View reviewed changes

Fix typo

5f19018

Fixed 2 typos.

mikekgfb added 4 commits October 14, 2024 14:39

Fix default args

3956f53

Fix default args

Update quantize.py

72c1373

Update quantize.py

8b924d9

Update quantize.py

3168bc7

Merge branch 'main' into patch-2

159597c

larryliu0820 approved these changes Oct 14, 2024

View reviewed changes

Merge branch 'main' into patch-2

6926b9c

Jack-Khuu merged commit 7d5ba09 into pytorch:main Oct 23, 2024
52 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement the AO API in torchchat quantization handlers and unify logic. #1291

Implement the AO API in torchchat quantization handlers and unify logic. #1291

Uh oh!

mikekgfb commented Oct 10, 2024

Uh oh!

pytorch-bot bot commented Oct 10, 2024 •

edited

Loading

Uh oh!

larryliu0820 Oct 14, 2024

Uh oh!

larryliu0820 Oct 14, 2024

Uh oh!

mikekgfb Oct 14, 2024

Uh oh!

larryliu0820 Oct 14, 2024

Uh oh!

larryliu0820 left a comment •

edited

Loading

Uh oh!

mikekgfb commented Oct 14, 2024

Uh oh!

mikekgfb commented Oct 14, 2024

Uh oh!

larryliu0820 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Implement the AO API in torchchat quantization handlers and unify logic. #1291

Implement the AO API in torchchat quantization handlers and unify logic. #1291

Uh oh!

Conversation

mikekgfb commented Oct 10, 2024

Uh oh!

pytorch-bot bot commented Oct 10, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1291

✅ No Failures

Uh oh!

larryliu0820 Oct 14, 2024

Choose a reason for hiding this comment

Uh oh!

larryliu0820 Oct 14, 2024

Choose a reason for hiding this comment

Uh oh!

mikekgfb Oct 14, 2024

Choose a reason for hiding this comment

Uh oh!

larryliu0820 Oct 14, 2024

Choose a reason for hiding this comment

Uh oh!

larryliu0820 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mikekgfb commented Oct 14, 2024

Uh oh!

mikekgfb commented Oct 14, 2024

Uh oh!

larryliu0820 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pytorch-bot bot commented Oct 10, 2024 •

edited

Loading

larryliu0820 left a comment •

edited

Loading