-
Notifications
You must be signed in to change notification settings - Fork 19.6k
feat(quantization): Add GPTQ n-bit quantization support #21551
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 26 commits
Commits
Show all changes
27 commits
Select commit
Hold shift + click to select a range
a3b1ee2
feat(quantization): Add GPTQ n-bit quantization support
amitsrivastava78 48d6b6e
added dataset to be installed
amitsrivastava78 105a006
Fix the AI comments except one
amitsrivastava78 27e92f1
Fixed gptq algo for inline weights update
amitsrivastava78 907f98c
updated the review comments
amitsrivastava78 73683ad
Renamed the quant to gptqquant class
amitsrivastava78 0e774ab
Renamed the quant file to gptqqnat
amitsrivastava78 278b827
Reworked some superficial comments
amitsrivastava78 6f6871e
Reworked on review comments
amitsrivastava78 d520d75
Removed the huggingfce dependency
amitsrivastava78 d610edb
changed the file name to gptq_config.py
amitsrivastava78 e34e45b
fix comments and added additional test file
amitsrivastava78 4197eef
added test to improve converage
amitsrivastava78 7b0d7b0
removed numerics like +,-,* etc and used keras.ops
amitsrivastava78 9880f36
reworked on the review comments
amitsrivastava78 9ba30bf
updated the interface as per comments
amitsrivastava78 2b49d80
reworked the comments
amitsrivastava78 2ccbf61
fixed failing test case
amitsrivastava78 f83ae0b
Added test case to improve the coverage
amitsrivastava78 a575537
Added test case to improve the coverage
amitsrivastava78 7ad29c6
Added test case to improve the coverage
amitsrivastava78 364d4da
reworke on review comments
amitsrivastava78 eba87d3
reworked on final review comments
amitsrivastava78 2a0a881
fix issue while fixing review comments
amitsrivastava78 420b916
fix minor review comments
amitsrivastava78 ec8c744
fixed failing test case
amitsrivastava78 28df44e
fixed some typos
amitsrivastava78 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.