- 
                Notifications
    You must be signed in to change notification settings 
- Fork 150
Support exaone4 model #1491
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Support exaone4 model #1491
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please provide tests and JIRA ticket.
| The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. | 
| 
 Hi, I had add JITA ticket and tests. | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
since we have already supported previous "exaone" version. Please add tests for quantization. It should work as well.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
However, when applying quantization, feature-extraction works correctly, but text-classification throws an error:
nncf.errors.InvalidGroupSizeError: Compression of the '__module.model.rotary_emb/aten::matmul/MatMul' layer failed with the following error:
Channel size 1 should be divisible by size of group 128.
Ensure that the group size is divisible by the channel size, or include this node and others with similar issues in the ignored scope:
nncf.compress_weight(
        ...,
        ignored_scope=IgnoredScope(names=["__module.model.rotary_emb/aten::matmul/MatMul"]
        )
)There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
However, when applying quantization, feature-extraction works correctly, but text-classification throws an error:
nncf.errors.InvalidGroupSizeError: Compression of the '__module.model.rotary_emb/aten::matmul/MatMul' layer failed with the following error: Channel size 1 should be divisible by size of group 128. Ensure that the group size is divisible by the channel size, or include this node and others with similar issues in the ignored scope: nncf.compress_weight( ..., ignored_scope=IgnoredScope(names=["__module.model.rotary_emb/aten::matmul/MatMul"] ) )
What version of nncf you are using?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@nikita-savelyevv I don't think this error is related to the NNCF. It's more likely an issue with my dummy model.
I will double check that. thank you.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@zhaohb It could be, because there was a fix addressing this issue. In any case I would suggest to update to NNCF v 2.18
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@nikita-savelyevv
Hi, the issue disappeared after upgrading to version 2.18. It wasn't a problem with the dummy model after all.
Thank you!
Just for EXAONE-4.0-1.2B
ticket : 175853