Skip to content

Conversation

@zhaohb
Copy link

@zhaohb zhaohb commented Oct 24, 2025

Just for EXAONE-4.0-1.2B

ticket : 175853

Copy link
Collaborator

@rkazants rkazants left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please provide tests and JIRA ticket.

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@zhaohb
Copy link
Author

zhaohb commented Oct 31, 2025

Please provide tests and JIRA ticket.

Hi, I had add JITA ticket and tests.
Thank you very much.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

since we have already supported previous "exaone" version. Please add tests for quantization. It should work as well.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

However, when applying quantization, feature-extraction works correctly, but text-classification throws an error:

nncf.errors.InvalidGroupSizeError: Compression of the '__module.model.rotary_emb/aten::matmul/MatMul' layer failed with the following error:
Channel size 1 should be divisible by size of group 128.
Ensure that the group size is divisible by the channel size, or include this node and others with similar issues in the ignored scope:
nncf.compress_weight(
        ...,
        ignored_scope=IgnoredScope(names=["__module.model.rotary_emb/aten::matmul/MatMul"]
        )
)

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

However, when applying quantization, feature-extraction works correctly, but text-classification throws an error:

nncf.errors.InvalidGroupSizeError: Compression of the '__module.model.rotary_emb/aten::matmul/MatMul' layer failed with the following error:
Channel size 1 should be divisible by size of group 128.
Ensure that the group size is divisible by the channel size, or include this node and others with similar issues in the ignored scope:
nncf.compress_weight(
        ...,
        ignored_scope=IgnoredScope(names=["__module.model.rotary_emb/aten::matmul/MatMul"]
        )
)

What version of nncf you are using?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@nikita-savelyevv I don't think this error is related to the NNCF. It's more likely an issue with my dummy model.
I will double check that. thank you.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@zhaohb It could be, because there was a fix addressing this issue. In any case I would suggest to update to NNCF v 2.18

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@nikita-savelyevv
Hi, the issue disappeared after upgrading to version 2.18. It wasn't a problem with the dummy model after all.
Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants