Conversation

@andrey-churkin
Contributor

@andrey-churkin andrey-churkin commented Aug 28, 2025

Changes

  • Fixed incorrect inputs for the MatMulNBits operation that led to a graph break.
  • Fixed incorrect INT4 weight compression in the Gemm operation when the transB attribute is set to 1.
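
To illustrate why the `transB` attribute matters for group-wise INT4 compression, here is a minimal NumPy sketch. It is not the NNCF implementation: `gemm`, `reduction_axis_for_gemm`, and `quantize_groupwise` are hypothetical helpers, and the symmetric signed 4-bit scheme is an assumption chosen for brevity. The only fact taken from the ONNX Gemm specification is that `transB=1` means the weight `B` is transposed before the multiplication, so `B` is stored as `(N, K)` and the reduction (input-channel) axis is 1 rather than 0.

```python
import numpy as np


def gemm(a, b, trans_b=0):
    """Reference ONNX Gemm with alpha=1, beta=0 and no bias (simplified)."""
    return a @ (b.T if trans_b else b)


def reduction_axis_for_gemm(trans_b):
    """Hypothetical helper: axis of B that is summed over in the product.

    transB=0 -> B is (K, N), reduction axis 0.
    transB=1 -> B is (N, K), reduction axis 1.
    """
    return 1 if trans_b else 0


def quantize_groupwise(w, reduction_axis, group_size):
    """Assumed scheme: symmetric signed 4-bit quantization ([-8, 7]) with one
    scale per group of `group_size` values along `reduction_axis`.
    Returns the dequantized weight so the error is easy to inspect."""
    w_moved = np.moveaxis(w, reduction_axis, -1)
    k = w_moved.shape[-1]
    grouped = w_moved.reshape(*w_moved.shape[:-1], k // group_size, group_size)
    max_abs = np.abs(grouped).max(axis=-1, keepdims=True)
    scale = np.where(max_abs == 0, 1.0, max_abs / 7.0)
    q = np.clip(np.round(grouped / scale), -8, 7)
    deq = (q * scale).reshape(w_moved.shape)
    return np.moveaxis(deq, -1, reduction_axis)


rng = np.random.default_rng(0)
K, N, group_size = 8, 4, 4
a = rng.standard_normal((2, K))
b_kn = rng.standard_normal((K, N))  # layout used when transB=0
b_nk = b_kn.T.copy()                # same weight stored for transB=1

# Grouping along the true reduction axis gives the same result in both layouts.
deq_kn = quantize_groupwise(b_kn, reduction_axis_for_gemm(0), group_size)
deq_nk = quantize_groupwise(b_nk, reduction_axis_for_gemm(1), group_size)
y_kn = gemm(a, deq_kn, trans_b=0)
y_nk = gemm(a, deq_nk, trans_b=1)

# Ignoring transB groups along the output-channel axis instead.
deq_wrong = quantize_groupwise(b_nk, 0, group_size)
```

In this sketch `y_kn` and `y_nk` agree, while `deq_wrong` uses different groups and therefore different scales. That is the kind of mismatch an axis mix-up under `transB=1` can produce.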

Related tickets

Ref: 171011
Ref: 171016

Tests

  • tests/onnx/quantization/test_weights_compression.py::test_matmulnbits_gemm

Collaborator

@daniil-lyakhov daniil-lyakhov left a comment

Is it possible to add a test case for the changes?

@andrey-churkin
Contributor Author

> Is it possible to add a test case for the changes?

I've just added it.

@github-actions github-actions bot added the "NNCF ONNX" label (Pull requests that updates NNCF ONNX) Sep 12, 2025
@andrey-churkin andrey-churkin merged commit 58d8d8c into openvinotoolkit:develop Sep 15, 2025
20 checks passed