Skip to content

Conversation

@nikita-savelyevv
Copy link
Collaborator

@nikita-savelyevv nikita-savelyevv commented Nov 28, 2025

Changes

Achieved a small time reduction for HAWQ algorithm. Weight quantization error is now fully computed on OV side. I observe about 30% reduction in mixed precision assignment runtime on a Llama-3.1-8B model. There can be small numerical differences though compared to NumPy's linalg Frobenius norm implementation (up to rtol=1e-4 based on the experiments).

Reason for changes

Improving UX.

Ticket

163229

Tests

Extended tests/openvino/optimized_functions/test_compression_functions.py

https://github.com/openvinotoolkit/nncf/actions/runs/19769177402

@github-actions github-actions bot added the NNCF OpenVINO Pull requests that updates NNCF OpenVINO label Nov 28, 2025
@nikita-savelyevv nikita-savelyevv marked this pull request as ready for review November 28, 2025 16:47
@nikita-savelyevv nikita-savelyevv requested a review from a team as a code owner November 28, 2025 16:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

NNCF OpenVINO Pull requests that updates NNCF OpenVINO

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants