fix fp8 accuracy drop issue by mengniwang95 · Pull Request #1611 · intel/auto-round

mengniwang95 · 2026-03-25T06:58:27Z

Description

fp8 quant_func only use weight.max() for scale calculation when tuning

Type of Change

acc before:

Tasks	Version	Filter	n-shot	Metric		Value		Stderr
gsm8k	3	flexible-extract	5	exact_match	↑	0.7286	±	0.0122
		strict-match	5	exact_match	↑	0.7316	±	0.0122

acc after:

Tasks	Version	Filter	n-shot	Metric		Value		Stderr
gsm8k	3	flexible-extract	5	exact_match	↑	0.7589	±	0.0118
		strict-match	5	exact_match	↑	0.7619	±	0.0117

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

for more information, see https://pre-commit.ci

wenhuach21 · 2026-03-25T07:40:48Z

Really nice catch! Thanks!

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

for more information, see https://pre-commit.ci

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

wenhuach21 · 2026-03-25T15:25:23Z

based on the data,I still recommend using rtn

mengniwang95 · 2026-03-26T01:39:58Z

based on the data,I still recommend using rtn

okay

for more information, see https://pre-commit.ci

auto_round/compressors/base.py

wenhuach21 · 2026-03-26T05:08:50Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-03-26T05:09:00Z

Azure Pipelines successfully started running 1 pipeline(s).

mengniwang95 and others added 3 commits March 25, 2026 06:56

fix fp8 quant_func

e008e29

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

Merge branch 'main' into mengni/fp8_fix

a231385

[pre-commit.ci] auto fixes from pre-commit.com hooks

6aedfe0

for more information, see https://pre-commit.ci

chensuyue added this to the 0.12.0 milestone Mar 25, 2026

xin3he approved these changes Mar 25, 2026

View reviewed changes

mengniwang95 and others added 4 commits March 25, 2026 12:00

fix CI

f42c503

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

Merge branch 'main' into mengni/fp8_fix

68b8400

[pre-commit.ci] auto fixes from pre-commit.com hooks

b98f4bb

for more information, see https://pre-commit.ci

update acc and remove warning

d6e9eb5

Signed-off-by: Mengni Wang <mengni.wang@intel.com>

mengniwang95 and others added 2 commits March 26, 2026 09:42

Update base.py

0001f0a

[pre-commit.ci] auto fixes from pre-commit.com hooks

1a59fe2

for more information, see https://pre-commit.ci

wenhuach21 reviewed Mar 26, 2026

View reviewed changes

auto_round/compressors/base.py Outdated Show resolved Hide resolved

mengniwang95 added 4 commits March 26, 2026 10:23

Update base.py

bbc483f

Merge branch 'main' into mengni/fp8_fix

c48c3d1

fix CI

564e2d9

Merge branch 'main' into mengni/fp8_fix

46c1221

wenhuach21 changed the title ~~fix fp8 quant_func~~ fix fp8 accuracy drop issue Mar 26, 2026

wenhuach21 merged commit b78453c into main Mar 26, 2026
40 checks passed

wenhuach21 deleted the mengni/fp8_fix branch March 26, 2026 06:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix fp8 accuracy drop issue#1611

fix fp8 accuracy drop issue#1611
wenhuach21 merged 13 commits intomainfrom
mengni/fp8_fix

mengniwang95 commented Mar 25, 2026

Uh oh!

wenhuach21 commented Mar 25, 2026

Uh oh!

wenhuach21 commented Mar 25, 2026

Uh oh!

mengniwang95 commented Mar 26, 2026

Uh oh!

Uh oh!

wenhuach21 commented Mar 26, 2026

Uh oh!

azure-pipelines bot commented Mar 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

mengniwang95 commented Mar 25, 2026

Description

Type of Change

Uh oh!

wenhuach21 commented Mar 25, 2026

Uh oh!

wenhuach21 commented Mar 25, 2026

Uh oh!

mengniwang95 commented Mar 26, 2026

Uh oh!

Uh oh!

wenhuach21 commented Mar 26, 2026

Uh oh!

azure-pipelines bot commented Mar 26, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants