Skip to content

update xpu quantization cost#1618

Open
xin3he wants to merge 1 commit intomainfrom
xinhe/3-26
Open

update xpu quantization cost#1618
xin3he wants to merge 1 commit intomainfrom
xinhe/3-26

Conversation

@xin3he
Copy link
Contributor

@xin3he xin3he commented Mar 26, 2026

Description

update xpu quantization cost

Type of Change

  • Bug fix
  • New feature
  • Documentation update
  • Performance improvement
  • Code refactoring
  • Other (please specify):

Related Issues

Fixes or relates to #1027

Checklist Before Submitting

  • My code has been tested locally.
  • Documentation has been updated as needed.
  • New or updated tests are included where applicable.

@xin3he xin3he requested review from Copilot and wenhuach21 and removed request for Copilot March 26, 2026 03:15
Signed-off-by: Xin He <xin3.he@intel.com>
@xin3he xin3he requested review from chensuyue and thuang6 March 26, 2026 03:31
|------------------------------------------------------------------------------------------------------------------------|-------------------|-------------------|---------------------|---------------------|---------------------|
| 2.11.0+xpu with torch compile | 20min<br/>10.7GB | 26min<br/>13.2GB | 58min<br/>22.8GB | OOM | OOM |
| 2.11.0+xpu with torch compile<br/>low_gpu_mem_usage=True | 29min<br/>9.5GB | 38min<br/>9.8GB | 1h 23min<br/>19.4GB | 1h 32min<br/>20.1GB | 5h 33min<br/>22.8GB |
| 2.11.0+xpu with torch compile<br/>low_gpu_mem_usage=True<br/>gradient_accumulate_steps=8,bs=1 | 41min<br/>1.3GB | 42min<br/>1.8GB | 1h 29min<br/>3.6GB | 2h 4min<br/>4.6GB ||
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

what does "-" mean here? too long too wait for complete?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants