Refresh "Quantization Overview" docs page #2643

@andrewor14

Description

https://docs.pytorch.org/ao/main/quantization.html

Our Quantization Overview docs page is somewhat outdated and too developer-focused. We should restructure it as follows:

  • Organize the page around the two main quantization APIs: quantize_ and PT2E
  • Cover the different quantization features supported, with example user code snippets and links to the relevant doc pages
  • Move all developer concepts (e.g. AQT, layout, quant primitives) to a separate page
  • Delete the static quant and serialization sections, since these have their own tutorials
  • Refresh the list of supported dtypes (it still mentions adding torch.uint2) and add the newer mx dtypes
  • Replace the ASCII art with a real image
