Skip to content

Commit de2298b

Browse files
author
morelos
committed
Update on "[ET-VK][Ops] quantization op shaders and impl"
Creating the quantize_per_tensor and quantize_per_token logic shaders and impl which are linked with the testing framework. NOTE: Currently the only input types supported are **half** (fp16) and **float** (fp32). The only output types supported are **byte** (uint8), **char** (int8), **short** (int16), **int** (int32). Differential Revision: [D75959064](https://our.internmc.facebook.com/intern/diff/D75959064/) [ghstack-poisoned]
2 parents 499dbfd + b93c374 commit de2298b

File tree

6 files changed

+1872
-22
lines changed

6 files changed

+1872
-22
lines changed

0 commit comments

Comments
 (0)