Commit 9f7d105
morelos
Update on "[ET-VK][Ops] quantization op shaders and impl"
Adds the shaders and implementation for quantize_per_tensor and quantize_per_token, and links them with the testing framework.
NOTE: Currently the only input types supported are **half** (fp16) and **float** (fp32). The only output types supported are **byte** (uint8), **char** (int8), **short** (int16), **int** (int32).
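For readers unfamiliar with the two ops, the following is a minimal CPU-side reference sketch in Python. It is not the Vulkan shader code from this change. It assumes the usual affine quantization formula, `q = clamp(round(x / scale) + zero_point, quant_min, quant_max)`, and that quantize_per_token uses one scale and zero point per token (row). The helper names, the `QUANT_RANGES` table, and the choice of each type's full integer range as the default clamp bounds are all hypothetical and used only for illustration.

```python
# Reference sketch only; NOT the Vulkan shader implementation from this commit.
# Assumed default clamp bounds: the full integer range of each output type
# listed in the NOTE (byte/uint8, char/int8, short/int16, int/int32).
QUANT_RANGES = {
    "uint8": (0, 255),
    "int8": (-128, 127),
    "int16": (-32768, 32767),
    "int32": (-2**31, 2**31 - 1),
}


def quantize_value(x, scale, zero_point, quant_min, quant_max):
    """Affine-quantize one float: round, shift by zero_point, then clamp."""
    # Python's round() rounds halves to even; an actual shader may differ.
    q = round(x / scale) + zero_point
    return max(quant_min, min(quant_max, q))


def quantize_per_tensor(values, scale, zero_point, dtype):
    """Quantize a flat list with a single scale/zero_point for the whole tensor."""
    quant_min, quant_max = QUANT_RANGES[dtype]
    return [quantize_value(x, scale, zero_point, quant_min, quant_max)
            for x in values]


def quantize_per_token(rows, scales, zero_points, dtype):
    """Quantize a list of rows, using one scale/zero_point per row (token)."""
    if not (len(rows) == len(scales) == len(zero_points)):
        raise ValueError("need exactly one scale and zero_point per token")
    quant_min, quant_max = QUANT_RANGES[dtype]
    return [
        [quantize_value(x, s, zp, quant_min, quant_max) for x in row]
        for row, s, zp in zip(rows, scales, zero_points)
    ]
```

In this sketch the only difference between the two ops is the granularity of the quantization parameters: per-tensor shares one pair across all elements, while per-token looks up a pair for each row.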
Differential Revision: [D75959064](https://our.internmc.facebook.com/intern/diff/D75959064/)
[ghstack-poisoned]