Commit 95b6192
ssjia
Update on "[ET-VK] Implement linear_q4gsw"
As title. Extend the quantized linear implementation to handle 4-bit, group-wise symmetrically quantized weights. This prepares for using the int8 dot product extension to handle dynamically quantized inputs.
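For readers unfamiliar with the weight format, the following is a minimal Python sketch of group-wise symmetric 4-bit quantization in general. It is not the ET-VK implementation: the function names, the flat-list layout, and the choice to map each group onto [-7, 7] are assumptions made for illustration, and the actual operator may pack and scale weights differently.

```python
def quantize_q4_groupwise(weights, group_size):
    """Quantize a flat row of floats to signed 4-bit ints, one scale per group.

    Hypothetical helper for illustration only; not part of ExecuTorch.
    """
    if len(weights) % group_size != 0:
        raise ValueError("row length must be a multiple of group_size")
    qvals, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        max_abs = max(abs(w) for w in group)
        # Symmetric scheme: no zero point, so only a scale is stored per group.
        # Assumption: the largest magnitude in the group maps to 7.
        scale = max_abs / 7.0 if max_abs > 0 else 1.0
        scales.append(scale)
        for w in group:
            q = round(w / scale)
            # Clamp to the signed 4-bit range [-8, 7].
            qvals.append(max(-8, min(7, q)))
    return qvals, scales


def dequantize_q4_groupwise(qvals, scales, group_size):
    """Recover approximate floats by multiplying each value by its group scale."""
    return [q * scales[i // group_size] for i, q in enumerate(qvals)]


row = [7.0, -7.0, 3.0, 1.0, 14.0, -2.0, 4.0, 0.0]
qvals, scales = quantize_q4_groupwise(row, group_size=4)
restored = dequantize_q4_groupwise(qvals, scales, group_size=4)
```

Because each group carries its own scale, an outlier in one group does not reduce the resolution available to the others, which is the usual motivation for per-group rather than per-tensor scales.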
Differential Revision: [D81800023](https://our.internmc.facebook.com/intern/diff/D81800023/)
[ghstack-poisoned]
0 files changed (+0, -0 lines)