Skip to content

Commit e06d5c1

Browse files
author
ssjia
committed
Update base for Update on "[ET-VK] Implement linear_q4gsw"
As title. Extend the quantized linear implementation to be able to handle 4-bit per group symmetrically quantized weights. This is in preparation to support using the int8 dot product extension to be able to handle dynamically quantized inputs. Differential Revision: [D81800023](https://our.internmc.facebook.com/intern/diff/D81800023/) [ghstack-poisoned]
1 parent be1ab37 commit e06d5c1

File tree

0 file changed

+0
-0
lines changed

    0 file changed

    +0
    -0
    lines changed

    0 commit comments

    Comments
     (0)