Commit 30412e2

morelos committed
Update on "[ET] enabling half dtype output for dequantization and making logic consistent"
Enable half dtype output and make the dequantization logic consistent between per_tensor and per_token, because one of the two paths is currently prone to integer overflow while the other is not.

Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/)

[ghstack-poisoned]
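This commit carries no diff, so the following is only a minimal sketch of the kind of overflow the description refers to, not the actual ExecuTorch code. It assumes affine dequantization of the form `(q - zero_point) * scale` on int8 inputs. The helper names (`wrap_int8`, `to_half`, `dequantize_narrow`, `dequantize_widened`) are hypothetical and exist only for illustration.

```python
import struct


def wrap_int8(x: int) -> int:
    """Simulate two's-complement int8 wraparound (illustration only)."""
    return ((x + 128) % 256) - 128


def to_half(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def dequantize_narrow(q: int, scale: float, zero_point: int) -> float:
    # Hypothetical overflow-prone path: the subtraction stays in int8,
    # so q - zero_point can wrap around before it is scaled.
    return wrap_int8(q - zero_point) * scale


def dequantize_widened(q: int, scale: float, zero_point: int,
                       half_output: bool = False) -> float:
    # Assumed safe path: the subtraction happens in a wider integer type
    # (Python ints never overflow, standing in for int32 here).
    value = (q - zero_point) * scale
    # Optionally round the result to half precision for the output.
    return to_half(value) if half_output else value


if __name__ == "__main__":
    q, scale, zero_point = 127, 0.5, -128
    print(dequantize_narrow(q, scale, zero_point))
    print(dequantize_widened(q, scale, zero_point, half_output=True))
```

If both the per_tensor and per_token paths widen before subtracting, they should agree on every input, which is the consistency the commit message asks for.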
2 parents 7be4878 + fc18583 commit 30412e2

0 files changed (+0, -0 lines changed)

