Commit 30412e2

morelos

committed

Update on "[ET] enabling half dtype output for dequantization and making logic consistent"

Enable half dtype output and make the dequantization logic consistent between per_tensor and per_token, because one path is currently more prone to integer overflows than the other.

Differential Revision: [D76289181](https://our.internmc.facebook.com/intern/diff/D76289181/)

[ghstack-poisoned]
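This commit changes no files, so the actual fix is not visible here. As a rough illustration only, the sketch below shows one way an integer overflow can arise during dequantization: the zero point is subtracted in a narrow integer type before scaling. It is not taken from the ExecuTorch sources. All names (`wrap_int8`, `to_half`, `dequantize_narrow`, `dequantize_widened`) are hypothetical, and the int8 wraparound is simulated in pure Python.

```python
import struct


def wrap_int8(x: int) -> int:
    """Simulate two's-complement wraparound of an 8-bit signed integer."""
    return (x + 128) % 256 - 128


def to_half(x: float) -> float:
    """Round a Python float to the nearest IEEE 754 half-precision value."""
    return struct.unpack("e", struct.pack("e", x))[0]


def dequantize_narrow(q: int, scale: float, zero_point: int) -> float:
    # Hypothetical overflow-prone path: the subtraction stays in int8
    # and can wrap around before the scale is applied.
    return to_half(wrap_int8(q - zero_point) * scale)


def dequantize_widened(q: int, scale: float, zero_point: int) -> float:
    # Hypothetical consistent path: subtract in a wider integer type,
    # then scale and round to half precision.
    return to_half((q - zero_point) * scale)


# Extreme int8 inputs: q - zero_point = 255, which does not fit in int8.
q, scale, zero_point = 127, 0.5, -128
narrow = dequantize_narrow(q, scale, zero_point)
wide = dequantize_widened(q, scale, zero_point)
print(narrow, wide)
```

If the per_tensor and per_token paths widened at different points, the same quantized input could dequantize to different values, which is the kind of inconsistency the commit message describes.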

2 parents 7be4878 + fc18583

0 file changed

-0

lines changed

Comments

(0)
