Skip to content
Discussion options

You must be logged in to vote

I understood why beta is not required for qgemm, this question can be closed.

Below is my understanding
A = sA * (qA - zA), where qA is quantized A, sA is scale and zA is zeropoint.
B = sB * (qB - zB)
C = sC * (qC - zC)
As per the definition zC=0 and sC = alpha/beta * sA * sB
C = alpha/beta * sA * sB * qC

Gemm = alpha AB + beta*C
QGemm = alpha * sA * (qA - zA) * sB * (qB - zB) + beta * (alpha/beta * sA * sB * qC)
= alpha * sA * (qA - zA) * sB * (qB - zB) + alpha * sA * sB * qC

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@snnn
Comment options

snnn Mar 11, 2022
Maintainer

@yufenglee
Comment options

Answer selected by rasagna-quic
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
3 participants