[quantization] Is there any plan to support 6-bit quantization, as 6-bit quantization is more efficient than 8-bit on ARM CPU? #15432
Replies: 4 comments
-
Hey, this is the MXNet Label Bot.
-
Thanks for the proposal.
-
Current ARM SIMD does not support int32 += int8 × int8 (except on Cortex-A55/A75), so unfortunately 8-bit MACs overflow inside a 4×4 GEMM block: 127 × 127 × 4 exceeds the int16 range, so int8 values must first be widened to int16 and then accumulated as int32 += int16 × int16. 6-bit values, however, do not overflow within a 4×4 GEMM block, so 6-bit quantization can directly use the int16 += int8 × int8 MAC, which is more efficient than 8-bit.
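A quick sanity check of the overflow arithmetic in that comment (a sketch; it assumes worst-case 8-bit magnitudes of 127 and 6-bit magnitudes of 63, with four products summed per MAC step as in the 4×4 block):

```python
# int16 accumulator limit
INT16_MAX = 32767

# 8-bit quantization: magnitudes up to 127; summing four products at once
# gives 127*127*4 = 64516 > 32767, so an int16 accumulator overflows and
# operands must first be widened to int16 (int32 += int16*int16).
worst_case_8bit = 127 * 127 * 4

# 6-bit quantization: magnitudes up to 63; 63*63*4 = 15876 <= 32767,
# so the cheaper int16 += int8*int8 MAC can be used directly.
worst_case_6bit = 63 * 63 * 4

print(worst_case_8bit, worst_case_8bit > INT16_MAX)   # 64516 True
print(worst_case_6bit, worst_case_6bit <= INT16_MAX)  # 15876 True
```

This is why the comment singles out Cortex-A55/A75: with a native int32 += int8 × int8 dot product, the 8-bit worst case never has to fit in 16 bits in the first place.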
-
@tianylijun thanks for the explanation. Tensor Cores compute the 4×4 GEMM with INT8 and seem to handle the overflow. Did you have a chance to look into the difference?
-
Is there any plan to support 6-bit quantization? Because of the data-overflow risk with 8-bit MACs, 6-bit quantization is more efficient than 8-bit on ARM CPUs.