-
Notifications
You must be signed in to change notification settings - Fork 16
Description
According to the perf test from @yifeizh2, the read lock will bring 10-20% performance regression
| read_lock_guard_t g(g_brgemm_lock); |
| data type | bs | hidden_list | without lock | with lock | |
|---|---|---|---|---|---|
| bf16 | 128 | 16x512 | 0.021597 | 0.01967 | 109.79% |
| bf16 | 128 | 512x256 | 0.025172 | 0.029239 | 86.09% |
| bf16 | 128 | 256x128 | 0.015969 | 0.026034 | 61.34% |
| bf16 | 128 | 512x1024 | 0.046416 | 0.055459 | 83.69% |
| bf16 | 128 | 1024x1024 | 0.082655 | 0.089712 | 92.13% |
| bf16 | 128 | 1024x512 | 0.066204 | 0.066523 | 99.52% |
| bf16 | 128 | 512x256 | 0.025992 | 0.029516 | 88.06% |