add support for per-head attention quantization#1791
Open
eldarkurtic wants to merge 1 commit intovllm-project:mainfrom
Open
add support for per-head attention quantization#1791eldarkurtic wants to merge 1 commit intovllm-project:mainfrom
eldarkurtic wants to merge 1 commit intovllm-project:mainfrom