llama: use F32 precision in GLM4 attention and no FA #9130

Merged
ggerganov merged 1 commit into ggml-org:master from piDack:fix_glm4_ggg_err
Aug 23, 2024

Commits

Commits on Aug 22, 2024