Replies: 1 comment 1 reply
- Probably won't be adding this on my own; if llama.cpp adds it, then koboldcpp will inherit their support.
- I'm sure you've already seen it, but there's another new model format: AWQ. It claims to be "blazing-fast" with much lower VRAM requirements, apparently an almost 45% reduction.
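For reference, here is a minimal sketch of how AWQ checkpoints are typically produced and loaded with the AutoAWQ Python package. The model path, output directory, and quantization settings below are placeholders, and the exact API may differ between AutoAWQ releases, so treat this as illustrative rather than a recipe:

```python
# Illustrative only: quantizing and loading a model with the AutoAWQ package.
# Paths and the quant_config values are placeholders; check the AutoAWQ docs
# for the current API before relying on this.
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "mistralai/Mistral-7B-v0.1"   # hypothetical source model
quant_path = "mistral-7b-awq"              # hypothetical output directory
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

# Load the full-precision model and its tokenizer
model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)

# Run AWQ calibration and quantization (4-bit weights here), then save the result
model.quantize(tokenizer, quant_config=quant_config)
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)

# Later: load the quantized checkpoint for inference
model = AutoAWQForCausalLM.from_quantized(quant_path, fuse_layers=True)
```

The VRAM savings come from storing weights in 4-bit groups rather than 16-bit floats, which is where the roughly 45% (or better) reduction figure comes from.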