Replies: 1 comment
-
NF4 is a performance optimization, while GGUF is compression. The better approach would be to first make an NF4 model, then turn it into a GGUF model, and have a system that recognizes how to handle both methods when used together.
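For reference, here is a minimal sketch of what "making an NF4 model" usually means in practice, using the bitsandbytes path through transformers. The model id is a placeholder and the text-model class is used purely for illustration; NF4 here happens in memory at load time, so producing a GGUF file afterwards would still require separate conversion tooling (e.g. the llama.cpp converter scripts), and chaining the two is not something the standard tools do out of the box:

```python
# Minimal sketch: on-the-fly NF4 quantization with bitsandbytes via transformers.
# "some/model-id" is a placeholder checkpoint, not a specific recommendation.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

nf4_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # 4-bit NormalFloat data type
    bnb_4bit_use_double_quant=True,         # also quantize the quantization constants
    bnb_4bit_compute_dtype=torch.bfloat16,  # matmuls run in bf16 after dequantization
)

model = AutoModelForCausalLM.from_pretrained(
    "some/model-id",                        # placeholder model id
    quantization_config=nf4_config,
    device_map="auto",
)
```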
-
I’m genuinely not certain whether it’s even possible, so I’m being literal when I ask: could an NF4 model be made from a GGUF model? Would there be any benefit to it? Would it further decrease model size and VRAM requirements while still retaining enough information to be used effectively?
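To make the question concrete, here is a rough standalone sketch (my own illustration, not code from any of these tools) of what NF4 actually stores for a tensor: a 4-bit index into a fixed 16-value codebook plus one absmax scale per block. The codebook values below are rounded approximations of the ones published in the QLoRA paper:

```python
# Illustrative NF4-style block quantization: 4-bit codebook indices + per-block scales.
import torch

# Approximate NF4 code points, rounded from the QLoRA paper.
NF4_LEVELS = torch.tensor([
    -1.0, -0.6962, -0.5251, -0.3949, -0.2844, -0.1848, -0.0911, 0.0,
     0.0796, 0.1609, 0.2461, 0.3379, 0.4407, 0.5626, 0.7230, 1.0,
])

def nf4_quantize(weights: torch.Tensor, block_size: int = 64):
    """Quantize a flat tensor to 4-bit codebook indices plus one scale per block."""
    blocks = weights.reshape(-1, block_size)
    scales = blocks.abs().amax(dim=1, keepdim=True)   # absmax scale per block
    normalized = blocks / scales                       # values now in [-1, 1]
    # nearest NF4 code point for each weight, stored as a 4-bit index
    idx = (normalized.unsqueeze(-1) - NF4_LEVELS).abs().argmin(dim=-1)
    return idx.to(torch.uint8), scales

def nf4_dequantize(idx: torch.Tensor, scales: torch.Tensor) -> torch.Tensor:
    return (NF4_LEVELS[idx.long()] * scales).reshape(-1)

w = torch.randn(4096)
idx, scales = nf4_quantize(w)
print("max reconstruction error:", (w - nf4_dequantize(idx, scales)).abs().max())
```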