llama.cpp and 7B Llama-2-Chat model: resource requirements, if possible #5172
Unanswered · christian-2 asked this question in Q&A
I am new to this project and would like to try inference with llama.cpp and a 7B Llama-2-Chat model: is this combination currently supported, and what are the resource requirements? I have, e.g., a VM with 8 vCPUs and 16 GB RAM, or (if need be) a bare-metal server with 56 CPUs and 500 GB RAM available. The underlying hardware is fairly recent in both cases. I guess the bare-metal server could serve in principle, but could the VM as well?
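For reference, here is a rough back-of-the-envelope estimate of the memory footprint I came up with. It assumes 7 billion parameters and approximate bytes-per-weight figures for a few common llama.cpp quantization formats; the exact file sizes and the KV-cache allowance (assumed ~1 GB here for a modest context size) will vary in practice:

```python
# Rough memory estimate for a 7B-parameter model under a few common
# llama.cpp quantizations. Bytes-per-weight values are approximations;
# actual GGUF file sizes differ slightly per format and metadata.
PARAMS = 7e9

BYTES_PER_WEIGHT = {
    "F16": 2.0,      # half precision, 16 bits/weight
    "Q8_0": 1.0625,  # ~8.5 bits/weight
    "Q4_0": 0.5625,  # ~4.5 bits/weight
}

for quant, bpw in BYTES_PER_WEIGHT.items():
    weights_gb = PARAMS * bpw / 1024**3
    # Assumed allowance for KV cache and runtime buffers at a modest
    # context size; scales up with longer contexts.
    total_gb = weights_gb + 1.0
    print(f"{quant}: ~{weights_gb:.1f} GB weights, ~{total_gb:.1f} GB total")
```

If these estimates are in the right ballpark, even the F16 weights (~13 GB) would fit in the 16 GB VM, and a 4-bit quantization (~4 GB) would leave plenty of headroom, though I may be missing some overhead.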