What is the chance of running the Llama language model locally? #14879
Unanswered · elephantpanda asked this question in General
Replies: 2 comments, 2 replies
-
People have gotten the model to run locally on a GPU with 16GB of VRAM (CUDA) using torch. Maybe someone from ONNX Runtime should try it to make sure it works with your system too.
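For reference, a minimal sketch of what that torch-based setup typically looks like, assuming the weights have already been converted to the Hugging Face format (the model path below is a placeholder, not an official distribution):

```python
# Minimal sketch: load LLaMA-7B in float16 on a single CUDA GPU via transformers.
# Assumes the weights were already converted to Hugging Face format;
# the path is a placeholder.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = "path/to/llama-7b-hf"  # placeholder: your converted checkpoint

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    torch_dtype=torch.float16,  # ~14 GB of weights, fits in 16 GB of VRAM
    device_map="auto",          # places the layers on the available GPU
)

inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```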
-
See https://github.com/ggerganov/llama.cpp; maybe they already have a WASM module for it. They have also managed to run quantized LLaMA on Android in Termux.
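If you want to drive llama.cpp from Python, the llama-cpp-python bindings are one option (an assumption on my part; the reply above only mentions the C++ project itself). This sketch assumes the weights have already been converted and quantized with the scripts in the llama.cpp repo; the file path is a placeholder:

```python
# Minimal sketch: run a quantized LLaMA model through the llama.cpp Python
# bindings (pip install llama-cpp-python). The model path is a placeholder
# for a checkpoint quantized with the llama.cpp conversion scripts.
from llama_cpp import Llama

llm = Llama(model_path="path/to/llama-7b-q4_0.bin", n_ctx=512)

result = llm("Q: What is the capital of France? A:", max_tokens=32, stop=["Q:"])
print(result["choices"][0]["text"])
```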
-
Hi, with the new Llama language model from Meta, what is the possibility that one could run it successfully locally on, say, a GeForce 3080 32GB GPU, using ONNX Runtime? (Even though Llama is a rival to the Microsoft-backed ChatGPT(!!).)
It is reported to have a 7B-parameter model, which I guess in float16 would come to about 14GB; a rough check and an ONNX Runtime sketch are included below.
Should I even be asking this question 😯? Well, even though it is from Meta, people still want to run it on Microsoft Windows...
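For what it's worth, the 14GB figure follows from 7e9 parameters × 2 bytes each in float16. If an ONNX export of the model were available, running it on the GPU with ONNX Runtime would look roughly like the sketch below; the model file name is a placeholder, and I'm not aware of an official LLaMA ONNX export, so treat this as an assumption about the workflow rather than a supported path:

```python
# Rough sketch, assuming a LLaMA ONNX export exists (placeholder file name).
# Shown only to illustrate the ONNX Runtime API, not an officially supported path.
import onnxruntime as ort

# Back-of-the-envelope VRAM estimate for the weights alone:
params = 7e9
bytes_fp16 = params * 2                            # 2 bytes per parameter in float16
print(f"~{bytes_fp16 / 1e9:.0f} GB of weights")    # ~14 GB

session = ort.InferenceSession(
    "llama-7b.onnx",                               # placeholder export
    providers=["CUDAExecutionProvider", "CPUExecutionProvider"],
)
print([i.name for i in session.get_inputs()])      # inspect the expected inputs
```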