LLaMA C++ repository has lots of great [examples](https://github.com/ggerganov/llama.cpp/tree/master/examples) but I think its best to focus on four for now. - [x] [main](https://github.com/ggerganov/llama.cpp/tree/master/examples/main) - [x] [server](https://github.com/ggerganov/llama.cpp/tree/master/examples/server) - [ ] [imatrix](https://github.com/ggerganov/llama.cpp/tree/master/examples/imatrix) - [ ] [quantize](https://github.com/ggerganov/llama.cpp/tree/master/examples/quantize) Once these four tutorials are finished, then I can make a tutorial on how to take a new model and [add support](https://github.com/ggerganov/llama.cpp/blob/master/docs/development/HOWTO-add-model.md) for LLaMA C++.