Tool calling is very crucial for making agents which are executing commands and can iterate over themselves. Follow llama.cpp implementation for now to integrate in the inference.