Description
A few days ago I upgraded to version 0.0.31 of the llama-vscode extension. At first the new option to use agents within the extension looked like a great feature. But my setup is so different from what seem to be the new defaults that I have no clue how to get it going.
The wiki is no help here because it covers version 0.0.27 only.
I am lucky to have two modern Macs available: one M Mini, which permanently serves Qwen3 Coder 30B within my private network using llama-server started by a LaunchDaemon, and one MacBook Pro, which is set up to serve gpt-oss 120B locally, also via a LaunchDaemon.
There is no need for me to configure any llama-server launch setups within the llama-vscode extension. Because these services are also used for other AI workloads here, dynamically loading and unloading other models is not an option. And because of memory constraints, the llama-vscode extension could only start additional models itself if they are tiny. That should work for the embeddings, but not for agents, I assume.
I understand how to configure the Chat and FIM services in the extension settings; that looks the same as before. But I cannot find out how to configure the agent model. Most references point to modifying the settings.json file directly, but those 616 lines are too much for me to comprehend.
Could somebody write down how I can get the agents going with my setup, which from my point of view is quite simple and powerful?