Conversation


@amakropoulos amakropoulos commented Aug 13, 2025

Rewrite the LLM backend, LlamaLib, as a standalone C++/C# library and adapt LLMUnity to use it.

Features:

  • implement LlamaLib as an object-oriented C++/C# library
  • update llama.cpp to b7664
  • fix the Vulkan GPU backend
  • support Android 16KB page sizes
  • fix iOS / Xcode builds
  • fix RAG functionality on iOS
  • polish the samples
  • optimise the streaming functionality and implement callbacks on the C++ end
  • remove chat templates from LLMUnity in favour of llama.cpp's templating
  • implement property checks
  • common handling for both JSON and GBNF grammars
  • simplify integration of tinyBLAS (a lightweight GPU backend for Nvidia GPUs)
  • move the client / server functionality into LlamaLib

Issues:

@Draco18s Draco18s left a comment


Downloaded this to see if it solves any of the problems I've been having.

Some issues:

  • The sample chat bot scene has a missing script on the LLMCharacter object, and the ChatBot MonoBehaviour has no reference to its LLMAgent, so it throws an NRE.
  • LLMAgent.Warmup calls ChatAsync before the Lama llmAgent.llmAgent field is set (by SetupLLMClient), throwing an NRE even when not using a Lama-based model.
  • Using a coroutine to wait for the field to be non-null before calling Warmup outright crashes the Unity editor.
  • I ended up just waiting for the field to be non-null and calling WarmupCallback directly instead.
  • The option to disable streaming has gone missing. Responses always seem to be in stream mode, and the chatbot demo doesn't populate the chat bubble; it just constantly replaces the text with the next token.

@amakropoulos amakropoulos merged commit 3d478a6 into main Jan 12, 2026
