Speculative partial fix for #4727 #1291

copybara-service · 2026-01-27T01:33:53Z

Speculative partial fix for #4727

We need to pick up commit bazelbuild/apple_support@44c43c7 in every dependency of LiteRT LM.

LiteRT-LM-PiperOrigin-RevId: 829563774

… limits. LiteRT-LM-PiperOrigin-RevId: 829572476

1. provide a real example of using the Kotlin API 1. make it very simple to try out the Kotlin API from source **To run it with bazel**: ``` bazel run -c opt //kotlin/java/com/google/ai/edge/litertlm/example:main -- <abs_model_path> ``` **To build a standalone binary and run**: ``` bazel build -c opt //kotlin/java/com/google/ai/edge/litertlm/example:main_deploy.jar ./bazel-bin/kotlin/java/com/google/ai/edge/litertlm/example/main_deploy.jar -- <abs_model_apth> ``` LiteRT-LM-PiperOrigin-RevId: 829573268

LiteRT-LM-PiperOrigin-RevId: 829591297

LiteRT-LM-PiperOrigin-RevId: 829596495

LiteRT-LM-PiperOrigin-RevId: 829609532

LiteRT-LM-PiperOrigin-RevId: 829625321

LiteRT-LM-PiperOrigin-RevId: 829635897

1. add jvm_flags to get rid of the warning message 2. fix typo in bazel command LiteRT-LM-PiperOrigin-RevId: 829699946

Printing the message as string is a very common use case. Make it super simple. If there is image / audio output in the future, then the toString() of them is something like: `Content(AudioFile("/path/to/file"))` LiteRT-LM-PiperOrigin-RevId: 829877174

Extends the `Message.of(content)` to `Message.of(content1, content2, content3, ...)`. It still works with only one content. LiteRT-LM-PiperOrigin-RevId: 830494523

Apple devices perform better with Buffers. LiteRT-LM-PiperOrigin-RevId: 830924175

Add `__declspec(dllexport)` when building for Windows so the .dll has the exported symbols. https://learn.microsoft.com/en-us/cpp/cpp/using-dllimport-and-dllexport-in-cpp-classes LiteRT-LM-PiperOrigin-RevId: 830991930

See Main.kt :-) This `Flow` approach is also faster than the callback approach because it does not wait for the `onMessage()` logic. LiteRT-LM-PiperOrigin-RevId: 831131771

LiteRT-LM-PiperOrigin-RevId: 831181727

LiteRT-LM-PiperOrigin-RevId: 831194431

Used Options::GetGpuOptions(). The new Create() method takes non-const reference to Options object. LiteRT-LM-PiperOrigin-RevId: 831216121

LiteRT-LM-PiperOrigin-RevId: 831627364

LiteRT-LM-PiperOrigin-RevId: 831637934

LiteRT-LM-PiperOrigin-RevId: 831948857

The new API doesn't require Model but create it internally with model_filename or model_buffer. Also users can get Signature from CompiledModel directly. LiteRT-LM-PiperOrigin-RevId: 832038920

LiteRT-LM-PiperOrigin-RevId: 832039059

LiteRT-LM-PiperOrigin-RevId: 832310477

… cache directory settings to LiteRT-LM C LiteRT-LM-PiperOrigin-RevId: 832381402

LiteRT-LM-PiperOrigin-RevId: 832406710

LiteRT-LM-PiperOrigin-RevId: 832460497

…thod returns null. LiteRT-LM-PiperOrigin-RevId: 832500201

…edHostMemory when environment is not needed. LiteRT-LM-PiperOrigin-RevId: 832519936

- Add sample code for a chat app and animation demo. And a Bazel command to quickly try it. - Mention JVM support, in additional to the original Android support. LiteRT-LM-PiperOrigin-RevId: 832701364

LiteRT-LM-PiperOrigin-RevId: 833396367

LiteRT-LM-PiperOrigin-RevId: 859642028

When context length is big, it increase init time noticeably, e.g. 8s when context length = 32k for gemma3-1b. LiteRT-LM-PiperOrigin-RevId: 859666933

LiteRT-LM-PiperOrigin-RevId: 859770885

LiteRT-LM-PiperOrigin-RevId: 859859182

LiteRT-LM-PiperOrigin-RevId: 859875851

LiteRT-LM-PiperOrigin-RevId: 859999259

LiteRT-LM-PiperOrigin-RevId: 860120569

The given option is true by default and used to set the mmap'ed memory for shared weights are swapped out to reduce memory footprint. When memory is swapped out, all the temporary changes made by magic numbers are reverted. So, when magic numbers are used, the give flags must be disabled. LiteRT-LM-PiperOrigin-RevId: 860150634

LiteRT-LM-PiperOrigin-RevId: 860170788

LiteRT-LM-PiperOrigin-RevId: 860216677

LiteRT-LM-PiperOrigin-RevId: 860278850

We will remove the `--hk_token`. The environment is the only way to set the token. LiteRT-LM-PiperOrigin-RevId: 860319701

LiteRT-LM-PiperOrigin-RevId: 860320494

LiteRT-LM-PiperOrigin-RevId: 860635677

@xla

- Add TensorBuffer Clear method - Replace @local_xla with @xla LiteRT-LM-PiperOrigin-RevId: 860680613

LiteRT-LM-PiperOrigin-RevId: 861221697

LiteRT-LM-PiperOrigin-RevId: 861230859

This is similar to the existing weight cache support that XNNPack uses. LiteRT-LM-PiperOrigin-RevId: 861245691

…ternal constraints and LLGuidance. LiteRT-LM-PiperOrigin-RevId: 861255750

…bled by checking whether there is corresponding `is_appending_to_prefill` in the jinja template. Templates with such capability can support multi-prefill. LiteRT-LM-PiperOrigin-RevId: 861409419

LiteRT-LM-PiperOrigin-RevId: 861449983

LiteRT-LM-PiperOrigin-RevId: 861460877

* Use LFS (Large File Storage) to prebuilt binaries * Enable LFS in workflows

…aries LiteRT-LM-PiperOrigin-RevId: 861722316

We need to pick up commit bazelbuild/apple_support@44c43c7 in every dependency of LiteRT LM. LiteRT-LM-PiperOrigin-RevId: 861425587

gcarranza-1 and others added 30 commits November 7, 2025 13:49

Remove usages of deprecated GpuOptions function overloads.

04b0671

LiteRT-LM-PiperOrigin-RevId: 829563774

Introduce TaskState::kMaxNumTokensReached to signal exceeding token…

c0780bf

… limits. LiteRT-LM-PiperOrigin-RevId: 829572476

Create fake model for testing.

e766268

LiteRT-LM-PiperOrigin-RevId: 829591297

Remove unused litert_qualcomm_options C header.

108d41e

LiteRT-LM-PiperOrigin-RevId: 829596495

Refactor prefill, decode and scoring logic into a new tasks.cc.

1424258

LiteRT-LM-PiperOrigin-RevId: 829609532

Refactor: Pass litert::TensorBuffer by value in pipeline and tasks.

2f4f7a7

LiteRT-LM-PiperOrigin-RevId: 829625321

Improve tool response formatting in Gemma3DataProcessor.

daf2d40

LiteRT-LM-PiperOrigin-RevId: 829635897

[litertlm] minor fix to example/BUILD

c96af67

1. add jvm_flags to get rid of the warning message 2. fix typo in bazel command LiteRT-LM-PiperOrigin-RevId: 829699946

[litertlm] minor improvement for creating Message inline

8dd50f5

Extends the `Message.of(content)` to `Message.of(content1, content2, content3, ...)`. It still works with only one content. LiteRT-LM-PiperOrigin-RevId: 830494523

Do not prefer textures on Apple devices

296e5c4

Apple devices perform better with Buffers. LiteRT-LM-PiperOrigin-RevId: 830924175

[litertlm] Windows JVM build

fcd8640

Add `__declspec(dllexport)` when building for Windows so the .dll has the exported symbols. https://learn.microsoft.com/en-us/cpp/cpp/using-dllimport-and-dllexport-in-cpp-classes LiteRT-LM-PiperOrigin-RevId: 830991930

[litertlm] Add sendMessageAsync that gets response as Flow

925b917

See Main.kt :-) This `Flow` approach is also faster than the callback approach because it does not wait for the `onMessage()` logic. LiteRT-LM-PiperOrigin-RevId: 831131771

Update dependencies of litert_lm

8d685d3

LiteRT-LM-PiperOrigin-RevId: 831181727

Update dependencies of litert_lm

4cdbc62

LiteRT-LM-PiperOrigin-RevId: 831194431

Update litert_lm with new way of setting GpuOptions

33cae51

Used Options::GetGpuOptions(). The new Create() method takes non-const reference to Options object. LiteRT-LM-PiperOrigin-RevId: 831216121

internal change and clean up.

baebde2

LiteRT-LM-PiperOrigin-RevId: 831627364

Internal change and clean up.

efd0c0d

LiteRT-LM-PiperOrigin-RevId: 831637934

Snap the litert dependency to the latest

824d223

LiteRT-LM-PiperOrigin-RevId: 831948857

litert_lm: Apply recent LiteRt CompiledModel::Create() API change

75111c2

The new API doesn't require Model but create it internally with model_filename or model_buffer. Also users can get Signature from CompiledModel directly. LiteRT-LM-PiperOrigin-RevId: 832038920

[litertlm] set the log level to "at least error" in JNI

0d85d9a

LiteRT-LM-PiperOrigin-RevId: 832039059

Remove LiteRT LM usages of LITERT_HOST_MEMORY_BUFFER_ALIGNMENT.

7df3610

LiteRT-LM-PiperOrigin-RevId: 832310477

Add vision/audio backend support in engine settings creation, and add…

4aeacba

… cache directory settings to LiteRT-LM C LiteRT-LM-PiperOrigin-RevId: 832381402

Add ExecutionManager for managing LLM inference tasks.

8438a60

LiteRT-LM-PiperOrigin-RevId: 832406710

Move tensor buffer creation for scoring into the worker thread.

46650bb

LiteRT-LM-PiperOrigin-RevId: 832460497

Implement CreateConstraint in Gemma3DataProcessor. Currently the me…

3073b2b

…thod returns null. LiteRT-LM-PiperOrigin-RevId: 832500201

Refactor LiteRT LM usages of TensorBuffer creation to use CreateManag…

53e7415

…edHostMemory when environment is not needed. LiteRT-LM-PiperOrigin-RevId: 832519936

[litertlm] update Kotlin API doc

a328771

- Add sample code for a chat app and animation demo. And a Bazel command to quickly try it. - Mention JVM support, in additional to the original Android support. LiteRT-LM-PiperOrigin-RevId: 832701364

Internal code change.

66adc81

LiteRT-LM-PiperOrigin-RevId: 833396367

hheydary and others added 20 commits January 22, 2026 09:24

Fix dangling reference issue.

7bf2787

LiteRT-LM-PiperOrigin-RevId: 859642028

Don't clear KV cache before prefill if requested.

4e6fc63

When context length is big, it increase init time noticeably, e.g. 8s when context length = 32k for gemma3-1b. LiteRT-LM-PiperOrigin-RevId: 859666933

Add a flag to en/disable sampler to handle decode input tensors

b9aea00

LiteRT-LM-PiperOrigin-RevId: 859770885

Internal change

dd3d5e4

LiteRT-LM-PiperOrigin-RevId: 859859182

[litertlm] Add a Kotlin example of tool calling

b01f42e

LiteRT-LM-PiperOrigin-RevId: 859875851

Internal change

7042b19

LiteRT-LM-PiperOrigin-RevId: 859999259

Internal changes and clean up.

50906b5

LiteRT-LM-PiperOrigin-RevId: 860120569

Change default backend to CPU in litert_lm_advanced_main.cc.

7cfdd0e

LiteRT-LM-PiperOrigin-RevId: 860170788

Implement LlgConstraintProvider and ExternalConstraintProvider.

2847d3b

LiteRT-LM-PiperOrigin-RevId: 860216677

Introduce LlmContext, configs and state definitions.

4ed3ca8

LiteRT-LM-PiperOrigin-RevId: 860278850

[litertlm] Update the instruction of setting hugging face token

cef9a49

We will remove the `--hk_token`. The environment is the only way to set the token. LiteRT-LM-PiperOrigin-RevId: 860319701

Encapsulate executor state within LlmContext.

e20eab4

LiteRT-LM-PiperOrigin-RevId: 860320494

Add force_f32 option in go & C api for better quality on gpu backend

3eecf9b

LiteRT-LM-PiperOrigin-RevId: 860635677

Update dependencies of litert_lm

83c6992

- Add TensorBuffer Clear method - Replace @local_xla with @xla LiteRT-LM-PiperOrigin-RevId: 860680613

Use TensorBuffer::Clear() instead of ZeroTensorBuffer()

d1d4d86

LiteRT-LM-PiperOrigin-RevId: 861221697

Reverts 374f764

dae6a75

LiteRT-LM-PiperOrigin-RevId: 861230859

Add Program Cache fd support to LiteRT LM

ed7a8f7

This is similar to the existing weight cache support that XNNPack uses. LiteRT-LM-PiperOrigin-RevId: 861245691

Add constrained decoding to Conversation API, enabling support for ex…

d20033a

…ternal constraints and LLGuidance. LiteRT-LM-PiperOrigin-RevId: 861255750

Add support_single_turn capability to prompt_template, which is ena…

f794979

…bled by checking whether there is corresponding `is_appending_to_prefill` in the jinja template. Templates with such capability can support multi-prefill. LiteRT-LM-PiperOrigin-RevId: 861409419

copybara-service bot force-pushed the litert_lm_pr_861425587 branch from fccfb75 to c9066b1 Compare January 27, 2026 01:48

ai-edge-bot and others added 5 commits January 26, 2026 18:46

Log the input.

1be73b5

LiteRT-LM-PiperOrigin-RevId: 861449983

Internal change

f1f1a9e

LiteRT-LM-PiperOrigin-RevId: 861460877

Use LFS (Large File Storage) to prebuilt binaries (#1294)

986919a

* Use LFS (Large File Storage) to prebuilt binaries * Enable LFS in workflows

Sync with github to support LFS (Large File Storage) for prebuilt bin…

014c177

…aries LiteRT-LM-PiperOrigin-RevId: 861722316

Speculative partial fix for #4727

a1fd171

We need to pick up commit bazelbuild/apple_support@44c43c7 in every dependency of LiteRT LM. LiteRT-LM-PiperOrigin-RevId: 861425587

copybara-service bot force-pushed the litert_lm_pr_861425587 branch from c9066b1 to a1fd171 Compare January 27, 2026 18:41

protobird-git closed this Feb 1, 2026

protobird-git force-pushed the main branch from 89d8ef5 to 111107f Compare February 1, 2026 01:53

protobird-git deleted the litert_lm_pr_861425587 branch February 1, 2026 03:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speculative partial fix for #4727 #1291

Speculative partial fix for #4727 #1291

Uh oh!

copybara-service bot commented Jan 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants