Dev by shubhammalhotra28 · Pull Request #345 · RunanywhereAI/runanywhere-sdks

shubhammalhotra28 · 2026-02-10T22:37:15Z

Description

Tool calling in all 4 SDK's + VLM and Diffusion model support for SWIFT SDK!

Type of Change

Bug fix
New feature
Documentation update
Refactoring

Testing

Lint passes locally
Added/updated tests for changes

Platform-Specific Testing (check all that apply)

Swift SDK / iOS Sample:

Tested on iPhone (Simulator or Device)
Tested on iPad / Tablet
Tested on Mac (macOS target)

Kotlin SDK / Android Sample:

Tested on Android Phone (Emulator or Device)
Tested on Android Tablet

Flutter SDK / Flutter Sample:

Tested on iOS
Tested on Android

React Native SDK / React Native Sample:

Tested on iOS
Tested on Android

Labels

Please add the appropriate label(s):

SDKs:

Swift SDK - Changes to Swift SDK (sdk/runanywhere-swift)
Kotlin SDK - Changes to Kotlin SDK (sdk/runanywhere-kotlin)
Flutter SDK - Changes to Flutter SDK (sdk/runanywhere-flutter)
React Native SDK - Changes to React Native SDK (sdk/runanywhere-react-native)
Commons - Changes to shared native code (sdk/runanywhere-commons)

Sample Apps:

iOS Sample - Changes to iOS example app (examples/ios)
Android Sample - Changes to Android example app (examples/android)
Flutter Sample - Changes to Flutter example app (examples/flutter)
React Native Sample - Changes to React Native example app (examples/react-native)

Checklist

Code follows project style guidelines
Self-review completed
Documentation updated (if needed)

Screenshots

Attach relevant UI screenshots for changes (if applicable):

Mobile (Phone)
Tablet / iPad
Desktop / Mac

Greptile Overview

Greptile Summary

This PR merges the dev branch into main, introducing three major feature additions across all SDK platforms (Swift, Kotlin, Flutter, React Native):

Major Changes

1. Tool Calling System (~3,500 lines)

Centralized C++ implementation in runanywhere-commons as single source of truth
Support for multiple LLM formats: DEFAULT (<tool_call>JSON</tool_call>) and LFM2 (<|tool_call_start|>[func(args)]<|tool_call_end|>)
Comprehensive parsing with edge case handling (missing tags, unquoted keys, brace matching)
Thin platform wrappers in Swift, Kotlin, Flutter, and React Native
All SDKs delegate to commons layer (no duplicate parsing logic)
Integration in all example apps with UI components for tool call visualization

2. Diffusion/Image Generation (~2,800 lines)

New diffusion feature for text-to-image, image-to-image, and inpainting
Backend support for CoreML (Apple) and ONNX (cross-platform)
Comprehensive type system with tokenizer, model registry, and platform services
Integration across all SDKs with view models and UI in example apps

3. Vision-Language Models (VLM) (~2,600 lines)

Multimodal AI support for image understanding with LlamaCPP backend
Image utilities with base64 decoding and stb_image integration
Camera integration in iOS example app
Full SDK support across platforms with FFI bridges

Additional Changes

Added nlohmann/json v3.11.3 library (24,765 lines) for React Native JSON parsing
Secure math expression parser in React Native example (avoids eval())
Build system updates across all platforms
UI components and view models in all example apps
Package dependency updates

Architecture

Follows established pattern: C++ commons for core logic, thin platform wrappers
Maintains separation of concerns between SDK core and example apps
Consistent API surface across all platform SDKs

Confidence Score: 3/5

This PR is moderately safe to merge but requires thorough testing due to its massive scope
Score of 3 reflects a well-architected feature addition with ~60K lines of changes across 224 files. The code quality is high with proper abstractions and consistent patterns, but the sheer size introduces integration risk. The PR description is incomplete (all checkboxes unchecked, no testing confirmation), which is concerning for changes of this magnitude. While no critical bugs were found in the review, comprehensive platform-specific testing is essential before merge.
Pay close attention to the new C++ tool calling implementation (sdk/runanywhere-commons/src/features/llm/tool_calling.cpp) for edge cases, the diffusion and VLM backend integrations, and verify that all platform-specific example apps have been tested on their respective platforms

Important Files Changed

Filename	Overview
sdk/runanywhere-commons/include/rac/features/llm/rac_tool_calling.h	Comprehensive tool calling API header with support for multiple LLM formats (DEFAULT, LFM2), includes parsing, prompt formatting, and JSON utilities
sdk/runanywhere-commons/src/features/llm/tool_calling.cpp	Single source of truth implementation for tool calling with manual JSON parsing, format detection, and comprehensive edge case handling
sdk/runanywhere-commons/include/rac/features/diffusion/rac_diffusion_types.h	Diffusion (image generation) type definitions with support for text-to-image, image-to-image, and inpainting workflows
sdk/runanywhere-commons/include/rac/features/vlm/rac_vlm_types.h	Vision-Language Model (VLM) type definitions for multimodal AI capabilities with image understanding
sdk/runanywhere-react-native/packages/core/cpp/third_party/nlohmann/json.hpp	Third-party nlohmann/json v3.11.3 library (24,765 lines) for JSON parsing in React Native bridge
examples/react-native/RunAnywhereAI/src/utils/mathParser.ts	Secure math expression evaluator using recursive descent parser (avoids eval), supports basic arithmetic with proper precedence
sdk/runanywhere-commons/src/backends/llamacpp/rac_vlm_llamacpp.cpp	LlamaCPP backend for VLM with image embedding, chat template handling, and multimodal prompt construction
sdk/runanywhere-commons/src/features/diffusion/diffusion_component.cpp	Diffusion component implementation with model lifecycle, backend routing, and generation pipeline orchestration
sdk/runanywhere-commons/src/features/vlm/vlm_component.cpp	VLM component with initialization, health checks, and backend integration for vision-language tasks

Sequence Diagram

sequenceDiagram
    participant App as Example App
    participant SDK as Platform SDK<br/>(Swift/Kotlin/Flutter/RN)
    participant Bridge as Platform Bridge
    participant Commons as runanywhere-commons<br/>(C++)
    participant LLM as LLM Backend

    Note over App,LLM: Tool Calling Flow

    App->>SDK: generateWithTools(prompt, tools, options)
    SDK->>Bridge: buildInitialPrompt()
    Bridge->>Commons: rac_tool_call_build_initial_prompt()
    Commons-->>Bridge: formatted prompt with tool instructions
    Bridge-->>SDK: system prompt + user prompt
    
    SDK->>LLM: generate(formatted_prompt)
    LLM-->>SDK: raw output with tool call tags
    
    SDK->>Bridge: parseToolCall(llm_output)
    Bridge->>Commons: rac_tool_call_parse()
    Note over Commons: Auto-detect format<br/>(DEFAULT vs LFM2)
    Note over Commons: Parse JSON with<br/>edge case handling
    Commons-->>Bridge: ToolCall{name, args, callId}
    Bridge-->>SDK: parsed tool call
    
    SDK->>App: toolRegistry.execute(toolName, args)
    App-->>SDK: tool result
    
    SDK->>Bridge: buildFollowupPrompt()
    Bridge->>Commons: rac_tool_call_build_followup_prompt()
    Commons-->>Bridge: followup prompt with result
    Bridge-->>SDK: formatted followup
    
    SDK->>LLM: generate(followup_prompt)
    LLM-->>SDK: final response
    SDK-->>App: GenerationResult{text, toolCalls, toolResults}

- Add ToolCallingBridge in C++ for parsing tool_call tags from LLM output - Handle edge cases: missing closing tags, unquoted JSON keys - Add tool registration and prompt formatting in C++ bridge - TypeScript orchestration layer calls C++ for parsing, handles execution - Add Llama 3.2 3B model to example app (suitable for tool calling) - Update ChatScreen with tool calling demo (weather API example) Architecture: - C++ handles: parsing, validation, prompt formatting - TypeScript handles: tool registration (stores executors), execution (needs JS APIs) Co-Authored-By: jm

- Replace console.log with SDKLogger for consistency - Use ?? instead of || for maxToolCalls to respect explicit 0 - Parse argumentsJson if it's a string from C++ - Update comments to accurately reflect architecture (C++ parses, TS handles registry) - Remove toolsUsed field from analytics (not in type) - Fix doc comment for parseToolCallFromOutput return format

Signed-off-by: Hyunoh-Yeo <hyunoh.yeo@gmail.com>

…fix later

Signed-off-by: Hyunoh-Yeo <hyunoh.yeo@gmail.com>

feat: Add tool calling support with C++

updating git ignore

fix: ios swift app is not deleting models once downloaded + Metadata display bug

SDK Changes: - Add ToolValue enum for type-safe JSON representation (string, number, bool, array, object, null) - Add ToolCallingTypes.swift with type definitions using ToolValue - Add ToolCallParser.swift with pure Swift parser for <tool_call> tags - Add RunAnywhere+ToolCalling.swift with public API (registerTool, executeTool, generateWithTools) Example App Changes: - Add ToolCallInfo to Message model using ToolValue for serialization - Add ToolCallViews.swift with minimal UI (indicator + detail sheet) - Add ToolSettingsView.swift with tool registration settings and 3 demo tools - Update ChatMessageComponents to show tool call indicator on messages - Update ChatInterfaceView to show "Tools enabled" badge - Update LLMViewModel to integrate tool calling into generation flow Bug Fixes: - Fix LLMViewModel+Analytics.swift nil coalescing warnings - Simplify ModelSelectionSheet VLM handling

…ved argument handling - Updated ToolCallViews to use ToolValue for arguments and results. - Refactored ToolSettingsView to implement real API calls for weather and time tools using Open-Meteo API. - Improved ToolCallParser to extract tool names and arguments with multiple fallback strategies. - Added WeatherService for fetching real-time weather data and handling geocoding. - Enhanced error handling in tool execution and argument parsing.

Implements VLM feature with clean, minimal API: C++ Core (runanywhere-commons): - Add rac_vlm_types.h - VLM type definitions - Add rac_vlm_service.h - VLM service vtable and C API - Add rac_vlm_component.h - VLM component lifecycle - Add rac_vlm_llamacpp.h - LlamaCPP VLM backend API - Add rac_image_utils.h - Image utilities - Add RAC_CAPABILITY_VISION_LANGUAGE capability - Add RAC_RESOURCE_TYPE_VLM_MODEL to lifecycle Swift SDK (runanywhere-swift): - Add VLMTypes.swift - VLMImage, VLMGenerationOptions, VLMGenerationResult - Add RunAnywhere+VisionLanguage.swift - describeImage, processImageStream - Add RunAnywhere+VLMModels.swift - loadVLMModel helper - Add CppBridge+VLM.swift - Thread-safe VLM component management iOS Sample App: - Add VLMCameraView.swift - Camera interface for VLM - Add VLMViewModel.swift - Camera/VLM state management - Update ModelSelectionSheet with .vlm context Note: VLM backend conditionally compiled via RAC_VLM_USE_MTMD

…nga/vlm # Conflicts: # examples/ios/RunAnywhereAI/RunAnywhereAI/Features/Models/ModelSelectionSheet.swift

Adds a floating action button (camera icon) in the bottom-right of the chat interface. Tapping it opens the VLM camera view in a full-screen cover for image analysis.

Adds camera permission description to both Debug and Release build configurations to prevent crash when accessing camera.

Registers two VLM models for vision language features: - SmolVLM 500M Instruct (Q8_0) - Ultra-lightweight for mobile - Qwen2-VL 2B Instruct (Q4_K_M) - Small but capable

…vailable feature for SWIFT right now

Add diffusion support for apple only - swift sdk + swift sample app

[Swift-sdk] VLM Support + Diffusion support

coderabbitai · 2026-02-10T22:37:25Z

Important

Review skipped

Too many files!

This PR contains 218 files, which is 68 over the limit of 150.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Use the checkbox below for a quick retry:

🔍 Trigger review

✨ Finishing touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch dev

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

ellipsis-dev · 2026-02-10T22:37:34Z

⚠️ This PR is too big for Ellipsis, but support for larger PRs is coming soon. If you want us to prioritize this feature, let us know at help@ellipsis.dev

Generated with ❤️ by ellipsis.dev

shubhammalhotra28 · 2026-02-10T22:38:52Z

@coderabbitai please do a detailed review. Thanks!

shubhammalhotra28 · 2026-02-10T22:39:03Z

@greptile please do a detailed review. Thanks!

coderabbitai · 2026-02-10T22:39:08Z

@shubhammalhotra28, I'll conduct a detailed review of PR #345 for you. Let me analyze all the changes thoroughly.

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

shubhammalhotra28 · 2026-02-10T23:19:11Z

Merging dev to main to add the features for :

Tool Calling in all the 4 SDK's
VLM support in ios
Diffusion model support in ios using coreML

Merging dev to main

jmakwana06 and others added 30 commits January 24, 2026 21:22

fix(ios): not deleting models once downloaded

ea1ed92

Signed-off-by: Hyunoh-Yeo <hyunoh.yeo@gmail.com>

fix(ios): not deleting models once downloaded

5e1244b

Signed-off-by: Hyunoh-Yeo <hyunoh.yeo@gmail.com>

fix(ios): not deleting models once downloaded

e827a4d

Signed-off-by: Hyunoh-Yeo <hyunoh.yeo@gmail.com>

fix(ios): wrong metadata displaying

bf1fd5e

Signed-off-by: Hyunoh-Yeo <hyunoh.yeo@gmail.com>

fix(ios): wrong metadata displaying

a766075

Signed-off-by: Hyunoh-Yeo <hyunoh.yeo@gmail.com>

fixes

cc577cf

adding tab - to just see what's happenin when doing tool call - will …

1f1b42a

…fix later

fix(ios): disable trash for Platform LLM/TTS

cd26d3a

Signed-off-by: Hyunoh-Yeo <hyunoh.yeo@gmail.com>

Merge pull request #293 from j-makwana/feature/tool-calling

1ec933e

feat: Add tool calling support with C++

git ignore updates

f58f2db

git ignore updates

21c8ca8

Merge pull request #300 from RunanywhereAI/feature/tool-calling

6703de9

updating git ignore

Merge pull request #298 from Hyunoh-Yeo/bugfix/272

9311ff8

fix: ios swift app is not deleting models once downloaded + Metadata display bug

intial changes

9d6a9ba

stage changes

066bcfc

updates

9fadba7

updates

cab2873

bug fixes

4ffbe2b

bug fixes

943f4f6

small change

cf43e7b

fixes

51a410a

Merge remote-tracking branch 'origin/smonga/tool_call_swift' into smo…

6fd772c

…nga/vlm # Conflicts: # examples/ios/RunAnywhereAI/RunAnywhereAI/Features/Models/ModelSelectionSheet.swift

feat: Add floating VLM camera button to chat view

1929415

Adds a floating action button (camera icon) in the bottom-right of the chat interface. Tapping it opens the VLM camera view in a full-screen cover for image analysis.

fix: Add NSCameraUsageDescription for VLM camera feature

18fb754

Adds camera permission description to both Debug and Release build configurations to prevent crash when accessing camera.

feat: Add SmolVLM 500M and Qwen2-VL 2B to model registry

67ea49b

Registers two VLM models for vision language features: - SmolVLM 500M Instruct (Q8_0) - Ultra-lightweight for mobile - Qwen2-VL 2B Instruct (Q4_K_M) - Small but capable

shubhammalhotra28 added 21 commits February 7, 2026 18:55

fixes

fc58f2d

fixes

643c2c1

fixed

a9fec96

changes

a19f695

cleanuo

eb06a90

some more changes

9ff6c66

cleanup

e467a58

react native fix

0b6d4fb

fixed react native

2ce2b03

fixing the script

a9a6232

fixing

7dc49d1

fix

e951815

minor fix

0987347

addressing comments

07ee370

removing the stale code not needed since coreml / swift is the only a…

389c1bf

…vailable feature for SWIFT right now

some changes in swift layer

abdaab0

Merge pull request #327 from RunanywhereAI/add-diffusion

f63675a

Add diffusion support for apple only - swift sdk + swift sample app

minor chanegs

be76bda

minor fixing the UI

70bcd66

ui fix

64c9895

Merge pull request #304 from RunanywhereAI/smonga/vlm+diffusion

6c9d815

[Swift-sdk] VLM Support + Diffusion support

Merge main into dev

329cbec

shubhammalhotra28 merged commit 2c7dbf6 into main Feb 10, 2026
12 of 17 checks passed

ManthanNimodiya pushed a commit to ManthanNimodiya/runanywhere-sdks that referenced this pull request Feb 23, 2026

Merge pull request RunanywhereAI#345 from RunanywhereAI/dev

8da94b1

Merging dev to main

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dev#345

Dev#345
shubhammalhotra28 merged 126 commits intomainfrom
dev

shubhammalhotra28 commented Feb 10, 2026 •

edited

Loading

Uh oh!

coderabbitai bot commented Feb 10, 2026 •

edited

Loading

Review skipped

Uh oh!

ellipsis-dev bot commented Feb 10, 2026

Uh oh!

shubhammalhotra28 commented Feb 10, 2026

Uh oh!

shubhammalhotra28 commented Feb 10, 2026

Uh oh!

coderabbitai bot commented Feb 10, 2026

Uh oh!

shubhammalhotra28 commented Feb 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

shubhammalhotra28 commented Feb 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of Change

Testing

Platform-Specific Testing (check all that apply)

Labels

Checklist

Screenshots

Greptile Overview

Greptile Summary

Major Changes

Additional Changes

Architecture

Confidence Score: 3/5

Important Files Changed

Sequence Diagram

Uh oh!

coderabbitai bot commented Feb 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Uh oh!

ellipsis-dev bot commented Feb 10, 2026

Uh oh!

shubhammalhotra28 commented Feb 10, 2026

Uh oh!

shubhammalhotra28 commented Feb 10, 2026

Uh oh!

coderabbitai bot commented Feb 10, 2026

Uh oh!

shubhammalhotra28 commented Feb 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

shubhammalhotra28 commented Feb 10, 2026 •

edited

Loading

coderabbitai bot commented Feb 10, 2026 •

edited

Loading