Intelligent Agent Routing System

Quickstart Guide

This quickstart guide will help you set up and use the agent routing system. Follow the steps below to get started.

Step 1: Create Agent Routing Specifications

First, download lmos-router dependency from maven central

implementation("org.eclipse.lmos:lmos-router-llm:x.y.z")

then create the agent routing specifications using SimpleAgentRoutingSpecProvider and AgentRoutingSpecBuilder.

val agentRoutingSpecsProvider = SimpleAgentRoutingSpecProvider()
    .add(
        AgentRoutingSpecBuilder()
            .name("offer-agent")
            .description("This agent is responsible for offer management")
            .version("1.0.0")
            .address(Address(uri = "/agents/offer-agent"))
            .build()
    )
    .add(
        AgentRoutingSpecBuilder()
            .name("service-agent")
            .description("This agent is responsible for service management")
            .version("1.0.0")
            .address(Address(uri = "/agents/service-agent"))
            .build()
    )

Step 2: Initialize the Agent Routing Spec Resolver

Next, initialize the LLMAgentRoutingSpecsResolver with the agentRoutingSpecsProvider.

val agentRoutingSpecResolver = LLMAgentRoutingSpecsResolver(
    agentRoutingSpecsProvider,
    modelClient = DefaultModelClient(
        DefaultModelClientProperties(openAiApiKey = "your-openai-api-key") //Defaults to System.getenv("OPENAI_API_KEY")
    )
)

Step 3: Pass the Context and User Message

Set up the context and input messages that will be used to resolve the appropriate agent.

val context = Context(listOf(AssistantMessage("Hello")))
val input = UserMessage("Can you help me find a new phone?")

Step 4: Resolve the Agent

Finally, use the agentRoutingSpecResolver to resolve the appropriate agent based on the context and input messages.

val result = agentRoutingSpecResolver.resolve(context, input)

The result should return offer-agent, indicating that the "offer-agent" is responsible for handling the user's request. Now you can use the address uri to route the user to the appropriate agent.

For spring cloud gateway, refer to the Demo.

Overview

This project routes user queries to the most suitable agent based on their capabilities using Language Model (LLM), Vector-based approaches, and a new Hybrid approach.

Introduction

The Intelligent Agent Routing System directs user queries to the best-suited agent based on their capabilities using three methods:

LLM-based approach: Uses a language model to understand and match queries with agent capabilities.
Vector-based approach: Uses vector embeddings to find the most similar agent based on the query.
Hybrid approach: Extracts abstract requirements from the query using an LLM and then searches for an agent using semantic similarity.

Routing Methods

LLM-Based Approach

Uses advanced language models like OpenAI's GPT-4o mini to understand the context and semantics of user queries.

Pros:

Understands complex queries and context.
Flexible and adaptable to various scenarios.
Utilizes state-of-the-art NLP techniques.

Cons:

Expensive due to commercial language model costs.
Higher response times.
Dependent on external APIs with potential rate limits.

Vector-Based Approach

Uses vector embeddings to represent queries and agent capabilities, comparing them using cosine similarity.

Pros:

Fast and efficient for large-scale data.
Scalable to handle more agents and queries.
Independent of external APIs.

Cons:

Limited in understanding complex queries.
Requires initial setup and regular updates.
Needs maintenance for embedding updates.

Hybrid Approach

Extracts abstract requirements from the query using an LLM and then searches for an agent using semantic similarity.

Pros:

Balances the strengths of both LLM and Vector-based approaches.
Better understanding of complex queries than vector-based alone.
More efficient than LLM-based alone.

Cons:

Still dependent on external APIs for LLM.
Requires integration of both LLM and vector-based systems.

Comparison Table

Feature	LLM-Based Approach	Vector-Based Approach	Hybrid Approach
Contextual Understanding	High	Moderate	High
Flexibility	High	Moderate	High
Efficiency	Moderate	High	High
Scalability	Moderate	High	High
Cost	High	Low	High
Latency	Higher	Lower	High
Dependency	High	Low	High
Setup Complexity	Low	High	High
Maintenance	Low	High	High

Modules

Core

Contains foundational classes and interfaces:

ChatMessage: Represents different types of chat messages.
Context: Represents the conversation context.
AgentRoutingSpec: Represents agent routing specifications.
AgentRoutingSpecsProvider: Interface for providing agent routing specifications.
AgentRoutingSpecsResolver: Interface for resolving agent routing specifications.
Result: Utility class for handling success and failure cases.

LLM

Handles agent routing specifications using a language model:

DefaultModelClient: Client for calling the OpenAI model.
LLMAgentRoutingSpecsResolver: Resolves agent routing specifications using a language model.
ModelPromptProvider: Provides prompts for the language model.

Vector

Handles agent routing specifications using vector embeddings:

DefaultEmbeddingClient: Client for embedding text using a local service.
OpenAIEmbeddingClient: Client for embedding text using the OpenAI API.
VectorAgentRoutingSpecsResolver: Resolves agent routing specifications using vector similarity search.
VectorSearchClient: Interface for searching similar vectors.
VectorSeedClient: Interface for seeding vectors.

Hybrid

Combines LLM and vector-based approaches:

HybridAgentRoutingSpecsResolver: Resolves agent routing specifications using a hybrid approach.

LLM Spring boot starter

Spring Boot starter for the LLM-based agent routing system:

LLMAgentRoutingSpecsResolverAutoConfiguration: Auto-configuration for the LLM-based agent routing system.
LLMAgentRoutingSpecsResolverProperties: Configuration properties for the LLM-based agent routing system.
LLMAgentRoutingSpecsResolverService: Service for resolving agent routing specifications.

Vector Spring boot starter

Spring Boot starter for the Vector-based agent routing system:

VectorAgentRoutingSpecsResolverAutoConfiguration: Auto-configuration for the Vector-based agent routing system.
VectorAgentRoutingSpecsResolverProperties: Configuration properties for the Vector-based agent routing system.
VectorAgentRoutingSpecsResolverService: Service for resolving agent routing specifications.
VectorSeedService: Service for seeding vectors.
VectorSearchService: Service for searching similar vectors.

Hybrid Spring boot starter

Spring Boot starter for the Hybrid-based agent routing system:

HybridAgentRoutingSpecsResolverAutoConfiguration: Auto-configuration for the Hybrid-based agent routing system.
HybridAgentRoutingSpecsResolverProperties: Configuration properties for the Hybrid-based agent routing system.
HybridAgentRoutingSpecsResolverService: Service for resolving agent routing specifications.

Demo

Sample Spring Boot application demonstrating the system:

AgentsApplication: Main application class.
AgentsController: REST controller for handling agent responses.
SuperRouteGatewayApplication: Spring Cloud Gateway application for routing requests.

Benchmarks

Evaluates the performance of the LLM-based, Vector-based, and Hybrid resolvers:

LLM-based Resolver: Processes 2000 samples.
Vector-based Resolver: Processes 5000 samples.
Hybrid Resolver: To be added.

Refer to the Benchmarks for detailed instructions.

Confusion Matrix and Accuracy

The benchmarks include confusion matrices and accuracy metrics for all methods.

LLM-Based Resolver

Vector-Based Resolver

Setup and Installation

You can download the dependencies from maven central by adding the following dependencies to your project:

LLM-Based Approach Spring Boot Starter

implementation("org.eclipse.lmos:lmos-router-llm-spring-boot-starter:x.y.z")

Or using Maven:

<dependency>
    <groupId>org.eclipse.lmos</groupId>
    <artifactId>lmos-router-llm-spring-boot-starter</artifactId>
    <version>x.y.z</version>
</dependency>

Vector-Based Approach Spring Boot Starter

implementation("org.eclipse.lmos:lmos-router-vector-spring-boot-starter:x.y.z")

Or using Maven:

<dependency>
    <groupId>org.eclipse.lmos</groupId>
    <artifactId>lmos-router-vector-spring-boot-starter</artifactId>
    <version>x.y.z</version>
</dependency>

Hybrid Approach Spring Boot Starter

implementation("org.eclipse.lmos:lmos-router-hybrid-spring-boot-starter:x.y.z")

Or using Maven:

<dependency>
    <groupId>org.eclipse.lmos</groupId>
    <artifactId>lmos-router-hybrid-spring-boot-starter</artifactId>
    <version>x.y.z</version>
</dependency>

No framework dependencies

If you are not using Spring Boot, you can add the following dependencies:

LLM-Based Approach

implementation("org.eclipse.lmos:lmos-router-llm:x.y.z")

Vector-Based Approach

implementation("org.eclipse.lmos:lmos-router-vector:x.y.z")

Hybrid Approach

implementation("org.eclipse.lmos:lmos-router-hybrid:x.y.z")

or you can build the project from source:

Clone the repository:

git clone https://github.com/eclipse-lmos/lmos-router.git
cd lmos-router

Set environment variables: (If running Flow tests, they can be enabled by setting gradle project property runFlowTests=true)
- OPENAI_API_KEY: Your OpenAI API key.
- VECTOR_SEED_JSON_FILE_PATH: Path to the JSON file containing seed vectors.
Build the project:

./gradlew build

Demo

To run the demo:

Refer to the Demo for detailed instructions.

Contributing

Contributions are welcome! Please read the contributing guidelines for more information.

Code of Conduct

This project has adopted the Contributor Covenant in version 2.1 as our code of conduct. Please see the details in our CodeOfConduct.md. All contributors must abide by the code of conduct.

By participating in this project, you agree to abide by its Code of Conduct at all times.

Licensing

Sourcecode licensed under the Apache License, Version 2.0 (the "License"); you may not use this project except in compliance with the License.

This project follows the REUSE standard for software licensing.
Each file contains copyright and license information, and license texts can be found in the ./LICENSES folder. For more information visit https://reuse.software/.

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the LICENSE for the specific language governing permissions and limitations under the License.

Agent Classifier

This section of the README refers to the lmos-classifier-* modules, which are intended to replace the older lmos-router-* modules.

The LMOS Agent Classifier library allows you to set up an agent classification system that identifies the most appropriate agent based on the conversation and system context, using the following complementary classifier strategies:

Embedding-based Classification: Finds the most qualified agent using a semantic vector search and a ranking algorithm.
LLM-based Classification: Utilizes a LLM to select the most appropriate agent based on the conversation context.
Hybrid Classification: Combines semantic retrieval with LLM-based reasoning:
- Fast-Track Strategy: First performs the Embedding-based Classification to find a matching agent. If no confident match is found, the system falls back to an LLM. The agents retrieved during the semantic search are passed to the LLM, enabling it to make an informed decision.
- RAG Strategy: This strategy follows the classic RAG approach. It first retrieves a relevant subset of agents using semantic search. Then, an LLM selects the most appropriate agent from this set.

In the initial version, classification returns a single best-matching agent. A future extension could allow multiple candidates to be considered, including coordination patterns if needed.

Module Overview

Each classification strategy has a dedicated implementation module as well as a Spring Boot Starter for easy integration if needed.

`lmos-classifier-core`

Contains the common models and the classifier interfaces.

`lmos-classifier-vector`

Implementation of the vector-based classification strategy using semantic similarity. It retrieves and ranks agents based on embedding similarity.

Spring Boot Integration: lmos-classifier-vector-spring-boot-starter

`lmos-classifier-llm`

Implementation of the LLM-based classification strategy, which uses a large language model to select the most suitable agent.

Spring Boot Integration: lmos-classifier-llm-spring-boot-starter

`lmos-classifier-hybrid`

Implements the hybrid strategies FastTrackAgentClassifier and RagAgentClassifier.

Spring Boot Integration: lmos-classifier-hybrid-spring-boot-starter

`lmos-classifier-workbench-demo-controller`

A Spring Boot example application that demonstrates how to configure and use the available classifier strategies via the Spring Boot starter modules. It exposes all supported classification strategies through HTTP endpoints and serves as a practical reference for testing and comparing different classifier approaches.

Classifier Guide

This section explains how to use the available classifier implementations. There are two main ways to use the classifiers:

Programmatic Usage – Directly instantiate and configure classifiers using the provided builders.
Spring Boot Starter – Use the Spring Boot starter modules to easily configure and wire classifiers through the application.yaml.

Programmatic Usage

All classifier implementations can be instantiated and configured manually. Therefore, each classifier exposes a builder.

Example for LLM-based classification:

val chatModel = LangChainChatModelFactory.createClient(
    ChatModelClientProperties(
        provider = "azure",
        apiKey = "your-api-key",
        baseUrl = "https://model-base-url.com",
        model = "gpt-35-turbo",
        maxTokens = 512,
        temperature = 0.2,
        logRequestsAndResponses = false,
    )
)

val classifier = DefaultModelAgentClassifier
    .builder()
    .withChatModel(chatModel)
    .build()

Further examples on how to use the builders and their related components can be found in the Spring Boot starter auto-configuration classes: ModelAgentClassifierAutoConfiguration, EmbeddingAgentClassifierAutoConfiguration, FastTrackAgentClassifierAutoConfiguration, and RagAgentClassifierAutoConfiguration.

Details on available configuration options for LLMs and embedding models can be found in the Spring Boot Starter chapter.

Spring Boot Starter

Enable Classifier

The corresponding Spring Boot starter project must be added as a dependency to your application, and the classifier strategies must then be enabled explicitly in the application.yaml file.

lmos:
  router:
    classifier:
      llm:
        enabled: true
      vector:
        enabled: false
      hybrid-rag:
        enabled: false
      hybrid-fast-track:
        enabled: false

Only enabled classifiers will be instantiated. You can activate one or more simultaneously.

LLM Configuration

If an LLM is involved in the classification process, it must be configured in the application.yaml. E.g.:

lmos:
  router:
    llm:
      provider: azure_openai
      api-key: your-api-key
      base-url: https://model-base-url.com
      model: gpt-4

Refer to the table to see which LLM providers are supported and what configuration options are available for each.

Provider	`api-key`	`base-url`	`model`	Optional Params
`openai`	✅	❌	✅	`maxTokens`, `temperature`
`azure_openai`	✅	✅	✅ (deployment)	`maxTokens`, `temperature`
`azure_openai_identity`	❌	✅	✅	`maxTokens`, `temperature`
`anthropic`	✅	❌	✅	`maxTokens`, `temperature`
`gemini`	✅	❌	✅	`maxTokens`, `temperature`
`ollama`	❌	✅	✅	`temperature`
`other`	✅	✅	✅	`maxTokens`, `temperature`

The azure_openai_identity relies on environment-based authentication (Azure Identity SDK).

Embedding Model Configuration

To enable semantic classification, an embedding model must be configured. Currently, three providers are supported. The required configuration parameters depend on the selected provider and model.

lmos:
  router:
    embedding:
      model:
        provider: openai                                           # openai, huggingface, local_onnx
        base-url: https://my-api.openai.com/v1/embeddings          # Required for openai
        model-name: hugginface-model-name                          # Required for huggingface
        api-key: hugginface-api-key                                # Required for openai and huggingface
        model-path: /path/to/local-model.onnx                      # Required for local_onnx
        tokenizer-path: /path/to/local-tokenizer.json              # Required for local_onnx

Refer to the table to see which providers are supported and what configuration options are available for each.

Provider	Required Settings
`openai`	`base-url`, `api-key`
`huggingface`	`model-name`, `api-key`
`local_onnx`	`model-path`, `tokenizer-path`

Embedding Store Configuration

In addition to an embedding model, a store must be configured to persist and query the vector embeddings.

lmos:
  router:
    embedding:
      store:
        host: localhost
        port: 6334
        tlsEnabled: false
        apiKey: my-api-key

Defines the connection to the external embedding store.
Supports TLS and API key-based authentication.

Embedding Ranking Configuration

For the Embedding-based Classification, a ranker is used. The goal of the ranker is to determine the most qualified agent based on scores, thresholds, or other heuristics.

There is currently a default implementation (EmbeddingScoreRanker) for the ranker, where the agent with the highest cumulative score is only selected if:

The score difference to the second-best agent exceeds a minimum distance
The total and mean scores exceed predefined thresholds
And the relative score difference is sufficiently large

Defaults threshold values are provided, but tuning is highly recommended. Optimal values depend on:

Number and type of agents
Embedding model behavior
Language characteristics
Well defined capabilities examples

The ranking thresholds can be configured as follows:

lmos:
  router:
    embedding:
      ranking:
        maxEmbeddings: 15
        minWeight: 5.0
        minDistance: 4.0
        minMeanScore: 0.5
        minRealDistance: 0.3

Name		Name	Last commit message	Last commit date
Latest commit History 181 Commits
.github		.github
LICENSES		LICENSES
benchmarks		benchmarks
gradle/wrapper		gradle/wrapper
lmos-classifier-core-spring-boot-starter		lmos-classifier-core-spring-boot-starter
lmos-classifier-core/src/main/kotlin/org/eclipse/lmos/classifier/core		lmos-classifier-core/src/main/kotlin/org/eclipse/lmos/classifier/core
lmos-classifier-hybrid-spring-boot-starter		lmos-classifier-hybrid-spring-boot-starter
lmos-classifier-hybrid		lmos-classifier-hybrid
lmos-classifier-llm-spring-boot-starter		lmos-classifier-llm-spring-boot-starter
lmos-classifier-llm		lmos-classifier-llm
lmos-classifier-vector-spring-boot-starter		lmos-classifier-vector-spring-boot-starter
lmos-classifier-vector		lmos-classifier-vector
lmos-classifier-workbench-demo-controller		lmos-classifier-workbench-demo-controller
lmos-router-core		lmos-router-core
lmos-router-hybrid-spring-boot-starter		lmos-router-hybrid-spring-boot-starter
lmos-router-hybrid		lmos-router-hybrid
lmos-router-llm-in-spring-cloud-gateway-demo		lmos-router-llm-in-spring-cloud-gateway-demo
lmos-router-llm-spring-boot-starter		lmos-router-llm-spring-boot-starter
lmos-router-llm		lmos-router-llm
lmos-router-vector-spring-boot-starter		lmos-router-vector-spring-boot-starter
lmos-router-vector		lmos-router-vector
.editorconfig		.editorconfig
.gitignore		.gitignore
CodeOfConduct.md		CodeOfConduct.md
Contributing.md		Contributing.md
README.md		README.md
REUSE.toml		REUSE.toml
build.gradle.kts		build.gradle.kts
gradle.properties		gradle.properties
gradlew		gradlew
gradlew.bat		gradlew.bat
settings.gradle.kts		settings.gradle.kts

eclipse-lmos/lmos-router

Folders and files

Latest commit

History

Repository files navigation

Intelligent Agent Routing System

Quickstart Guide

Step 1: Create Agent Routing Specifications

Step 2: Initialize the Agent Routing Spec Resolver

Step 3: Pass the Context and User Message

Step 4: Resolve the Agent

Overview

Table of Contents

Introduction

Routing Methods

LLM-Based Approach

Vector-Based Approach

Hybrid Approach

Comparison Table

Modules

Confusion Matrix and Accuracy

LLM-Based Resolver

Vector-Based Resolver

Setup and Installation

LLM-Based Approach Spring Boot Starter

Vector-Based Approach Spring Boot Starter

Hybrid Approach Spring Boot Starter

No framework dependencies

LLM-Based Approach

Vector-Based Approach

Hybrid Approach

Demo

Contributing

Code of Conduct

Licensing

Agent Classifier

Module Overview

lmos-classifier-core

lmos-classifier-vector

lmos-classifier-llm

lmos-classifier-hybrid

lmos-classifier-workbench-demo-controller

Classifier Guide

Programmatic Usage

Spring Boot Starter

Enable Classifier

LLM Configuration

Embedding Model Configuration

Embedding Store Configuration

Embedding Ranking Configuration

About

Resources

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Uh oh!

Uh oh!

Languages

`lmos-classifier-core`

`lmos-classifier-vector`

`lmos-classifier-llm`

`lmos-classifier-hybrid`

`lmos-classifier-workbench-demo-controller`