Demo/llamastack budget by Hadar301 · Pull Request #6 · RHEcosystemAppEng/rhoai-litellm-poc

Hadar301 · 2026-01-11T11:50:20Z

fix:

Add Llamastack client library to ui req.
Add route clause to Llamastack.
fix: ui image tag.

added budget management with Llamastack.
added documentation regarding our findings for using:

Client with LiteLLM as backend

 lite_client = LlamaStackClient(
    base_url=LITELLM_URL,  # Your LiteLLM URL
    api_key="...",
)

VS.
Client with Llamastack server configured with LiteLLM as backend

llama_stack_client = LlamaStackClient(base_url=LLAMASTACK_URL)

Update: use the X-LlamaStack-Provider-Data header for dynamic authentication, now the demo is working with llamastack server

llamastack_client = LlamaStackClient(
        base_url=LLAMASTACK_URL,  # Connect to LlamaStack server
        provider_data={
            "vllm_api_token": budgeted_api_key  # Pass budgeted key via X-LlamaStack-Provider-Data
        }
    )

1. add llamastackclient library to ui req 2. add route clause to llamastack

2. added documentaion regarding our findings for using: # Client with LiteLLM as backend lite_client = LlamaStackClient( base_url=LITELLM_URL, # Your LiteLLM URL api_key="...", ) VS. # Client with Llamastack server configured with LiteLLM as backend llama_stack_client = LlamaStackClient(base_url=LLAMASTACK_URL)

now the demo is working with llamastack server using litellm as backend

johnson2500 · 2026-01-12T14:35:31Z

demos/llamastack_budget_test.py

+
+def create_budget_key(base_url: str, master_key: str, max_budget: float) -> dict:
+    """Create a virtual key with a specified budget limit."""
+    url = f"{base_url}/key/generate"


Ahh I see it now

I'll see if we can use the litellm python library for this instead of using requests

* Add redis deployment * update budgest test (can't use the same message since we use cache)

… have cache

johnson2500

Approved. This looks very nice, and well documented. Thanks. I think later we can look at mine which can help with the API Keys.

Hadar301 added 2 commits January 11, 2026 13:46

fix:

42836c7

1. add llamastackclient library to ui req 2. add route clause to llamastack

Hadar301 requested a review from johnson2500 January 11, 2026 11:50

Hadar301 added 3 commits January 11, 2026 13:59

fix: tag format (commit sha and latest)

69466b2

use the X-LlamaStack-Provider-Data header for dynamic authentication.

f5e1399

now the demo is working with llamastack server using litellm as backend

set cost for input&output of the ollama model

84636e4

Hadar301 marked this pull request as draft January 12, 2026 13:48

johnson2500 reviewed Jan 12, 2026

View reviewed changes

Hadar301 marked this pull request as ready for review January 13, 2026 08:39

Feature/redis cache (#7)

5e96675

* Add redis deployment * update budgest test (can't use the same message since we use cache)

Hadar301 marked this pull request as draft January 13, 2026 08:57

Hadar301 added 2 commits January 13, 2026 11:11

update the test same as budget_test.py - use prompts array since we…

99c6890

… have cache

reduce cache time to live

95106da

Hadar301 marked this pull request as ready for review January 13, 2026 09:11

Hadar301 requested a review from johnson2500 January 13, 2026 09:11

Merge branch 'main' into demo/llamastack-budget

4ffba0d

johnson2500 approved these changes Jan 13, 2026

View reviewed changes

Hadar301 merged commit 683bed0 into main Jan 13, 2026
1 check passed

Hadar301 mentioned this pull request Jan 13, 2026

[HOLD UNTL DEPENDENT PR IS MERGED] feat(demo): Add demo to display the interchangeability of liteLLM and LLamaStack. #8

Closed

Hadar301 deleted the demo/llamastack-budget branch January 14, 2026 09:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Demo/llamastack budget#6

Demo/llamastack budget#6
Hadar301 merged 9 commits intomainfrom
demo/llamastack-budget

Hadar301 commented Jan 11, 2026 •

edited

Loading

Uh oh!

johnson2500 Jan 12, 2026

Uh oh!

Hadar301 Jan 13, 2026

Uh oh!

johnson2500 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

Hadar301 commented Jan 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

johnson2500 Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

Hadar301 Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

johnson2500 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Hadar301 commented Jan 11, 2026 •

edited

Loading