Simplifying Llama Stack Deployment with Kubernetes Operator #1707
Replies: 5 comments 3 replies
-
In addition to the Operator/Controller code, this will be a suitable location for hosting artifact deployments (Kustomize, Helm, bare YAML, etc.). |
Beta Was this translation helpful? Give feedback.
-
Hi @raghotham here is the repo you requested for the k8s operator https://github.com/opendatahub-io/llama-stack-k8s-operator/tree/main. It would be great to include this under https://github.com/meta-llama |
Beta Was this translation helpful? Give feedback.
-
Hi @raghotham, following up from the community meeting request. Did you get a chance to look at the operator repo and if we could add it under https://github.com/meta-llama ? |
Beta Was this translation helpful? Give feedback.
-
Can you make this an issue? |
Beta Was this translation helpful? Give feedback.
-
Closing, we are tracking the effort in #1814, thanks! |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hello everyone! I, alongwith @leseb and @rhuss would like to initiate a discussion about enhancing Llama Stack’s deployment capabilities by introducing a Kubernetes Operator.
Why the operator?
An Operator can provide several advantages for Llama-Stack server deployments -
making Llama Stack easier to run in a Kubernetes environment.
How it fits to Llama Stack?
The Operator could handle:
Repository Structure
To keep things modular, we propose hosting this operator in a separate GitHub repository(e.g llama-stack-k8s-operator). A separate Github repository ensures separation of concerns between the Llama Stack server and deployment strategies. It will also provide a dedicated space for Kubernetes-specific issues and contributions.
We are seeking community's feedback on leveraging Kubernetes operators for efficient deployment and management of Llama Stack server instances. Based on this feedback, we would like to share details of our prototype, including the Custom Resource Definition (CRD) and controller implementation, in subsequent discussions.
Beta Was this translation helpful? Give feedback.
All reactions