feat: update kserve setup info (#141)

oleksii-donets · web-flow · commit 7cf1a743717f · 2026-02-13T17:43:32.000+02:00
diff --git a/docs/kserve.md b/docs/kserve.md
@@ -52,19 +52,13 @@ Replace `your_namespace` with the desired namespace where you want to install KS
 
 For more detailed configuration and usage, refer to the [KServe Documentation](https://kserve.github.io/website/docs/admin-guide/kubernetes-deployment).
 
-
-
-Here's a README section that explains the application of necessary roles and role bindings for managing permissions within the Kubernetes cluster:
-
----
-
 # Role and RoleBinding Configuration for AI DIAL Admin
 
 This section describes the configuration of necessary roles and role bindings to manage permissions for the AI DIAL Admin components within the Kubernetes cluster. These configurations ensure that the appropriate access controls are in place for managing inference services and related resources.
 
 ## Role Configuration
 
-The following Role configuration grants permissions to manage inference services and other resources within the `kserve-models` namespace.
+The following Role configuration grants permissions to manage inference services and other resources within the `<model-namespace>` namespace.
 
 ### Role Manifest
 
@@ -73,7 +67,7 @@ apiVersion: rbac.authorization.k8s.io/v1
 kind: Role
 metadata:
   name: ai-dial-admin-deployment-role
-  namespace: kserve-models
+  namespace: <model-namespace>
 rules:
   - apiGroups:
       - 'serving.kserve.io'
@@ -116,7 +110,7 @@ apiVersion: rbac.authorization.k8s.io/v1
 kind: RoleBinding
 metadata:
   name: ai-dial-admin-deployment-role
-  namespace: kserve-models
+  namespace: <model-namespace>
 subjects:
   - kind: ServiceAccount
     name: ai-dial-test-admin-deployment-manager-backend
@@ -135,7 +129,57 @@ To apply these configurations to your Kubernetes cluster, follow these steps:
 
 2. **Apply the Manifests**: Use the `kubectl` command-line tool to apply the manifests to your cluster. Run the following commands in your terminal:
 
-   ```bash
-   kubectl apply -f role.yaml
-   kubectl apply -f rolebinding.yaml
-   ```
+```bash
+kubectl apply -f role.yaml
+kubectl apply -f rolebinding.yaml
+```
+
+# Using Hugging Face Token
+
+To deploy models from private Hugging Face repositories, follow these steps:
+
+1. **Update the default `ClusterStorageContainer`**: Remove the `hf://` entry from its `supportedUriFormats` list. This prevents conflicts between the default and custom storage containers when resolving Hugging Face model URIs. Note that `ClusterStorageContainer` is a cluster-scoped resource, so this change applies globally.
+
+2. **Create a Kubernetes secret** with your Hugging Face access token in the namespace where your models will be deployed:
+
+```bash
+kubectl create secret generic hf-secret \
+ --from-literal=HF_TOKEN=<your_hf_token_here> \
+ -n <model-namespace>
+```
+
+3. **Create a custom `ClusterStorageContainer`** that references the secret:
+
+```yaml
+apiVersion: "serving.kserve.io/v1alpha1"
+kind: ClusterStorageContainer
+metadata:
+ name: hf-hub
+spec:
+ container:
+   image: kserve/storage-initializer:v0.16.0
+   name: storage-initializer
+   env:
+     - name: HF_TOKEN
+       valueFrom:
+         secretKeyRef:
+           name: hf-secret
+           key: HF_TOKEN
+           optional: false
+   resources:
+     limits:
+       cpu: "1"
+       memory: 1Gi
+     requests:
+       cpu: 100m
+       memory: 100Mi
+   securityContext:
+     allowPrivilegeEscalation: false
+     capabilities:
+       drop:
+       - ALL
+     privileged: false
+     runAsNonRoot: true
+ supportedUriFormats:
+   - prefix: hf://
+```