You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: examples/kfto-dreambooth/README.md
+32-13Lines changed: 32 additions & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -13,18 +13,6 @@ This example is based on HuggingFace DreamBooth Hackathon example - https://hugg
13
13
* Sufficient worker nodes for your configuration(s) with NVIDIA GPUs (Ampere-based or newer recommended) or AMD GPUs (AMD Instinct MI300X or newer recommended)
14
14
* AWS S3 storage available
15
15
16
-
> [!IMPORTANT]
17
-
> **Kueue Integration (RHOAI 2.21+):**
18
-
> * If using RHOAI 2.21+, the example supports Kueue integration for workload management:
19
-
> * When using Kueue:
20
-
> * Follow the [Configure Kueue (Optional)](#configure-kueue-optional) section to set up required resources
21
-
> * Add the local-queue name label to your job configuration to enforce workload management
22
-
> * You can skip Kueue usage by:
23
-
> * Disabling the existing `kueue-validating-admission-policy-binding`
24
-
> * Omitting the local-queue-name label in your job configuration
25
-
>
26
-
> **Note:** Kueue Enablement via Validating Admission Policy was introduced in RHOAI-2.21. You can skip this section if using an earlier RHOAI release version.
27
-
28
16
29
17
## Setup
30
18
@@ -57,4 +45,35 @@ This example is based on HuggingFace DreamBooth Hackathon example - https://hugg
57
45
* From the workbench, clone this repository, i.e., `https://github.com/opendatahub-io/distributed-workloads.git`
58
46
* Navigate to the `distributed-workloads/examples/kfto-dreambooth` directory and open the `dreambooth` notebook
59
47
60
-
You can now proceed with the instructions from the notebook. Enjoy!
48
+
You can now proceed with the instructions from the notebook. Enjoy!
49
+
50
+
> [!IMPORTANT]
51
+
> **Kueue Integration (RHOAI 2.21+):**
52
+
> * If using RHOAI 2.21+, the example supports Kueue integration for workload management:
53
+
> * When using Kueue:
54
+
> * Follow the [Configure Kueue (Optional)](#configure-kueue-optional) section to set up required resources
55
+
> * Add the local-queue name label to your job configuration to enforce workload management
56
+
> * You can skip Kueue usage by:
57
+
> * Disabling the existing `kueue-validating-admission-policy-binding`
58
+
> * Omitting the local-queue-name label in your job configuration
59
+
>
60
+
> **Note:** Kueue Enablement via Validating Admission Policy was introduced in RHOAI-2.21. You can skip this section if using an earlier RHOAI release version.
61
+
62
+
### Configure Kueue (Optional)
63
+
64
+
> [!NOTE]
65
+
> This section is only required if you plan to use Kueue for workload management (RHOAI 2.21+) or Kueue is not already configured in your cluster.
66
+
67
+
* Update the `nodeLabels` in the `workshops/kueue/resources/resource_flavor.yaml` file to match your AI worker nodes
Copy file name to clipboardExpand all lines: examples/kfto-feast/README.md
+37-18Lines changed: 37 additions & 18 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -43,24 +43,6 @@ By integrating Feast into the fine-tuning pipeline, we ensure that the training
43
43
* If using PEFT LoRA/QLoRA techniques, then can use NVIDIA GPUs (G4dn)
44
44
* AWS S3 storage available
45
45
46
-
> [!IMPORTANT]
47
-
> **Hugging Face Token Requirements:**
48
-
> * You will need a Hugging Face token if using gated models:
49
-
> * The examples use gated Llama models that require a token (e.g., https://huggingface.co/meta-llama/Llama-3.1-8B)
50
-
> * Set the `HF_TOKEN` environment variable in your job configuration
51
-
> * Note: You can skip the token if switching to non-gated models
52
-
>
53
-
> **Kueue Integration (RHOAI 2.21+):**
54
-
> * If using RHOAI 2.21+, the example supports Kueue integration for workload management:
55
-
> * When using Kueue:
56
-
> * Follow the [Configure Kueue (Optional)](#configure-kueue-optional) section to set up required resources
57
-
> * Add the local-queue name label to your job configuration to enforce workload management
58
-
> * You can skip Kueue usage by:
59
-
> * Disabling the existing `kueue-validating-admission-policy-binding`
60
-
> * Omitting the local-queue-name label in your job configuration
61
-
>
62
-
> **Note:** Kueue Enablement via Validating Admission Policy was introduced in RHOAI-2.21. You can skip this section if using an earlier RHOAI release version.
63
-
64
46
---
65
47
66
48
@@ -127,4 +109,41 @@ By following this notebook, you'll gain hands-on experience in setting up a **fe
127
109
128
110
You can now proceed with the instructions from the notebook. Enjoy!
129
111
112
+
> [!IMPORTANT]
113
+
> **Hugging Face Token Requirements:**
114
+
> * You will need a Hugging Face token if using gated models:
115
+
> * The examples use gated Llama models that require a token (e.g., https://huggingface.co/meta-llama/Llama-3.1-8B)
116
+
> * Set the `HF_TOKEN` environment variable in your job configuration
117
+
> * Note: You can skip the token if switching to non-gated models
118
+
>
119
+
> **Kueue Integration (RHOAI 2.21+):**
120
+
> * If using RHOAI 2.21+, the example supports Kueue integration for workload management:
121
+
> * When using Kueue:
122
+
> * Follow the [Configure Kueue (Optional)](#configure-kueue-optional) section to set up required resources
123
+
> * Add the local-queue name label to your job configuration to enforce workload management
124
+
> * You can skip Kueue usage by:
125
+
> * Disabling the existing `kueue-validating-admission-policy-binding`
126
+
> * Omitting the local-queue-name label in your job configuration
127
+
>
128
+
> **Note:** Kueue Enablement via Validating Admission Policy was introduced in RHOAI-2.21. You can skip this section if using an earlier RHOAI release version.
129
+
130
+
### Configure Kueue (Optional)
131
+
132
+
> [!NOTE]
133
+
> This section is only required if you plan to use Kueue for workload management (RHOAI 2.21+) or Kueue is not already configured in your cluster.
134
+
135
+
* Update the `nodeLabels` in the `workshops/kueue/resources/resource_flavor.yaml` file to match your AI worker nodes
0 commit comments