You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: content/learning-paths/servers-and-cloud-computing/multiarch_ollama_on_gke/0-spin_up_gke_cluster.md
+29-12Lines changed: 29 additions & 12 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@ layout: learningpathall
8
8
9
9
## Project overview
10
10
11
-
Arm CPUs are widely used in AI/ML use cases. In this Learning Path, you will learn how to run [Ollama](https://ollama.com/) on Arm-based CPUs in a hybrid architecture (amd64 and arm64) K8s cluster.
11
+
Arm CPUs are widely used in Kubernetes AI/ML use cases. In this Learning Path, you learn how to run [Ollama](https://ollama.com/) on Arm-based CPUs in a hybrid architecture (amd64 and arm64) K8s cluster.
12
12
13
13
To demonstrate this, you can bring up an initial Kubernetes cluster (depicted as "*1. Initial Cluster (amd64)*" in the image below) with an amd64 node running an Ollama Deployment and Service.
14
14
@@ -49,11 +49,7 @@ Although this will work in all regions and zones where C4 and C4a instance types
49
49
50
50
8. Click on *NODE POOLS*->*Nodes*
51
51
9. For *Series*, select *C4*
52
-
10. For *Machine Type*, select *c4-standard-4*
53
-
54
-
{{% notice Note %}}
55
-
The chosen node types support only one pod per node. If you wish to run multiple pods per node, each node should provide about 10GB memory per pod.
@@ -63,11 +59,33 @@ It will take a few moments, but when the green checkmark is showing next to the
63
59
64
60
### Connect to the cluster
65
61
66
-
{{% notice Note %}}
67
-
The following assumes you have gcloud and kubectl already installed. If not, please follow the instructions on the first page under "Prerequisites".
68
-
{{% /notice %}}
62
+
Before continuing, make sure you have *kubectl* and *gcloud* installed. You can verify by running each command, for example, entering *gcloud* and enter:
69
63
70
-
You'll first setup your newly created K8s cluster credentials using the gcloud utility. Enter the following in your command prompt (or cloud shell), and make sure to replace "YOUR_PROJECT_ID" with the ID of your GCP project:
64
+
```bash
65
+
gcloud
66
+
```
67
+
should return
68
+
```output
69
+
ERROR: (gcloud) Command name argument expected.
70
+
...
71
+
```
72
+
and entering *kubectl* and enter should return:
73
+
74
+
```output
75
+
kubectl controls the Kubernetes cluster manager.
76
+
77
+
Find more information at: https://kubernetes.io/docs/reference/kubectl/
78
+
...
79
+
```
80
+
If you get something similar to:
81
+
82
+
```output
83
+
command not found
84
+
```
85
+
86
+
Please follow prerequisite instructions on the first page to install the missing utilities.
87
+
88
+
With prerequisites out of the way, you will next setup your newly created K8s cluster credentials using the gcloud utility. Enter the following in your command prompt (or cloud shell), and make sure to replace "YOUR_PROJECT_ID" with the ID of your GCP project:
71
89
72
90
```bash
73
91
export ZONE=us-central1
@@ -81,8 +99,7 @@ If you get the message:
81
99
```output
82
100
CRITICAL: ACTION REQUIRED: gke-gcloud-auth-plugin, which is needed for continued use of kubectl, was not found or is not executable. Install gke-gcloud-auth-plugin for use with kubectl by following https://cloud.google.com/kubernetes-engine/docs/how-to/cluster-access-for-kubectl#install_plugin
Copy file name to clipboardExpand all lines: content/learning-paths/servers-and-cloud-computing/multiarch_ollama_on_gke/2-deploy-arm64.md
+2-2Lines changed: 2 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -26,7 +26,7 @@ To add Arm nodes to the cluster:
26
26
7. Select *C4A* : *c4a-standard-4* for Machine *Configuration/Type*.
27
27
28
28
{{% notice Note %}}
29
-
To compare amd64 and arm64 performance, the c4a-standard-4 is used as the arm64 equivalent of the previously deployed c4-standard-4 in the amd64 node pool.
29
+
To compare amd64 and arm64 performance, the c4a-standard-4 is used as the arm64 equivalent of the previously deployed c4-standard-8 in the amd64 node pool.
30
30
{{% /notice %}}
31
31
32
32

@@ -146,7 +146,7 @@ service/ollama-arm64-svc created
146
146
service/ollama-multiarch-svc created
147
147
```
148
148
149
-
2. Get the status of the pods and the services by running the following:
149
+
2. Get the status of the nodes, pods, and services by running the following:
0 commit comments