You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/build-s390x.md
+14-14Lines changed: 14 additions & 14 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -111,31 +111,31 @@ All models need to be converted to Big-Endian. You can achieve this in three cas
111
111
112
112
## IBM zDNN Accelerator
113
113
114
-
*No direction at the moment.*
114
+
*Only available in IBM z16 and onwards. No direction at the moment.*
115
115
116
116
## IBM Spyre Accelerator
117
117
118
118
*No direction at the moment.*
119
119
120
-
## Performance Optimization
121
-
122
-
### Virtualization Setup
123
-
124
-
We strongly recommend using only LPAR (Type-1) virtualization to get the most performance.
120
+
## Performance Tuning
125
121
126
-
Note: Type-2 virtualization is not supported at the moment, while you can get it running, the performance will not be the best.
122
+
### 1. Virtualization Setup
127
123
128
-
### IFL (Core) Count
124
+
We strongly recommend using only LPAR (Type-1) virtualization to get the most performance.
125
+
126
+
Note: Type-2 virtualization is not supported at the moment, while you can get it running, the performance will not be the best.
129
127
130
-
We recommend a minimum of 8 shared IFLs assigned to the LPAR. Increasing the IFL count past 8 shared IFLs will only improve Prompt Processing performance but not Token Generation.
128
+
### 2. IFL (Core) Count
131
129
132
-
Note: IFL count does not equate to vCPU count.
130
+
We recommend a minimum of 8 shared IFLs assigned to the LPAR. Increasing the IFL count past 8 shared IFLs will only improve Prompt Processing performance but not Token Generation.
131
+
132
+
Note: IFL count does not equate to vCPU count.
133
133
134
-
### SMT vs NOSMT (Simultaneous Multithreading)
134
+
### 3. SMT vs NOSMT (Simultaneous Multithreading)
135
135
136
-
We strongly recommend disabling SMT via the kernel boot parameters as it negatively affects performance. Please refer to your Linux distribution's guide on disabling SMT via kernel boot parameters.
136
+
We strongly recommend disabling SMT via the kernel boot parameters as it negatively affects performance. Please refer to your Linux distribution's guide on disabling SMT via kernel boot parameters.
137
137
138
-
### BLAS vs NOBLAS
138
+
### 4. BLAS vs NOBLAS
139
139
140
-
We strongly recommend using BLAS for llama.cpp as there are no custom kernels for s390x for llama.cpp at the moment.
140
+
We strongly recommend using BLAS for llama.cpp as there are no custom kernels for s390x for llama.cpp at the moment.
0 commit comments