Commit c1696ce
committed
[ML] Fix double-counting of inference memory in the assignment rebalancer (elastic#133919)
The static method TrainedModelAssignmentRebalancer.getNodeFreeMemoryExcludingPerNodeOverheadAndNativeInference was used to subtract load.getAssignedNativeInferenceMemory() from load.getFreeMemoryExcludingPerNodeOverhead(). However, in NodeLoad.getFreeMemoryExcludingPerNodeOverhead(), native inference memory was already subtracted as part of the getAssignedJobMemoryExcludingPerNodeOverhead() calculation.
This led to double-counting of the native inference memory. Avoiding this double-counting allows us to remove the private method getNodeFreeMemoryExcludingPerNodeOverheadAndNativeInference() entirely.1 parent 29cb067 commit c1696ce
File tree
2 files changed
+6
-7
lines changed- docs/changelog
- x-pack/plugin/ml/src/main/java/org/elasticsearch/xpack/ml/inference/assignment
2 files changed
+6
-7
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
Lines changed: 1 addition & 7 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
303 | 303 | | |
304 | 304 | | |
305 | 305 | | |
306 | | - | |
307 | | - | |
308 | | - | |
| 306 | + | |
309 | 307 | | |
310 | 308 | | |
311 | 309 | | |
| |||
322 | 320 | | |
323 | 321 | | |
324 | 322 | | |
325 | | - | |
326 | | - | |
327 | | - | |
328 | | - | |
329 | 323 | | |
330 | 324 | | |
331 | 325 | | |
| |||
0 commit comments