Description
What is the problem you're trying to solve
Currently, in the Volcano colocation scenario, when the usage of a resource (such as CPU) exceeds its threshold, the threshold must be exceeded six consecutive times (checked every 10 seconds) before BE pods are evicted. Both values are hard-coded; we need to make them configurable for users.
The highUsageCountLimit constant is currently hardcoded:
https://github.com/volcano-sh/volcano/blob/7b14346ab7a7c46814da3951a21ee3cad9ccf5c0/pkg/agent/events/probes/nodemonitor/node_monitor.go#L201C47-L201C66
as are the frequencies of detect and monitor:
volcano/pkg/agent/events/probes/nodemonitor/node_monitor.go, lines 87 to 88 in 7b14346:

    go wait.Until(m.utilizationMonitoring, 10*time.Second, stop)
    go wait.Until(m.detect, 10*time.Second, stop)
Describe the solution you'd like
We need to support configurable values. The defaults can stay as they are today: a 10-second interval for monitoring and detection, and a highUsageCountLimit of 6.
Additional context
No response
Documentation Updates
- This feature requires design or user documentation changes.
- If documentation changes are required, I will ensure the relevant documents are updated and published to the Volcano official website (https://volcano.sh) via the volcano-sh/website repository.