-
Notifications
You must be signed in to change notification settings - Fork 222
Description
Target Version
Current version (0.10.0+) - New Architecture
Feature Description
Comparing documentations of versions 0.9.0 and 0.10.0, as well as their deployments on my clusters, the archive version has more metrics per container and process than the newest one.
I'm specifically talking about the "kepler_container_joules_total " metrics, which included "CPU, dram, gpus, and other host components" ( 0.9.0 docs - https://sustainable-computing.io/archive/design/metrics/#kepler-metrics-for-container-energy-consumption).
The newest version only supports cpu related metrics for containers - kepler_container_cpu_joules_total (0.10.0 - https://sustainable-computing.io/kepler/design/metrics/)
I've read the Slack announcement, including the new "limitations" section, which talks about only supporting the RAPL/powercap framework. I've also read the Slack comments regarding the "past models being way off on power consumption"
Will the DRAM related metrics be back eventually in newer Kepler versions?
Problem Statement
Less metrics available than before in 0.10.0 (specifically dropping DRAM support)
Proposed Solution
A comment regarding DRAM/GPU power metrics's place in the Kepler Roadmap
Alternatives Considered
No response
Additional Context
No response