DRAM metrics not supported in 0.10.0 post rewrite? kepler_container_joules_total missing #2363

@MA3CIN

Description

Target Version

Current version (0.10.0+) - New Architecture

Feature Description

Comparing the documentation for versions 0.9.0 and 0.10.0, as well as deployments of both on my clusters, I see that the archived version (0.9.0) exposes more metrics per container and process than the newest one.

I'm specifically talking about the "kepler_container_joules_total" metric, which covered "CPU, dram, gpus, and other host components" (0.9.0 docs: https://sustainable-computing.io/archive/design/metrics/#kepler-metrics-for-container-energy-consumption).
The newest version only supports CPU-related metrics for containers: kepler_container_cpu_joules_total (0.10.0 docs: https://sustainable-computing.io/kepler/design/metrics/).
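
For anyone who wants to reproduce the comparison, a minimal Python sketch along these lines could diff the container metric names between two scrapes. Only the two metric names come from the docs linked above. The sample exposition text, the label names (`container_id`, `zone`), and the values are made-up placeholders, not real Kepler output.

```python
import re

# Matches a Prometheus metric name at the start of a sample line.
METRIC_NAME = re.compile(r"[a-zA-Z_:][a-zA-Z0-9_:]*")


def metric_names(exposition_text, prefix="kepler_container_"):
    """Collect metric names starting with `prefix` from Prometheus text-format output."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        match = METRIC_NAME.match(line)
        if match and match.group(0).startswith(prefix):
            names.add(match.group(0))
    return names


# Hypothetical scrapes: labels and values are invented for illustration.
old_scrape = """
# TYPE kepler_container_joules_total counter
kepler_container_joules_total{container_id="abc"} 12.5
"""
new_scrape = """
# TYPE kepler_container_cpu_joules_total counter
kepler_container_cpu_joules_total{container_id="abc",zone="package"} 10.0
"""

# Names present in the old scrape but absent from the new one.
missing = sorted(metric_names(old_scrape) - metric_names(new_scrape))
print(missing)  # ['kepler_container_joules_total']
```

In practice the two strings would be replaced with the saved output of each deployment's metrics endpoint.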

I've read the Slack announcement, including the new "limitations" section, which says that only the RAPL/powercap framework is supported. I've also read the Slack comments about the "past models being way off on power consumption".

Will the DRAM-related metrics eventually return in newer Kepler versions?

Problem Statement

Fewer metrics are available in 0.10.0 than before (specifically, DRAM support was dropped).

Proposed Solution

A comment on where DRAM/GPU power metrics fit in the Kepler roadmap.

Alternatives Considered

No response

Additional Context

No response
