Decide on data structure for Reporting client

### What's needed?

A data structure that can expose the metrics as well as the states per microgrid and component.

### Proposed solution

Options to be discussed:
1) Full (artificial) mapping from protobuf to Python data types
2) Helpers (such as namedtuples) to work 
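
To make option 1 more concrete, here is a minimal sketch of what a full mapping could look like, using frozen dataclasses that mirror the nesting of the protobuf response. All class and field names below (`MicrogridData`, `ComponentData`, `MetricSampleData`, `StateSnapshot`) are hypothetical and do not exist in the client; the sample value is made up, and bounds are left out for brevity.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


# All names here are hypothetical; they only illustrate a "full mapping".
@dataclass(frozen=True)
class MetricSampleData:
    """One metric reading of a component at one timestamp."""

    sampled_at: datetime
    metric: str
    value: float


@dataclass(frozen=True)
class StateSnapshot:
    """States, warnings and errors of a component at one timestamp."""

    sampled_at: datetime
    states: tuple[str, ...] = ()
    warnings: tuple[str, ...] = ()
    errors: tuple[str, ...] = ()


@dataclass(frozen=True)
class ComponentData:
    """All metric samples and state snapshots of one component."""

    component_id: int
    metric_samples: tuple[MetricSampleData, ...] = ()
    states: tuple[StateSnapshot, ...] = ()


@dataclass(frozen=True)
class MicrogridData:
    """All components of one microgrid."""

    microgrid_id: int
    components: tuple[ComponentData, ...] = ()


# Usage with made-up values: one component with one metric and two states.
ts = datetime(2023, 10, 1, tzinfo=timezone.utc)
microgrid = MicrogridData(
    microgrid_id=1,
    components=(
        ComponentData(
            component_id=13,
            metric_samples=(MetricSampleData(ts, "DC_VOLTAGE_V", 400.0),),
            states=(StateSnapshot(ts, states=("charging", "relay_open")),),
        ),
    ),
)
```

A mapping like this keeps the hierarchy explicit, at the cost of having to maintain it in step with the protobuf definitions.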

### Use cases

Not only metrics, but also states, warnings and errors should be exposed.

### Alternatives and workarounds

Any alternative ideas are welcome.

### Additional context

As discussed in the API user group meeting on 6th May 2024:
Currently, the `ReportingApiClient` only exposes `MetricSamples` using a namedtuple structure.

To additionally expose `States`: 
* At any given timestamp, a component can have multiple states, such as `charging` and `relay_open`
* In practice, whenever you get a metric you can also get the state information
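
One possible way to handle several states per timestamp while keeping a flat, tuple-based layout is to emit one row per state, warning or error. The sketch below assumes a hypothetical `StateSample` namedtuple and a hypothetical helper `explode_states`; neither exists in the client, and a plain dict stands in for a protobuf state entry.

```python
from collections import namedtuple
from datetime import datetime, timezone

# Hypothetical counterpart to MetricSample; not part of the current client.
StateSample = namedtuple(
    "StateSample",
    ["timestamp", "microgrid_id", "component_id", "category", "value"],
)

# Maps the row category to the (assumed) list field it is read from.
_CATEGORY_FIELDS = (("state", "states"), ("warning", "warnings"), ("error", "errors"))


def explode_states(microgrid_id, component_id, snapshot):
    """Yield one StateSample per state, warning and error in a snapshot.

    `snapshot` is a plain dict standing in for one protobuf state entry.
    """
    for category, field_name in _CATEGORY_FIELDS:
        for value in snapshot.get(field_name, []):
            yield StateSample(
                snapshot["sampled_at"], microgrid_id, component_id, category, value
            )


# Usage with made-up values: two simultaneous states become two rows.
snapshot = {
    "sampled_at": datetime(2023, 10, 1, 0, 0, 13, tzinfo=timezone.utc),
    "states": ["charging", "relay_open"],
    "warnings": [],
    "errors": [],
}
rows = list(explode_states(1, 13, snapshot))
```

The trade-off is that a single timestamp is spread over several rows, so consumers need to group by timestamp and component to recover the full set of states.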


The [example output structure from the protobuf](https://github.com/frequenz-floss/frequenz-api-reporting/blob/bf2cf8deb9f4f1b85e3e9a792a2114a15cee394c/proto/frequenz/api/reporting/v1/reporting.proto#L231) is shown below. The [gRPC entry point](https://github.com/frequenz-io/frequenz-service-reporting/blob/c5c10668a7008482ee35b92f5a5e26282f474adf/src/server.rs#L48) returns these messages.
```
> // Response containing historical microgrid component metrics in one or multiple microgrids
> //
> // Each microgrid's components are provided as timeseries data structures that encapsulate
> // metrics, bounds, errors and operational state and their associated timestamps for each component
> // within the specified time range.
> //
> // !!! example
> //     Example output structure:
> //     ```
> //     microgrids: [
> //       {
> //         microgrid_id: 1,
> //         components: [
> //           {
> //             component_id: 13,
> //             metric_samples: [
> //               /* list of metrics for multiple timestamps */
> //               { sampled_at: "2023-10-01T00:00:00Z", metric: "DC_VOLTAGE_V", sample: {...}, bounds: {...} },
> //               { sampled_at: "2023-10-01T00:00:00Z", metric: "DC_CURRENT_A", sample: {...}, bounds: {...} }
> //               { sampled_at: "2023-10-01T00:05:00Z", metric: "DC_VOLTAGE_V", sample: {...}, bounds: {...} },
> //               { sampled_at: "2023-10-01T00:05:00Z", metric: "DC_CURRENT_A", sample: {...}, bounds: {...} }
> //             ],
> //             states: [
> //               /* list of states for multiple timestamps */
> //               { sampled_at: "2023-10-01T00:00:13.12Z", states: [...], errors: [...], warnings: [...] },
> //               { sampled_at: "2023-10-01T00:02:22.01Z", states: [...], errors: [...], warnings: [...] },
> //               { sampled_at: "2023-10-01T00:05:02.32Z", states: [...], errors: [...], warnings: [...] },
> //             ]
> //           },
> //           {
> //             component_id: 243,
> //             metric_samples: [ ... ],
> //             states: [ ... ]
> //           },
> //         ]
> //       },
> //       {
> //         microgrid_id: 2,
> //         components: [ ... ]
> //       }
> //     ]
> //     ```
```

To retrieve `MetricSample` information, we use the following [helper in the ReportingApiClient](https://github.com/frequenz-floss/frequenz-client-reporting-python/blob/b4110f9d64e2c24bd0558c2245e8c3ed09266947/src/frequenz/client/reporting/_client.py#L34):
```python
from collections import namedtuple

MetricSample = namedtuple(
    "MetricSample", ["timestamp", "microgrid_id", "component_id", "metric", "value"]
)
"""Type for a sample of a time series incl. metric type, microgrid and component ID

A named tuple was chosen to allow safe access to the fields while keeping the
simplicity of a tuple. This data type can be easily used to create a numpy array
or a pandas DataFrame.
"""
```
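
For comparison with the nested output structure, the following sketch shows how such a response could be flattened into `MetricSample` rows. It is not the client's actual implementation: the function name `flatten_metric_samples` is hypothetical, plain dicts stand in for the protobuf messages, and each entry carries a single made-up `value` because the real `sample` message is elided in the example above.

```python
from collections import namedtuple

MetricSample = namedtuple(
    "MetricSample", ["timestamp", "microgrid_id", "component_id", "metric", "value"]
)


def flatten_metric_samples(response):
    """Yield a MetricSample for every metric entry in a nested response dict."""
    for microgrid in response["microgrids"]:
        for component in microgrid["components"]:
            for entry in component["metric_samples"]:
                yield MetricSample(
                    entry["sampled_at"],
                    microgrid["microgrid_id"],
                    component["component_id"],
                    entry["metric"],
                    entry["value"],  # assumption: sample reduced to one number
                )


# Stand-in for a decoded response; the values are made up for illustration.
response = {
    "microgrids": [
        {
            "microgrid_id": 1,
            "components": [
                {
                    "component_id": 13,
                    "metric_samples": [
                        {"sampled_at": "2023-10-01T00:00:00Z", "metric": "DC_VOLTAGE_V", "value": 400.0},
                        {"sampled_at": "2023-10-01T00:00:00Z", "metric": "DC_CURRENT_A", "value": 10.0},
                    ],
                }
            ],
        }
    ]
}
samples = list(flatten_metric_samples(response))
```

Each row repeats the microgrid and component IDs, which is what makes the result easy to load into a table-like structure.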



