Skip to content
This repository was archived by the owner on Nov 3, 2025. It is now read-only.

Metrics collection for workflows deployed as knative services #550

@jianrongzhang89

Description

@jianrongzhang89

Description

When a workflow is deployed as a serverless Knative Service, and a new workflow instance is triggered, a pod for the workflow gets started automatically, and after the instance is finished, Knative will automatically terminate the pod by scaling down the corresponding k8s deployment to zero replica. The workflow pods last for a short period before they get terminated.

As as result, Prometheus may not have the chance to scrap the metrics from the workflow on time and they may miss such metrics if the pods are already terminated and this leads to the accuracy issue of the dashboards.

This issue is created to implement a solution to overcome such limitation and implement a metrics collector as a push gateway, and a Kogito extension for the workflows to push their metrics to the collector, and prometheus will then scrap metrics from the collector instead. Knative documentation uses such collector for its own components as an example:
https://knative.dev/docs/eventing/observability/metrics/collecting-metrics/#understanding-the-collector

Implementation ideas

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions