Conversation

@jpinsonneau
Contributor

@jpinsonneau jpinsonneau commented Sep 22, 2025

What this PR does / why we need it:

This PR implements a new OpenShift mode supporting both logging and network logs at the same time. Despite the size of the PR, the change itself is trivial.

Which issue(s) this PR fixes:
Fixes #

Special notes for your reviewer:

You will need observatorium/opa-openshift#37 to test this PR.
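
For reviewers less familiar with the operator side, here is a minimal sketch of how the combined tenancy could be selected on a LokiStack, assuming the new mode is chosen through spec.tenants.mode like the existing openshift-logging and openshift-network modes. The mode value "openshift" below is only a placeholder inferred from the PR title, and the storage secret/class names are illustrative, not defined by this PR:

# Hypothetical sketch: the mode name and the storage secret/class are placeholders,
# not values taken from this PR.
oc apply -f - <<EOF
apiVersion: loki.grafana.com/v1
kind: LokiStack
metadata:
  name: lokistack-netobserv-loki
  namespace: netobserv
spec:
  size: 1x.small
  storageClassName: gp3-csi          # illustrative storage class
  storage:
    secret:
      name: loki-s3                  # illustrative object storage secret
      type: s3
  tenants:
    mode: openshift                  # placeholder for the new combined logging + network mode
EOF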

Checklist

  • Reviewed the CONTRIBUTING.md guide (required)
  • Documentation added
  • Tests updated
  • Title matches the required conventional commits format, see here
    • Note that Promtail is considered to be feature complete, and future development for logs collection will be in Grafana Alloy. As such, feat PRs are unlikely to be accepted unless a case can be made for the feature actually being a bug fix to existing behavior.
  • Changes that require user attention or interaction to upgrade are documented in docs/sources/setup/upgrade/_index.md
  • If the change is deprecating or removing a configuration option, update the deprecated-config.yaml and deleted-config.yaml files respectively in the tools/deprecated-config-checker directory. Example PR

@jpinsonneau jpinsonneau changed the title DRAFT openshift mode Openshift mode Sep 24, 2025
@jpinsonneau jpinsonneau changed the title Openshift mode feat: openshift mode to allow both logging and network Sep 24, 2025
@memodi

memodi commented Dec 12, 2025

@jpinsonneau - I did a performance test on this and Loki is able to ingest both logging and network logs well; I am not seeing prominent log loss or any errors from Loki.

I set up 5 worker nodes and, for logging, had 8 pods within each namespace generating 90,000 log lines every 30 minutes. I had 5 such namespaces, bringing the total to 40 pods. I also added the netobserv workload on top of it.

You can see below the data for one such namespace, which was generating logs at the above-mentioned rate. For the most part it collected 720,000 log lines (90,000 * 8 pod replicas) starting at the 30-minute mark. There was some loss in the trailing data, but I am unsure whether it was something the collector dropped or Loki. I did not see Loki errors during the test.

$ curl --globoff -k -H "Authorization: Bearer $TOKEN" https://lokistack-netobserv-loki.apps.memodi-shared-loki.qe-lrc.devcluster.openshift.com/api/logs/v1/application/loki/api/v1/query_range --data-urlencode 'query=sum(count_over_time({log_type="application",kubernetes_namespace_name="log-gen-3"}[60m]))' --data-urlencode 'step=5m' | jq '.data.result[0].values'
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100  3272    0  3139  100   133   6024    255 --:--:-- --:--:-- --:--:--  6268
[
  [
    1765508400,
    "286650"
  ],
  [
    1765508700,
    "406275"
  ],
  [
    1765509000,
    "525950"
  ],
  [
    1765509300,
    "645575"
  ],
  [
    1765509600,
    "720000"
  ],
  [
    1765509900,
    "720000"
  ],
  [
    1765510200,
    "720000"
  ],
  [
    1765510500,
    "720000"
  ],
  [
    1765510800,
    "720000"
  ],
  [
    1765511100,
    "720000"
  ],
  [
    1765511400,
    "672625"
  ],
  [
    1765511700,
    "553000"
  ],
  [
    1765512000,
    "433350"
  ]
]
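
For completeness, the flows ingested on the network side could be checked the same way through the gateway's network tenant. This is a sketch under the assumption that the combined mode keeps the network tenant path used by the openshift-network mode and that NetObserv flows carry the app="netobserv-flowcollector" label; adjust the selector to whatever labels your FlowCollector actually writes:

$ curl --globoff -k -H "Authorization: Bearer $TOKEN" https://lokistack-netobserv-loki.apps.memodi-shared-loki.qe-lrc.devcluster.openshift.com/api/logs/v1/network/loki/api/v1/query_range --data-urlencode 'query=sum(count_over_time({app="netobserv-flowcollector"}[60m]))' --data-urlencode 'step=5m' | jq '.data.result[0].values'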

@jpinsonneau
Contributor Author

Awesome, thanks for testing this @memodi!
Let's try to gather more feedback. I'll also rebase and fix the tests here.
