
Conversation

Contributor

@Jyc323 Jyc323 commented Sep 15, 2025

Summary

This PR introduces a hierarchical metrics module and a consistent entrypoint for training/eval. It combines leaf accuracy, full-path accuracy, prediction path-consistency ratios, and label-count–weighted precision into a single MetricCollection that plugs into existing OTX/torchmetrics flows.

  • LeafAccuracy
    Macro-averaged correctness at the leaf level (final decision); mitigates class-imbalance bias.

  • FullPathAccuracy
    Strict correctness across all levels; prevents inflated scores from partial matches.

  • InconsistentPathRatio (predictions)
    Detects taxonomy-violating predictions (child not under parent); highlights model–hierarchy mismatches.

  • WeightedHierarchicalPrecision
    Per-level macro precision aggregated with label-count weights; balances coarse vs. fine levels and stays robust to imbalance.
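The metrics above can be illustrated with a small, self-contained sketch. This is plain Python, not the PR's actual torchmetrics implementation; the function names, the path-tuple encoding, and the toy taxonomy are illustrative assumptions only:

```python
# Toy illustration of the hierarchical metrics (NOT the PR's code).
# Each prediction/target is a tuple of labels from root level down to leaf.

def full_path_accuracy(preds, targets):
    """Fraction of samples whose entire predicted path matches the target."""
    hits = sum(p == t for p, t in zip(preds, targets))
    return hits / len(targets)

def inconsistent_path_ratio(preds, taxonomy):
    """Fraction of predictions with a child label not under its parent.

    `taxonomy` maps each parent label to the set of valid child labels.
    """
    bad = 0
    for path in preds:
        for parent, child in zip(path, path[1:]):
            if child not in taxonomy.get(parent, set()):
                bad += 1
                break
    return bad / len(preds)

def weighted_hierarchical_precision(per_level_precision, labels_per_level):
    """Aggregate per-level macro precision with label-count weights."""
    total = sum(labels_per_level)
    return sum(p * n for p, n in zip(per_level_precision, labels_per_level)) / total

taxonomy = {"animal": {"dog", "cat"}, "vehicle": {"car"}}
targets = [("animal", "dog"), ("vehicle", "car")]
preds = [("animal", "dog"), ("animal", "car")]  # second path violates taxonomy

print(full_path_accuracy(preds, targets))        # 0.5
print(inconsistent_path_ratio(preds, taxonomy))  # 0.5
# Two coarse labels at level 0, eight fine labels at level 1: the fine
# level dominates the aggregate, matching its larger label space.
print(weighted_hierarchical_precision([0.9, 0.7], [2, 8]))  # 0.74
```

Note how full-path accuracy and the inconsistency ratio answer different questions: the second prediction above gets the leaf right but still counts as both a path miss and a taxonomy violation.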

How to test

pytest -q lib/tests/unit/metrics/test_hier_metric_collection.py

Checklist

  • I have added unit tests to cover my changes.
  • I have added integration tests to cover my changes.
  • I have run e2e tests and there are no issues.
  • I have added the description of my changes into CHANGELOG in my target branch (e.g., CHANGELOG in develop).
  • I have updated the documentation in my target branch accordingly (e.g., documentation in develop).
  • I have linked related issues.

License

  • I submit my code changes under the same Apache License that covers the project.
    Feel free to contact the maintainers if that's a concern.
  • I have updated the license header for each file (see an example below).
# Copyright (C) 2025 Intel Corporation
# SPDX-License-Identifier: Apache-2.0

Member

@sovrasov sovrasov left a comment


Thanks @Jyc323 for extending OTX h-cls metrics collection!
Considering #4706, it makes sense to create a chapter for h-cls in https://open-edge-platform.github.io/training_extensions/latest/guide/tutorials/advanced/index.html
As in the loss PR, there should be a way for OTX users to quickly utilize the new metrics by modifying the model recipe. Also, it might be worth extending the default OTX h-cls metrics collection to make the defaults more expressive right out of the box.

@Jyc323
Contributor Author

Jyc323 commented Sep 19, 2025

Hi @sovrasov, thanks for the review and comments. Per your feedback, I removed the typing imports and now use the built-in generics (e.g., list, dict) in this push—please take another look. I also fixed all issues reported by tox -vv -e pre-commit.

As an extension, I integrated the current hierarchical accuracy calculation into hier_metric_collection_callable. When using otx.engine for training or testing, users can pass the metric parameter to enable it. I added a corresponding unit test at tests/unit/metrics/test_hier_metric_collection_from_engine.py. I’d appreciate your thoughts and any suggestions.
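A rough sketch of the callable pattern described here: the engine is handed a factory that, given label metadata, builds the metric set. All names and the dict-based return type below are hypothetical stand-ins for illustration; the real hier_metric_collection_callable in the PR returns a torchmetrics MetricCollection:

```python
# Illustrative stand-in for the metric-callable pattern (NOT OTX's real API).

def make_hier_metrics(labels_per_level):
    """Return a name -> metric-fn mapping covering every hierarchy level."""
    def level_accuracy(level):
        def metric(preds, targets):
            hits = sum(p[level] == t[level] for p, t in zip(preds, targets))
            return hits / len(targets)
        return metric
    return {f"accuracy_level_{i}": level_accuracy(i)
            for i in range(len(labels_per_level))}

# An engine-like caller invokes the factory once it knows the label schema,
# then evaluates each metric on path-encoded predictions and targets.
metrics = make_hier_metrics(labels_per_level=[2, 8])
preds = [(0, 3), (1, 5)]
targets = [(0, 3), (0, 5)]
results = {name: fn(preds, targets) for name, fn in metrics.items()}
print(results)  # {'accuracy_level_0': 0.5, 'accuracy_level_1': 1.0}
```

The point of the factory indirection is that the metric set depends on the dataset's label structure, which is only known once the engine has loaded the data.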

@sovrasov
Member

Thanks for the update! I see there is a way for users to utilize the new metrics, but it's hard to find it without a careful review of the OTX source code. What I'm suggesting here is to add a tutorial on how to use the new h-cls features via both the OTX recipe and the API. You can check other tutorials, for instance docs/source/guide/tutorials/advanced/peft.rst, to get an idea of how to write one. The docs can be built by running cd lib && tox -e build-doc

@sovrasov
Member

@kprokofi will temporarily cover for me, as I'm off next week

@github-actions github-actions bot added the DOC Improvements or additions to documentation label Sep 20, 2025
@Jyc323
Contributor Author

Jyc323 commented Sep 20, 2025

@kprokofi I added the documentation under docs/source/guide/tutorials/advanced as suggested. Please have a look and share any feedback, since I'm new to the documentation tooling.

@Jyc323
Contributor Author

Jyc323 commented Sep 29, 2025

Hi @sovrasov, thanks for the review. I've looked into the errors from the Unit-Test-with-Python3.12 run, but they don't appear to be caused by my changes. Could you please advise on what I should do?

=========================== short test summary info ============================
ERROR tests/unit/backend/native/models/detection/test_yolox.py - OSError: [Er...
ERROR tests/unit/backend/native/models/instance_segmentation/test_maskrcnn.py
ERROR tests/unit/backend/native/models/segmentation/test_dino_v2_seg.py - OSE...
ERROR tests/unit/backend/native/models/segmentation/test_segnext.py - OSError...
!!!!!!!!!!!!!!!!!!! Interrupted: 4 errors during collection !!!!!!!!!!!!!!!!!!!!
================== 22 warnings, 4 errors in 85.47s (0:01:25) ===================

As for 'Required Check lib-lint-and-test', I don't think the logs provide enough information; could you please tell me where I can find more details?

Error: Required status checks failed. They must succeed before this pull request can be merged.
Error: Process completed with exit code 1.

Thanks for your help

@sovrasov
Member

Indeed, those may be unrelated CI errors that were already fixed in develop. Could you merge the changes from develop and resolve the conflicts first? Then we can run the CI one more time.


@Jyc323 Jyc323 requested a review from a team as a code owner October 2, 2025 00:22
@Jyc323
Contributor Author

Jyc323 commented Oct 2, 2025

Hi @sovrasov, I've merged develop and resolved the conflicts; please let me know if there are any issues. Thanks a lot!

@sovrasov
Member

sovrasov commented Oct 2, 2025

Unfortunately, CI is still not in the best shape; the ETA for a fix is next week.
BTW, did you double-check whether my latest review comment in this PR makes sense?

@Jyc323
Contributor Author

Jyc323 commented Oct 3, 2025

Hi @sovrasov, I think your latest review comment makes sense. I've fixed it and pushed the change. Please let me know if there is anything else I need to do on my side. Have a great day!

@sovrasov sovrasov merged commit 20ea100 into open-edge-platform:develop Oct 6, 2025
37 checks passed

Labels

DOC (Improvements or additions to documentation), GSoC, TEST (Any changes in tests)


3 participants