Skip to content

Commit 4c36d00

Browse files
vinnamkimSongki Choi
andauthored
Add documentation for the noisy label detection feature (#2034)
* Add documentation for noisy label detection feature Signed-off-by: Kim, Vinnam <[email protected]> * Update CHANGELOG.md Signed-off-by: Kim, Vinnam <[email protected]> * Add documentation Signed-off-by: Kim, Vinnam <[email protected]> * Fix typo Signed-off-by: Kim, Vinnam <[email protected]> * Fix small typo --------- Signed-off-by: Kim, Vinnam <[email protected]> Co-authored-by: Songki Choi <[email protected]>
1 parent 5b3116c commit 4c36d00

File tree

3 files changed

+29
-0
lines changed

3 files changed

+29
-0
lines changed

CHANGELOG.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -11,6 +11,7 @@ All notable changes to this project will be documented in this file.
1111
- Support export of onnx models (<https://github.com/openvinotoolkit/training_extensions/pull/1976>)
1212
- Add option to save images after inference in OTX CLI demo together with demo in exportable code (<https://github.com/openvinotoolkit/training_extensions/pull/2005>)
1313
- Support storage cache in Apache Arrow using Datumaro for cls, det, seg tasks (<https://github.com/openvinotoolkit/training_extensions/pull/2009>)
14+
- Add noisy label detection for multi-class classification task (<https://github.com/openvinotoolkit/training_extensions/pull/1985>, <https://github.com/openvinotoolkit/training_extensions/pull/2034>)
1415

1516
### Enhancements
1617

docs/source/guide/explanation/additional_features/index.rst

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -10,3 +10,4 @@ Additional Features
1010
hpo
1111
auto_configuration
1212
xai
13+
noisy_label_detection
Lines changed: 27 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,27 @@
1+
Noisy label detection
2+
=====================
3+
4+
OpenVINO™ Training Extensions provide a feature for detecting noisy labels during model training.
5+
With this feature, you can identify noisy labeled samples in your training dataset.
6+
Our algorithm accumulates the training loss dynamics during the model training
7+
and exports it to `Datumaro <https://github.com/openvinotoolkit/datumaro>`_.
8+
The training loss dynamics are then post-processed by exponential moving average (EMA),
9+
a strong criterion for detecting noisy label samples [1]_.
10+
Finally, Datumaro ranks the top-k samples, which can be considered as noisy labeled candidates.
11+
We provide an end-to-end example written in a Jupyter notebook, which you can find at the link in the note below.
12+
13+
In OpenVINO™ Training Extensions CLI, you can enable this feature
14+
by adding the argument ``--algo_backend.enable_noisy_label_detection true`` as follows.
15+
16+
.. code-block::
17+
18+
$ otx train ... params --algo_backend.enable_noisy_label_detection true
19+
20+
.. note::
21+
Currently, it only supports multi-class classification task and single GPU training.
22+
23+
.. note:: **Important!**
24+
The post-processing step to analyze the training loss dynamics requires `Datumaro <https://github.com/openvinotoolkit/datumaro>`_.
25+
Please see `this end-to-end Jupyter notebook example <https://github.com/openvinotoolkit/datumaro/blob/develop/notebooks/10_noisy_label_detection.ipynb>`_.
26+
27+
.. [1] Zhou, Tianyi, Shengjie Wang, and Jeff Bilmes. "Robust curriculum learning: from clean label detection to noisy label self-correction." International Conference on Learning Representations. 2021.

0 commit comments

Comments
 (0)