Add OnePixelShortcutAttack poisoning attack and its unit tests #2720
Conversation
Force-pushed from fbed197 to 17257dc (compare)
Codecov Report ❌ Patch coverage is
Additional details and impacted files

@@            Coverage Diff             @@
##          dev_1.21.0    #2720   +/-  ##
==============================================
+ Coverage      83.30%   85.22%   +1.92%
==============================================
  Files            330      331       +1
  Lines          29539    29894     +355
  Branches        5007     5023      +16
==============================================
+ Hits           24607    25477     +870
+ Misses          3516     2981     -535
- Partials        1416     1436      +20
Pull Request Overview
This PR adds a new data poisoning attack implementation called OnePixelShortcutAttack to the Adversarial Robustness Toolbox (ART). The attack perturbs a single pixel in each training image at a consistent location per class to create "unlearnable" examples that degrade model accuracy on clean data.
- Implementation of the One Pixel Shortcut (OPS) attack as a new poisoning attack class
- Comprehensive unit test suite validating attack behavior and integration with ART estimators
- Updates to package dependencies and CI configurations
Reviewed Changes
Copilot reviewed 6 out of 6 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| art/attacks/poisoning/one_pixel_shortcut_attack.py | Core implementation of the OnePixelShortcutAttack class with pixel perturbation logic |
| tests/attacks/poison/test_one_pixel_shortcut_attack.py | Comprehensive unit tests covering various scenarios and edge cases |
| art/attacks/poisoning/__init__.py | Adds import for the new attack class |
| requirements_test.txt | Updates dependency versions for testing infrastructure |
| .github/workflows/dockerhub.yml | Updates Docker action versions |
| .github/workflows/ci-huggingface.yml | Adds safetensors dependency and updates filtering logic |
Update: Kindly re-run CI and re-review; I'm happy to adjust further if needed. Thank you @beat-buesser
Hi @beat-buesser, may I know why there is still one pending check for PyTorch 2.6.0 (Python 3.10) ("Expected — Waiting for status to be reported")? From the last run, the only failure was the CI Style Checks, which now pass. Codecov also passed this time with a higher patch coverage percentage. Please let me know if there is anything I need to adjust further. Thank you!
It got replaced by PyTorch 2.8.0; I have now updated the settings for this target branch.
I'll add a review in the coming days. |
Hi @nicholasadriel Thank you for your pull request. I have added a few review comments, could you please take a look?
Are you able to reproduce the results of the paper by Wu et al. with this code?
Force-pushed from 050689d to 4461f32 (compare)
Hi @beat-buesser, I have reviewed the comments, performed a quick test, and everything's fine. Regarding reproducing the results of the paper by Wu et al.: I wasn't able to reproduce the exact numbers, but the core OPS algorithm here mirrors the reference implementation (a single per-class pixel chosen by a stability/deviation criterion; the same coordinate and color applied to all images of that class; labels unchanged; model-free). Reproduction snapshot (CIFAR-10, ResNet-18, 200 epochs): while the absolute accuracies differ, the effect size of the attack is very close, which supports that the implementation captures the intended behavior. The gap in absolute numbers is likely due to training-pipeline differences (data preprocessing/augmentation, normalization, optimizer/schedule, weight decay, and seeds).
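For context on the selection criterion described above, here is a simplified, standalone NumPy sketch of a per-class pixel search. The exact scoring and candidate colors in the PR's implementation may differ, so treat the criterion below (mean deviation from a candidate color minus in-class variance) as an illustrative assumption.

```python
import numpy as np

def select_class_pixel(images: np.ndarray) -> tuple[int, int, np.ndarray]:
    """Pick one (row, col) position and a target color for a single class.

    images: array of shape (N, H, W, C) with values in [0, 1].
    Illustrative criterion: prefer positions where a candidate color lies far
    from the original pixel values on average (large deviation) while the
    original values are stable across the class (small variance).
    """
    n, h, w, c = images.shape
    candidate_colors = [np.zeros(c), np.ones(c)]  # assumption: only "corner" colors searched
    best_score, best = -np.inf, None
    for i in range(h):
        for j in range(w):
            pixels = images[:, i, j, :]                    # (N, C) original values at this position
            for color in candidate_colors:
                deviation = np.abs(color - pixels).mean()  # distance of the target color from the class
                variance = pixels.var(axis=0).mean()       # stability of this position within the class
                score = deviation - variance
                if score > best_score:
                    best_score, best = score, (i, j, color)
    return best

def apply_one_pixel_shortcut(x: np.ndarray, y: np.ndarray) -> np.ndarray:
    """Write the chosen per-class pixel into every image of that class; labels stay unchanged."""
    x_poisoned = x.copy()
    for label in np.unique(y):                 # y assumed to be a 1-D array of integer labels
        idx = np.where(y == label)[0]
        i, j, color = select_class_pixel(x[idx])
        x_poisoned[idx, i, j, :] = color       # same coordinate and color for the whole class
    return x_poisoned
```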
@nicholasadriel I forgot, we still need |
Force-pushed from 4461f32 to 00ecd86 (compare)
Force-pushed from 00ecd86 to 718e818 (compare)
Hi @beat-buesser, thank you for the review. I have added the import statement and corrected a few typing issues based on the last review results. Please kindly re-run CI and re-review; I'm happy to adjust further if needed. Thank you.
Hi @beat-buesser, sorry, can you please re-run CI and re-review again? I had misplaced the `from __future__ import annotations` import, which caused some tests to fail, but I have corrected it just now. Thanks!
Hi @beat-buesser, since the last checks have already passed, what will be the next step for this pull request? Thanks!
Hi @nicholasadriel I think from a review point of view it is now ready for merging. |
Hi @nicholasadriel Thank you very much for your pull request and contributing to ART!
Description
This PR adds a new data poisoning attack class, OnePixelShortcutAttack, to the Adversarial Robustness Toolbox. The class is implemented under the art.attacks.poisoning module, and it introduces support for the One Pixel Shortcut (OPS) attack in ART. A corresponding unit test suite (test_one_pixel_shortcut_attack.py) is also included to validate the correct behavior of the attack implementation.
Motivation
The One Pixel Shortcut attack is a recently proposed poisoning technique that perturbs a single pixel in each training image (in a consistent location per class) to create "unlearnable" examples. This can dramatically degrade a model’s accuracy on clean data without altering the labels. By integrating OPS into IBM ART, we enable standardized evaluation of this attack using ART’s framework and estimators. The implementation has been tested on and extends support to popular image classification datasets such as CIFAR-10, CIFAR-100, UTKFace, and CelebA, ensuring the attack’s broad applicability. Incorporating OPS aligns with ART’s benchmarking and reproducibility goals, expanding the library’s coverage of state-of-the-art poisoning attacks.
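As an illustration of the intended workflow, a minimal usage sketch follows. It assumes the class is importable from art.attacks.poisoning (as this PR sets up), that the constructor requires no arguments, and that the attack exposes the poison(x, y) method common to ART poisoning attacks, returning poisoned data and unchanged labels; actual parameter names and defaults may differ.

```python
import numpy as np
from art.attacks.poisoning import OnePixelShortcutAttack

# Toy stand-in for a training set: 100 RGB images in [0, 1] with 10 classes.
x_train = np.random.rand(100, 32, 32, 3).astype(np.float32)
y_train = np.random.randint(0, 10, size=100)

# Assumed interface: no required constructor arguments and a poison(x, y) method,
# matching the common pattern of ART poisoning attacks.
attack = OnePixelShortcutAttack()
x_poisoned, y_poisoned = attack.poison(x_train, y_train)

# OPS leaves labels untouched and changes only one pixel per image.
assert x_poisoned.shape == x_train.shape
assert np.array_equal(y_poisoned, y_train)
```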
Fixes
No open issue is associated with this PR (new feature contribution).
Type of change
New feature (non-breaking change which adds functionality)
Testing
Unit tests have been added in test_one_pixel_shortcut_attack.py to verify the implementation's correctness.
All tests pass, confirming that the attack behaves as expected and integrates correctly with ART’s data and estimator APIs.
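For illustration, a minimal test of the kind this suite might contain is sketched below; it relies on the same assumed poison(x, y) interface mentioned earlier and is not a copy of the tests added in this PR.

```python
import numpy as np
from art.attacks.poisoning import OnePixelShortcutAttack

def test_one_pixel_shortcut_changes_at_most_one_pixel():
    rng = np.random.default_rng(0)
    x = rng.random((20, 8, 8, 3)).astype(np.float32)
    y = rng.integers(0, 2, size=20)

    # Assumed interface: poison(x, y) returns poisoned images and labels.
    x_p, y_p = OnePixelShortcutAttack().poison(x, y)

    # Shape and labels are preserved; at most a single pixel per image changes.
    assert x_p.shape == x.shape
    assert np.array_equal(y_p, y)
    changed = np.any(x_p != x, axis=-1)  # (N, H, W) mask of changed pixel locations
    assert changed.reshape(len(x), -1).sum(axis=1).max() <= 1
```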
Test Configuration
No additional configuration or dependencies are required for this feature. The OnePixelShortcutAttack can be used out-of-the-box with ART’s existing classifiers and datasets, similar to other poisoning attacks in the library.
Checklist
Reference
Shutong Wu, Sizhe Chen, Cihang Xie, and Xiaolin Huang. One-Pixel Shortcut: On the Learning Preference of Deep Neural Networks. In Proceedings of ICLR 2023.