Homework 2 - Creating Backdoor Detector Design

By Ankush Pratap Singh

├── data 
    └── cl
        └── valid.h5 // this is clean validation data used to design the defense
        └── test.h5  // this is clean test data used to evaluate the BadNet
    └── bd
        └── bd_valid.h5 // this is sunglasses poisoned validation data
        └── bd_test.h5  // this is sunglasses poisoned test data
├── models
    └── bd_net.h5
    └── bd_weights.h5
    └── B_prime_2.h5
    └── B_prime_4.h5
    └── B_prime_10.h5
├── Homework_2_ax2047.ipynb
└── eval.py // this is the evaluation script

I. Dependencies

Python 3.6.9
Keras 2.3.1
Numpy 1.16.3
Matplotlib 2.2.2
H5py 2.9.0
TensorFlow-gpu 1.15.2
Pandas
Seaborn

II. Data

Download the validation and test datasets from here and store them under data/ directory.
The dataset contains images from YouTube Aligned Face Dataset. We retrieve 1283 individuals and split into validation and test datasets.
bd_valid.h5 and bd_test.h5 contains validation and test images with sunglasses trigger respectively, that activates the backdoor for bd_net.h5.

III. Running and Creating Backdoored Model

To create B_prime models, run each cell of the file Homework_2_ax2047.ipynb.
Make sure to uncomment the part which is corresponding to the required threshold value (drop in clean_accuracy_valid) and comment out the parts which are for other threshold values.
Only once you have all the three model file corresponding to different threshold values, then only proceed to the comparison section. It will throw an error if you do not create all the three different B_prime models.
Exisiting created B_prime models can also be used which are present in models/ directory named as B_prime_2.h5, B_prime_4.h5 and B_prime_10.h5.

III. Evaluating the Backdoored Model

The DNN architecture used to train the face recognition model is the state-of-the-art DeepID network.
To evaluate the backdoored model, execute eval.py by running:
python3 eval.py <clean validation data directory> <poisoned validation data directory> <model directory>.

E.g., python3 eval.py data/cl/valid.h5 data/bd/bd_valid.h5 models/B_prime_10.h5.

This will output:

Clean Classification accuracy: 84.43751623798389

Attack Success Rate: 77.015675067117

IV. Important Notes

Please use only clean validation data (valid.h5) to design the pruning defense. And use test data (test.h5 and bd_test.h5) to evaluate the models.

For any further enquires, drop a mail on ankushpratap@nyu.edu

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Homework 2 - Creating Backdoor Detector Design

I. Dependencies

II. Data

III. Running and Creating Backdoored Model

III. Evaluating the Backdoored Model

IV. Important Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
data		data
models		models
Homework_2_ax2047.ipynb		Homework_2_ax2047.ipynb
README.md		README.md
eval.py		eval.py

Folders and files

Latest commit

History

Repository files navigation

Homework 2 - Creating Backdoor Detector Design

I. Dependencies

II. Data

III. Running and Creating Backdoored Model

III. Evaluating the Backdoored Model

IV. Important Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages