Feature Guidance Attack (FGA) for vision-language pre-training (VLP) models. The approach covers the ALBEF, TCL, CLIP, and BEiT3 models, as well as the VE (Visual Entailment), VG (Visual Grounding), VR (Visual Reasoning), VQA (Visual Question Answering), ZC (Zero-shot Classification), and ITR (Image-Text Retrieval) tasks.

Libertax-coder/FGA

FGA

The code is still being organized.
Writing all of it is a lot of work for one person, so I appreciate your patience.
If you need it urgently, you can contact me by email,
and I can provide the unorganized source code.

The code is mainly based on the following two works:
Co-Attack: https://github.com/adversarial-for-goodness/Co-Attack
Set-level Guidance Attack: https://github.com/Zoky-2020/SGA
It also builds on these foundational works:
CLIP: https://github.com/openai/CLIP
ALBEF: https://github.com/salesforce/ALBEF
BLIP: https://github.com/salesforce/BLIP
We are very grateful for their open-source work, which enabled us to complete our own work, FGA.
