I have implemented three types of adversarial attacks against a trained CNN model, along with a defense algorithm to counter them. The dataset used is MNIST.
Adversarial examples are inputs to machine learning models that an attacker has intentionally designed to cause the model to make a mistake. They’re like optical illusions for machines.
I have implemented three white-box attacks:
- Fast Gradient Sign Method
- Iterative Fast Gradient Sign Method
- Momentum Iterative Fast Gradient Sign Method
Below is the training and validation loss across all epochs.
Test Accuracy after FGSM attack
Examples of some adversarial images:
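For reference, the core of FGSM is a single step along the sign of the input gradient: x_adv = x + eps * sign(∇x J(x, y)). Below is a minimal PyTorch sketch of such a step; the model handle, the epsilon value, and the [0, 1] pixel range are assumptions and may differ from the actual code in this repository.

```python
import torch
import torch.nn.functional as F

def fgsm_attack(model, images, labels, eps=0.25):
    """One-step FGSM: perturb each pixel by eps in the direction of the loss gradient's sign."""
    images = images.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(images), labels)
    loss.backward()
    adv = images + eps * images.grad.sign()
    # Keep pixels inside the valid [0, 1] range assumed for normalized MNIST images
    return adv.clamp(0, 1).detach()
```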
Test Accuracy after I-FGSM attack
Examples of some adversarial images:
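I-FGSM applies the same gradient-sign step repeatedly with a small step size, projecting the result back into an eps-ball around the original image after every iteration. A minimal PyTorch sketch, with the step size, iteration count, and eps chosen only for illustration:

```python
import torch
import torch.nn.functional as F

def ifgsm_attack(model, images, labels, eps=0.25, alpha=0.01, iters=40):
    """Iterative FGSM: repeated small gradient-sign steps, clipped to an eps-ball around the input."""
    orig = images.clone().detach()
    adv = images.clone().detach()
    for _ in range(iters):
        adv.requires_grad_(True)
        loss = F.cross_entropy(model(adv), labels)
        grad = torch.autograd.grad(loss, adv)[0]
        adv = adv.detach() + alpha * grad.sign()
        # Project back into the eps-ball and the valid pixel range
        adv = torch.max(torch.min(adv, orig + eps), orig - eps).clamp(0, 1)
    return adv
```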
Test Accuracy after MI-FGSM attack
Examples of some adversarial images:
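MI-FGSM extends the iterative attack with a momentum term: at each iteration the L1-normalized gradient is accumulated into a velocity vector g, and the step is taken along sign(g). Again a minimal PyTorch sketch; the hyperparameters (including the decay factor mu) are illustrative, not the repository's actual settings.

```python
import torch
import torch.nn.functional as F

def mifgsm_attack(model, images, labels, eps=0.25, alpha=0.01, iters=40, mu=1.0):
    """Momentum Iterative FGSM: accumulate L1-normalized gradients into a momentum buffer."""
    orig = images.clone().detach()
    adv = images.clone().detach()
    g = torch.zeros_like(images)
    for _ in range(iters):
        adv.requires_grad_(True)
        loss = F.cross_entropy(model(adv), labels)
        grad = torch.autograd.grad(loss, adv)[0]
        # Normalize the gradient by its per-image L1 norm before updating the momentum term
        g = mu * g + grad / grad.abs().sum(dim=(1, 2, 3), keepdim=True).clamp_min(1e-12)
        adv = adv.detach() + alpha * g.sign()
        # Project back into the eps-ball and the valid pixel range
        adv = torch.max(torch.min(adv, orig + eps), orig - eps).clamp(0, 1)
    return adv
```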
To counter the above attacks, defensive distillation was implemented.
Below are the training and validation losses for networkf and networkf1.
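For context, defensive distillation first trains networkf on the hard MNIST labels with its softmax run at a high temperature T, and then trains networkf1 (same architecture) on networkf's softened probabilities at the same temperature; at test time the temperature is set back to 1. The sketch below shows one training step of the distilled network in PyTorch, assuming both networks return raw logits and an illustrative T = 20; the actual temperature and training loop in this repository may differ.

```python
import torch
import torch.nn.functional as F

def distillation_step(networkf, networkf1, images, optimizer, T=20.0):
    """One training step of networkf1 against networkf's temperature-softened outputs."""
    with torch.no_grad():
        # Soft labels: the trained network's logits passed through a softmax at temperature T
        soft_labels = F.softmax(networkf(images) / T, dim=1)
    optimizer.zero_grad()
    # The distilled network is trained at the same temperature, using cross-entropy
    # against the soft labels rather than the original hard labels
    log_probs = F.log_softmax(networkf1(images) / T, dim=1)
    loss = -(soft_labels * log_probs).sum(dim=1).mean()
    loss.backward()
    optimizer.step()
    return loss.item()
```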
Below is the test accuracy after defending against the FGSM attack
Examples of the predictions after defense:
Below is the test accuracy after defending against the I-FGSM attack
Examples of the predictions after defense:
Below is the test accuracy after defending against the MI-FGSM attack
Examples of the predictions after defense:
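For completeness, the accuracies above can be reproduced with a loop like the one below, which crafts adversarial examples against the distilled network and reports the fraction it still classifies correctly. This is only a sketch; the attack function and data loader names (e.g. `fgsm_attack`, `test_loader`) are assumptions tied to the sketches above.

```python
import torch

def adversarial_accuracy(network, attack_fn, test_loader, device="cpu"):
    """Fraction of adversarial test images that the network still classifies correctly."""
    network.eval()
    correct, total = 0, 0
    for images, labels in test_loader:
        images, labels = images.to(device), labels.to(device)
        adv = attack_fn(network, images, labels)   # craft adversarial examples
        with torch.no_grad():
            preds = network(adv).argmax(dim=1)     # classify the perturbed images
        correct += (preds == labels).sum().item()
        total += labels.size(0)
    return correct / total

# e.g. adversarial_accuracy(networkf1, fgsm_attack, test_loader)
```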