
AiR-TK Logo


AI Robustness Testing Kit (AiR-TK) is an AI testing framework built upon PyTorch that enables the AI security community to evaluate AI models against adversarial attacks easily and comprehensively. Furthermore, AiR-TK supports adversarial training, the de facto technique for improving the robustness of AI models against adversarial attacks. Having easy access to state-of-the-art adversarial attacks and baseline adversarial training methods in one place helps the AI security community replicate, reuse, and improve upcoming attack and defense methods.

Although other solutions such as the Adversarial Robustness Toolbox (ART) and MAIR have addressed this problem in the past, they are not as comprehensive in the breadth of attacks and defenses they provide.

Use Cases

Our tool fills a current gap in the AI security world: the need for a simple way to evaluate the robustness of AI models against adversarial attacks.

Key Benefits:

  • Ease of Use: Our tool simplifies the process of evaluating existing models and evaluation methods.
  • Centralized Maintenance: By centralizing various functionalities, our tool reduces the complexity of managing multiple libraries and tools. This streamlined approach allows for more efficient updates and maintenance.
  • Enhanced Usability: We prioritize user experience, ensuring that our tool is user-friendly. This focus on usability means you can spend more time on model development and less time on troubleshooting.

Installation

Our work is available via this repository and as a PyPI package.

From PyPI (Recommended)

python3 -m pip install airtk

From Repo Source (Not Recommended)

To install from source, you will need:

  • The Conda environment manager.
  • The Git version control system.
git clone https://github.com/LAiSR-SK/AiRobustnessTestingKit-AiR-TK-
cd AiRobustnessTestingKit-AiR-TK-
conda env create -p .conda

conda activate ./.conda
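
Either way, you can sanity-check the installation by importing the package (this only confirms that airtk is importable):

python3 -c "import airtk"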

Contents

Attacks

# Combine different attacks, depending on your needs.
# Assumption: the attack classes are importable from airtk.attack, and
# `model`, `images`, and `labels` are your classifier and a batch of
# inputs/targets.
from airtk.attack import (
    SINIFGSM, SPSA, SparseFool, Square, TPGD, UPGD, VANILA, VMIFGSM, VNIFGSM,
)

attacks = [
    VNIFGSM,
    VMIFGSM,
    VANILA,
    UPGD,
    TPGD,
    Square,
    SPSA,
    SparseFool,
    SINIFGSM,
]

adversarial_examples = {}
for attack in attacks:
    attacker = attack(model)
    adversarial_example = attacker(images, labels)
    adversarial_examples[attack.__name__] = adversarial_example

We support the following attacks:

  • VNIFGSM: Variance-Tuning Iterative Fast Gradient Sign Method
  • VMIFGSM: Variance-Tuning Momentum Iterative Fast Gradient Sign Method
  • VANILA: Vanilla (returns the clean images unchanged; useful as a baseline)
  • UPGD: Ultimate Projected Gradient Descent
  • TPGD: Theoretically Principled PGD (KL-divergence-based PGD from the TRADES paper)
  • Square: Square Attack
  • SPSA: Simultaneous Perturbation Stochastic Approximation
  • SparseFool: SparseFool Attack
  • SINIFGSM: Scale-Invariant Nesterov Iterative Fast Gradient Sign Method
  • RFGSM: Randomized Fast Gradient Sign Method
  • PGDRSL2: Projected Gradient Descent for Randomized Smoothing, L2 norm
  • PGDRS: Projected Gradient Descent for Randomized Smoothing
  • PGDL2: Projected Gradient Descent L2
  • NIFGSM: Nesterov Iterative Fast Gradient Sign Method
  • MIFGSM: Momentum Iterative Fast Gradient Sign Method
  • JSMA: Jacobian-based Saliency Map Attack
  • FFGSM: Fast FGSM (FGSM with random initialization)
  • FAB: Fast Adaptive Boundary Attack
  • EOTPGD: Expectation Over Transformation Projected Gradient Descent
  • EADL1: Elastic-net Attack to DNNs (EAD) with the L1 decision rule
  • EADEN: Elastic-net Attack to DNNs (EAD) with the elastic-net decision rule
  • DIFGSM: Diverse Input Fast Gradient Sign Method
  • BIM: Basic Iterative Method
  • AutoAttack: AutoAttack
  • APGDT: Auto-PGD, Targeted
  • APGD: Auto-PGD (Auto Projected Gradient Descent)
  • FGSM: Fast Gradient Sign Method
  • PGD: Projected Gradient Descent
  • CW: Carlini & Wagner Attack
  • DeepFool: DeepFool Attack
  • OnePixel: One Pixel Attack
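
As a concrete illustration, the following sketch runs a single PGD attack and measures robust accuracy on one batch. The import path airtk.attack and the constructor parameters (eps, alpha, steps) are assumptions based on the torchattacks-style API shown above; check the package for the exact signatures.

from airtk.attack import PGD  # import path assumed

# `model`, `images`, and `labels` are placeholders for your classifier and
# a batch of inputs/targets.
attacker = PGD(model, eps=8 / 255, alpha=2 / 255, steps=10)  # parameter names assumed
adv_images = attacker(images, labels)

# Robust accuracy: fraction of adversarial examples still classified correctly
preds = model(adv_images).argmax(dim=1)
robust_acc = 100 * (preds == labels).float().mean().item()
print(f"Robust accuracy: {robust_acc:.2f}%")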

Defenses

You can import and use our defenses as shown:

from airtk.defense import TradesTraining

if __name__ == "__main__":
    # Initialize the TRADES training function for CIFAR-10 with a ResNet-101
    training = TradesTraining("cifar10",
                              "res101",
                              batch_size=512,
                              epochs=100,
                              lr=0.01,
                              seed=0,
                              model_dir="data/model/TRADES/",
                              save_freq=10)

    # Run the specified training regime
    training()

We support the following defenses:

  • Adversarial Distributional Training (ADT)
  • Adversarial Distributional Training++ (ADT++)
  • Adversarial Training with Adversarial Weight Perturbation (ATAWP)
  • Curriculum Adversarial Training (Currat)
  • Federated Adversarial Training (FAT)
  • Feature Scatter (FS)
  • Geometry Aware Instance Reweighted Adversarial Training (GAIRAT)
  • TRadeoff-inspired Adversarial DEfenses via Surrogate loss minimization (TRADES)
  • TRADES with Adversarial Weight Perturbation (TRADESAWP)
  • Various Attacks (VA)
  • You Only Propagate Once (YOPO)

Most of these training classes accept the following keyword arguments (see the sketch after this list):

  • dataset_name: name of the dataset to use
  • model_name: name of the model to use
  • epochs: number of epochs to train / test for
  • batch_size: size of training and testing batches
  • eps: size of image perturbations
  • model_dir: directory to save models to
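
For instance, assuming the other defenses follow the same Training-class pattern as TradesTraining (the class name GairatTraining below is illustrative, not confirmed), these keywords are passed at construction:

from airtk.defense import GairatTraining  # class name assumed

training = GairatTraining(dataset_name="cifar10",
                          model_name="res101",
                          epochs=50,
                          batch_size=256,
                          eps=8 / 255,
                          model_dir="data/model/GAIRAT/")
training()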

Pretrained Models

In order to expedite progress in the field of secure AI, we have provided the weights of our trained models on Hugging Face. These can be loaded via from_pretrained and then evaluated or further trained:

import torch
from torch.utils.data import DataLoader

from airtk.data import CIFAR100
from airtk.model import ResNet50

if __name__ == "__main__":
    torch.set_default_device("cuda")

    # 1. Load the pretrained model
    model: ResNet50 = ResNet50.from_pretrained("LAiSR-SK/curriculum-at-cifar100-res50")
    model.eval()

    # 2. Evaluate the model against the CIFAR-100 test set
    testset: CIFAR100 = CIFAR100(root="data/", train=False, download=True)
    test_loader: DataLoader = DataLoader(testset, batch_size=256, shuffle=True)

    total: int = 0
    correct: int = 0
    with torch.no_grad():
        for x, y in test_loader:
            logits = model(x)
            _, predicted = torch.max(logits, 1)

            correct += (predicted == y).sum().item()
            total += y.size(0)

    acc: float = 100 * correct / total

    print(f"Accuracy: {acc:.2f}%")

Future Direction

In the near future, AiR-TK will include attacks and defenses for the most recent text-based, LLM, and diffusion models.

Disclaimer -- A message from the Director of LAiSR Research Group

  • AiR-TK is built upon source code from the original authors and other AI frameworks such as MAIR and IBM ART. When using this tool, we recommend citing both the tool itself and the corresponding attack and defense methods.
  • This tool is publicly available to the AI security community to improve AI robustness and make AI more secure and safe to use. It must be used in an ethical way that is aligned with U.S. and international law; it is not built to be used in an unethical manner.

Acknowledgment

We would like to thank the following contributors:

  • Samer Khamaiseh, Ph.D. | Director of LAiSR Research Group
  • Steven Chiacchira | Research Assistant @ LAiSR Research Group
  • Aibak Al-jadayh | Research Assistant @ LAiSR Research Group
  • Deirdre Jost | Research Assistant @ LAiSR Research Group

Cite Us

See CITATION.cff or the sidebar for details on how to cite our work.
