Skip to content

sakhan-1111/ArtiFace

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 

Repository files navigation

ArtiFace: A dataset for deepfake detection.

Paper:

Abstract: Deepfake technology’s rise has led to a surge in false identities, creating a significant and present problem with broad societal ramifications. Concerns over identity theft, harassment, and the dissemination of false information have escalated due to the simplicity with which deepfaked facial images can now be produced and distributed thanks to the broad availability of generative AI tools like Generative Adversarial Networks (GANs). The availability of these tools has political ramifications since it can degrade public opinion and damage institutional trust. As such, the ability to identify deepfake face images has become essential. Ensuring a person’s identity is critical in preventing the dissemination of false information on social media. Detection of deepfake facial images is also necessary for identity verification in border control, law enforcement, and security applications. To effectively and precisely recognize deepfake face images, this study effort has focused on modifying transfer learning models, such as ResNet101V2, MobileNetV2, NASNetLarge, NASNetMobile, DenseNet121, DenseNet169, DenseNet201, and Xception.

Update:

  • 2024-09-03 The paper has been accepted to IEEE UEMCON 2024
  • 2024-08-26 Dataset is available on Kaggle.

Dataset Description:

  • Total number of images: 106,650
  • Number of real images: 53,368
  • Number of fake images: 53,282
  • Sources of fake images: ProjectedGAN, StarGAN, Stable Diffusion, and Taming Transformer
  • Sources of real images: CIPS and FFHQ
  • Image Resolution: 200 x 200
  • Image format: jpg

Dataset Folder Structure:

2.png

Fig. 1. Folder structure and number of images in the ArtiFace dataset.

Samples of Fake Images:

1.png

Fig. 2. Example of deepfaked faces using (a) ProjectedGAN, (b) Stable Diffusion, (c) StarGAN, and (d) Taming Transformer.

Dataset Download:

The dataset is hosted on Kaggle. The dataset can be downloaded using the link below:

ArtiFace Dataset

How to use:

Extract the dataset using the command below:

tar -xvf ArtiFace.tar.xz

Citation:

@INPROCEEDINGS{10754706,
  author={Khan, Shafiqul Alam and Valles, Damian},
  booktitle={2024 IEEE 15th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON)}, 
  title={Deepfake Detection Using Transfer Learning}, 
  year={2024},
  volume={},
  number={},
  pages={556-562},
  keywords={Training;Deepfakes;Social networking (online);Law enforcement;Face recognition;Computational modeling;Transfer learning;Mobile communication;Security;Surges;deepfake detection;GANs;transfer learning;deep learning;identity verification},
  doi={10.1109/UEMCON62879.2024.10754706}}

License:

The licenses assosicated with the sources of the ArtiFace dataset is listed below:

Data License
Method License
ProjectedGAN MIT
StarGAN MIT
Stable Diffusion Apache-2.0
Taming Transformer MIT
CIPS MIT
FFHQ Creative Commons BY-NC-SA 4.0 license

Acknowledgment:

  • The authors would like to express their gratitude to the authors of ArtiFact dataset as this dataset is a subset of Artifact.

  • The authors also would like to express their gratitude to the authors of the methods that is used for creating ArtiFace dataset.

    Data Method Reference
    Method Reference
    FFHQ link
    Taming Transformer link
    Stable Diffusion link
    CIPS link
    StarGAN link
    ProjectedGAN link