This project was undertaken as part of the Unstructured Data course within the Big Data Master's Degree program at Comillas ICAI University. The GitHub repo created for this project can be found here.
The team responsible for the project includes:
Name | Email |
---|---|
Jorge Ayuso Martínez | jorgeayusomartinez@alu.comillas.edu |
Carlota Monedero Herranz | carlotamoh@alu.comillas.edu |
José Manuel Vega Gradit | josemanuel.vega@alu.comillas.edu |
The primary objective of this project is to build an Art Classifier using Deep Learning techniques. We use data from the WikiArt project, via a dataset available on Kaggle, which we adapted for training Deep Learning models that classify different art styles.
Due to practical and storage limitations, we classify only four of the many available art movements: Romanticism, Realism, Renaissance, and Baroque.
To ensure a balanced dataset, we trimmed the original dataset down to 5,000 images for each of the four styles. This keeps training times reasonable while still leaving enough data for effective model training and evaluation. The modified dataset can be found here. The validation and test sets are stored in their corresponding .zip files, whereas the training set is stored in the train folder, which contains four .zip files, one for each art style.
Our project begins with a basic CNN architecture built from scratch, which serves as our base model. This network suffers from a major issue: significant overfitting to the training data. To address it, we take an incremental approach, building on top of the initial architecture with several techniques: dropout, batch normalization, and data augmentation.
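As a rough illustration of what this incremental approach looks like in code, the sketch below combines the three techniques in a small Keras CNN. The input size, layer widths, and augmentation parameters are assumptions made for illustration, not the exact architecture used in the project.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

# Data augmentation: random transformations applied only during training.
data_augmentation = tf.keras.Sequential([
    layers.RandomFlip("horizontal"),
    layers.RandomRotation(0.1),
    layers.RandomZoom(0.1),
])

model = models.Sequential([
    layers.Input(shape=(224, 224, 3)),   # assumed input size
    data_augmentation,
    layers.Rescaling(1.0 / 255),
    layers.Conv2D(32, 3, activation="relu"),
    layers.BatchNormalization(),          # batch normalization after conv
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, activation="relu"),
    layers.BatchNormalization(),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dropout(0.5),                  # dropout to curb overfitting
    layers.Dense(128, activation="relu"),
    layers.Dense(4, activation="softmax"),  # the four art styles
])

model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```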
After exploring our custom-built model, we shift our focus to transfer learning and examine two different philosophies: feature extraction and fine-tuning.
- For feature extraction, we use a pre-trained model as a fixed feature extractor; the features it produces serve as input for training a new classifier on the target task.
- For fine-tuning, we unfreeze the last few layers of the convolutional base and retrain them along with the classifier. (A sketch of both approaches follows the model list below.)
To accomplish this, we evaluate three models:
- ResNet50
- VGG16 & VGG19
- MobileNet
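The sketch below illustrates both philosophies with ResNet50 in Keras; the same pattern applies to the VGG and MobileNet variants. The classifier head and the number of unfrozen layers are illustrative assumptions, not the project's exact settings.

```python
from tensorflow.keras import layers, models
from tensorflow.keras.applications import ResNet50

# Pre-trained convolutional base (ImageNet weights, no top classifier).
base = ResNet50(weights="imagenet", include_top=False,
                input_shape=(224, 224, 3), pooling="avg")

# Feature extraction: freeze the entire convolutional base.
base.trainable = False

# Fine-tuning (alternative): unfreeze only the last few layers.
# base.trainable = True
# for layer in base.layers[:-10]:   # "-10" is an assumed cutoff
#     layer.trainable = False

model = models.Sequential([
    base,
    layers.Dense(128, activation="relu"),
    layers.Dense(4, activation="softmax"),  # the four art styles
])
```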
Finally, we explore a different network architecture using Hugging Face's Transformers library.
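As a hedged example of what loading such a model might look like, the snippet below instantiates a Vision Transformer (ViT) with a four-class head. The checkpoint name is a common public one and an assumption on our part, not necessarily the one used in the project.

```python
from PIL import Image
from transformers import ViTForImageClassification, ViTImageProcessor

# Assumption: a ViT checkpoint from the Hugging Face Hub; the project may
# use a different Transformers architecture or checkpoint.
checkpoint = "google/vit-base-patch16-224-in21k"
processor = ViTImageProcessor.from_pretrained(checkpoint)
model = ViTForImageClassification.from_pretrained(
    checkpoint,
    num_labels=4,  # the four art styles; the head is freshly initialized
)

# Classify a single image (the path is hypothetical).
inputs = processor(images=Image.open("painting.jpg"), return_tensors="pt")
logits = model(**inputs).logits  # shape: (1, 4)
```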
Throughout the project, we plot accuracy and loss across epochs for each model, which lets us spot overfitting, compare approaches, and interpret the results correctly.
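Monitoring of this kind can be produced directly from a Keras training history. In this minimal sketch, `model`, `train_ds`, and `val_ds` are placeholders for a compiled model and its datasets, which are assumptions here.

```python
import matplotlib.pyplot as plt

# `fit` returns a History object with per-epoch metrics we can plot.
history = model.fit(train_ds, validation_data=val_ds, epochs=20)

fig, (ax_acc, ax_loss) = plt.subplots(1, 2, figsize=(10, 4))
ax_acc.plot(history.history["accuracy"], label="train")
ax_acc.plot(history.history["val_accuracy"], label="validation")
ax_acc.set(title="Accuracy per epoch", xlabel="epoch")
ax_acc.legend()
ax_loss.plot(history.history["loss"], label="train")
ax_loss.plot(history.history["val_loss"], label="validation")
ax_loss.set(title="Loss per epoch", xlabel="epoch")
ax_loss.legend()
plt.show()
```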