In this project, I implemented a real-world application of Convolutional Neural Networks (CNNs) to develop an image classifier. This project was submitted as part of the requirements for the Machine Learning Engineer Nanodegree from Udacity. It also forms part of the Artificial Intelligence curriculum.
The project requires building a pipeline that can be used within a web or mobile application to process real-world, user-supplied images. Given an image of a dog, the algorithm identifies the breed associated with the image. The classifier is also capable of identifying a resembling dog breed when supplied with images of humans or closely related animals.
Along with exploring state-of-the-art convolutional neural networks classification models, this project deals with important design decisions of image classifiers, and challenges involved in piecing together a series of models designed to perform various tasks in a data processing pipeline.
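The control flow of such a pipeline can be sketched in a few lines. In the sketch below, the three callables passed in are hypothetical stand-ins for the face detector, dog detector, and breed predictor built later in the project; only the routing logic is shown:

```python
def classify_image(img_path, dog_detector, face_detector, predict_breed):
    """Top-level pipeline: route an image to the appropriate message.

    All three arguments are callables taking an image path; the first two
    return booleans, the third returns a breed name.
    """
    if dog_detector(img_path):
        return f"Dog detected; predicted breed: {predict_breed(img_path)}"
    if face_detector(img_path):
        return f"Human detected; resembling breed: {predict_breed(img_path)}"
    return "Error: neither a dog nor a human face was detected."
```

Keeping the detectors and the breed predictor behind plain callables makes each stage of the pipeline swappable and easy to test in isolation.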
- Scikit-learn
- TensorFlow with GPU support
- OpenCV
- Keras
- Imbalanced-learn
- Feather format
- Clone the following repository to obtain the required datasets.

git clone https://github.com/juanerolon/convolutional-neural-networks.git

- Download the dog dataset. Unzip the folder and place it in the repo, at location path/to/dog-project/dogImages.
- Download the human dataset. Unzip the folder and place it in the repo, at location path/to/dog-project/lfw. If you are using a Windows machine, you are encouraged to use 7zip to extract the folder.
- Download the VGG-16 bottleneck features for the dog dataset. Place them in the repo, at location path/to/dog-project/bottleneck_features.
- Obtain the necessary Python packages, and switch the Keras backend to TensorFlow.

For Mac/OSX:

conda env create -f requirements/aind-dog-mac.yml
source activate aind-dog
KERAS_BACKEND=tensorflow python -c "from keras import backend"

For Linux:

conda env create -f requirements/aind-dog-linux.yml
source activate aind-dog
KERAS_BACKEND=tensorflow python -c "from keras import backend"

For Windows:

conda env create -f requirements/aind-dog-windows.yml
activate aind-dog
set KERAS_BACKEND=tensorflow
python -c "from keras import backend"

- Open the notebook in the present repository and follow along.

jupyter notebook dog_app.ipynb
The model can be trained on a local CPU or GPU, or if needed on an Amazon Web Services EC2 GPU instance. Please refer to the following instructions for setting up a GPU instance for this project. (link for AIND students, link for MLND students)
To implement the following project we use an IDE capable of editing and running IPython notebooks. If Jupyter is installed in your Python distribution, type:
$ jupyter notebook cnn-image-classifier.ipynb
Criteria | Procedure |
---|---|
1: Assess the Human Face Detector | Obtain the percentage of the first 100 images in the dog and human face datasets with a detected human face. |
2: Assess the Human Face Detector | Assess whether Haar cascades for face detection are an appropriate technique for human detection. |
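The percentage required by criterion 1 can be computed with a small helper that works for any detector. In this sketch, `detector` is any callable mapping an image path to a boolean (e.g. a Haar-cascade face detector); the helper itself is detector-agnostic:

```python
def detection_rate(detector, image_paths):
    """Percentage of images in which `detector` reports a detection.

    `detector` maps an image path to a bool; `image_paths` is a list of
    file paths (e.g. the first 100 images of the human or dog dataset).
    """
    if not image_paths:
        return 0.0
    hits = sum(1 for path in image_paths if detector(path))
    return 100.0 * hits / len(image_paths)
```

Running this once over the first 100 human images and once over the first 100 dog images gives both numbers the rubric asks for, and makes it easy to compare alternative detectors later.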
Criteria | Procedure |
---|---|
3: Assess the Dog Detector | Obtain the percentage of the first 100 images in the dog and human face datasets with a detected dog. |
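A common way to build the dog detector, used in this project, is to run a network pre-trained on ImageNet and check the predicted class index: ImageNet categories 151 through 268 (inclusive) all correspond to dog breeds. The sketch below shows the index check; `predict_class_index` is a hypothetical stand-in for running the pre-trained model (e.g. ResNet-50) and taking the argmax of its predictions:

```python
def is_dog_prediction(class_index):
    """ImageNet classes 151-268 (inclusive) are all dog breeds."""
    return 151 <= class_index <= 268

def dog_detector(img_path, predict_class_index):
    """Detect a dog by checking whether the top ImageNet prediction
    falls in the dog-breed index range.

    `predict_class_index` is a hypothetical callable wrapping a
    pre-trained ImageNet classifier.
    """
    return is_dog_prediction(predict_class_index(img_path))
```

This reuses a general-purpose ImageNet classifier as a binary dog detector without any additional training.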
Criteria | Procedure |
---|---|
Model Architecture | Select a CNN architecture. |
Train the Model | Obtain the number of epochs used to train the algorithm. |
Test the Model | Optimize model to obtain at least 1% accuracy on the test set. |
Criteria | Procedure |
---|---|
Obtain Bottleneck Features | Download the bottleneck features corresponding to one of the Keras pre-trained models (VGG-19, ResNet-50, Inception, or Xception). |
Model Architecture | Select a model architecture. |
Model Architecture | Assess whether the chosen architecture succeeds in the classification task. |
Compile the Model | Compile the CNN architecture by specifying the loss function and optimizer. |
Train the Model | Implement a checkpointing procedure to train the model to select the model with the best validation loss. |
Load the Model with the Best Validation Loss | Load the model weights that attained the least validation loss. |
Test the Model | Obtain an accuracy of at least 60% on the test set. |
Predict Dog Breed with the Model | Implement a function that takes a file path to an image as input and returns the dog breed that is predicted by the CNN. |
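Because the bottleneck features are precomputed, the transfer-learning model reduces to a small classification head. The sketch below assumes VGG-16 bottleneck features of shape (7, 7, 512) and the 133 dog breeds of the Udacity dataset; it is a minimal head, not the only workable architecture:

```python
from tensorflow.keras import Input
from tensorflow.keras.layers import Dense, GlobalAveragePooling2D
from tensorflow.keras.models import Sequential

def build_transfer_model(bottleneck_shape=(7, 7, 512), n_classes=133):
    """Classification head trained on precomputed bottleneck features.

    Global average pooling collapses each (7, 7, 512) feature map to a
    512-dim vector, followed by a single softmax layer over the breeds.
    """
    model = Sequential([
        Input(shape=bottleneck_shape),
        GlobalAveragePooling2D(),
        Dense(n_classes, activation="softmax"),
    ])
    model.compile(loss="categorical_crossentropy",
                  optimizer="rmsprop",
                  metrics=["accuracy"])
    return model
```

For the checkpointing criterion, training would pass a `ModelCheckpoint` callback with `monitor="val_loss"` and `save_best_only=True` to `model.fit`, then reload the saved weights before testing.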
Criteria | Procedure |
---|---|
Test Algorithm | Use the CNN from Step 5 to detect dog breed. Assess whether the output for each detected image type (dog, human, other) differs from previous cases. Obtain the predicted actual (or resembling) dog breed. |
Criteria | Procedure |
---|---|
Test Algorithm on Sample Images | Test at least 6 images, including at least two human and two dog images. |
Test Algorithm on Sample Images | Assess performance of the algorithm and at least three possible points of improvement. |
Augmenting the training and/or validation set might help improve model performance.
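One of the simplest augmentations is mirroring, sketched below with plain NumPy; in practice the project would more likely use Keras's `ImageDataGenerator`, but the underlying idea is the same: generate label-preserving variants of the training images.

```python
import numpy as np

def augment_with_flips(images, labels):
    """Double a training batch by appending horizontally flipped copies.

    `images` has shape (n, height, width, channels); `labels` has
    shape (n, ...) and is repeated so each flipped image keeps the
    class of its original.
    """
    flipped = images[:, :, ::-1, :]  # reverse the width axis
    return (np.concatenate([images, flipped], axis=0),
            np.concatenate([labels, labels], axis=0))
```

Dog breed is invariant under left-right mirroring, which is what makes this particular augmentation safe here; augmentations such as large rotations or color shifts would need more care.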
Turn the code into a web app using Flask or web.py!
Overlay a Snapchat-like filter with dog ears on detected human heads. Determine where to place the ears through the use of the OpenCV face detector, which returns a bounding box for the face. It is also possible to overlay a dog nose filter, some nice tutorials for facial keypoints detection exist here.
Currently, if a dog appears 51% German Shepherd and 49% poodle, only the German Shepherd breed is returned. The algorithm may fail for every mixed-breed dog. Of course, if a dog is predicted as 99.5% Labrador, it is still worthwhile to round this to 100% and return a single breed; so, it will be necessary to find a nice balance.
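One possible balance is a margin rule: return a single breed when the top prediction dominates, and report the top contenders as a possible mix otherwise. The sketch below is illustrative; the 10% margin is an assumption, not a value from the project:

```python
def describe_breed(probs, margin=0.10, top_k=2):
    """Return one breed if the top prediction leads by `margin`,
    otherwise describe the top `top_k` breeds as a possible mix.

    `probs` maps breed name -> predicted probability.
    """
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    best_name, best_p = ranked[0]
    if len(ranked) == 1 or best_p - ranked[1][1] >= margin:
        return best_name
    return " / ".join(f"{name} ({p:.0%})" for name, p in ranked[:top_k])
```

Under this rule a 99.5% Labrador is reported simply as "Labrador", while the 51%/49% case above is reported as a German Shepherd / poodle mix.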
Perform a systematic evaluation of various methods for detecting humans and dogs in images. Provide improved methodology for the face_detector and dog_detector functions.
The present project constitutes intellectual work towards completion of Udacity's Machine Learning Engineer Nanodegree. You are free to modify and adapt the code to your needs, but please avoid submitting an exact copy of this work as your own to obtain credit on any educational platform, as doing so may constitute plagiarism on your part.