
Distilling Knowledge in Neural Networks


The term "Knowledge Distillation" (a.k.a Teacher-Student Model) was first introduced by (Bu-cilu et al., 2006; Ba & Caruana,2014) and has been popularized by (Hinton et al., 2015), as a way to let smaller deep learning models learn how bigger ones generalize to large datasets, hence increase the performance of the smaller one.

In this notebook I'll show you an intuitive way to implement Knowledge Distillation in Keras.

In this notebook:

  • I use the CIFAR10 dataset from tf.keras.datasets (a loading sketch is shown after this list)
  • I use a Colab Notebook's K80 GPU for training (which saves a lot of time & money)
  • I use Keras as the high-level API of TensorFlow 2.0 (tf.keras)
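
As a reference for the first point, loading and preparing CIFAR10 looks roughly like this (the normalization and one-hot encoding shown here are assumptions; the notebook's preprocessing may differ):

```python
import tensorflow as tf

# CIFAR10: 50,000 training and 10,000 test images, 32x32x3, 10 classes.
(x_train, y_train), (x_test, y_test) = tf.keras.datasets.cifar10.load_data()

# Scale pixel values to [0, 1] and one-hot encode the labels.
x_train, x_test = x_train / 255.0, x_test / 255.0
y_train = tf.keras.utils.to_categorical(y_train, 10)
y_test = tf.keras.utils.to_categorical(y_test, 10)
```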

Architectures

Teacher Model

A simple CNN in which every convolution layer is followed by a pooling layer, ending in a 256-unit dense layer and a softmax output.

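A possible tf.keras sketch of such a teacher is shown below; the filter counts and kernel sizes are illustrative assumptions, not the exact configuration from the notebook.

```python
from tensorflow import keras
from tensorflow.keras import layers

# Conv -> pool blocks, ending in a 256-unit dense layer and a softmax output.
teacher = keras.Sequential([
    layers.Conv2D(32, 3, activation="relu", padding="same", input_shape=(32, 32, 3)),
    layers.MaxPooling2D(),
    layers.Conv2D(64, 3, activation="relu", padding="same"),
    layers.MaxPooling2D(),
    layers.Conv2D(128, 3, activation="relu", padding="same"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(256, activation="relu"),
    layers.Dense(10, activation="softmax"),
])
```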

Student Model

A minimal vanilla neural network with one 64-unit hidden layer followed by a softmax output. The same architecture is used for both the student model and the standalone baseline.

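In tf.keras this is roughly the following (the ReLU hidden activation is an assumption):

```python
from tensorflow import keras
from tensorflow.keras import layers

# Flatten the 32x32x3 image, one 64-unit hidden layer, softmax over 10 classes.
student = keras.Sequential([
    layers.Flatten(input_shape=(32, 32, 3)),
    layers.Dense(64, activation="relu"),
    layers.Dense(10, activation="softmax"),
])
```

With CIFAR10's 32×32×3 inputs this comes to roughly 197K parameters, consistent with the results table below.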

Results

Your results may differ if you reproduce this notebook, as I couldn't find a way to fix the random seed in Keras! :D

| Model | Test Accuracy | No. Parameters | Training speed (per sample) | Epochs |
| --- | --- | --- | --- | --- |
| Teacher Model | 70.75 % | 1.06M | 59 μs | 20 |
| Student Model | 48.61 % | 197K | 42 μs | 50 |
| Standalone Model | 39.34 % | 197K | 34 μs | 50 |
