chapter eight

8 Image classification

This chapter covers

Understanding convolutional neural networks (ConvNets)
Using data augmentation to mitigate overfitting
Using a pretrained ConvNet for feature extraction
Fine-tuning a pretrained ConvNet

Computer vision was the first big success story of deep learning, and it led to the initial rise of deep learning between 2011 and 2015. A type of deep learning model called a convolutional neural network (ConvNet) started getting remarkably good results in image classification competitions around that time. First, Dan Ciresan won two niche competitions (the ICDAR 2011 Chinese character recognition competition and the IJCNN 2011 German traffic signs recognition competition). Then, more notably, Hinton’s group won the high-profile ImageNet Large Scale Visual Recognition Challenge in fall 2012. Many more promising results quickly followed in other computer vision tasks.

8.1 Introduction to ConvNets

8.1.1 The convolution operation

8.1.2 The max-pooling operation

8.2 Training a ConvNet from scratch on a small dataset

8.2.1 The relevance of deep learning for small-data problems

8.2.2 Downloading the data

8.2.3 Building your model

8.2.4 Data preprocessing

8.2.5 Using data augmentation

8.3 Using a pretrained model

8.3.1 Feature extraction with a pretrained model

8.3.2 Fine-tuning a pretrained model

Summary