chapter thirteen

13 Training a classification model to detect suspected tumors

This chapter covers

Using PyTorch DataLoaders to load data
Implementing a model that performs classification on our CT data
Setting up the basic skeleton for our application
Adding logging and displaying metrics during training

In the previous chapters, we set the stage for our cancer-detection project. We covered medical details of lung cancer, took a look at the main data sources we will use for our project, and transformed our raw CT scans into a PyTorch Dataset instance. Now that we have a dataset, we can easily consume our training data. So let’s do that!

13.1 A foundational model and training loop

We’re going to do two main things in this chapter. We’ll start by building the nodule classification model and training loop that will be the foundation that the rest of part 2 uses to explore the larger project. To do that, we’ll use the Ct and LunaDataset classes we implemented in chapter 12 to feed DataLoader instances. Those instances, in turn, will feed our classification model with data via training and validation loops.

We’ll finish the chapter by using the results from running that training loop to introduce one of the hardest challenges in this part of the book: how to get high-quality results from messy, limited data. In later chapters, we’ll explore the specific ways in which our data is limited, as well as mitigate those limitations.

13.2 The main entry point for our application

13.3 Pretraining setup and initialization

13.3.1 Initializing the model and optimizer

13.3.2 Care and feeding of data loaders

13.4 Our first-pass neural network design

13.4.1 The core convolutions

13.4.2 The full model

13.5 Training and validating the model

13.5.1 The computeBatchLoss function

13.5.2 The validation loop is similar

13.6 Outputting performance metrics

13.6.1 The logMetrics function

13.7 Running the training script

13.7.1 Needed data for training

13.7.2 Interlude: The tqdm function

13.8 Evaluating the model: Getting 99.7% correct means we’re done, right?

13.9 Graphing training metrics with TensorBoard

13.9.1 Running TensorBoard