This chapter covers
- Understanding automatic differentiation
- Using automatic differentiation with PyTorch tensors
- Getting started with PyTorch SGD and Adam optimizers
- Applying PyTorch to linear regression with gradient descent
- Using data set batches for gradient descent
- Using the PyTorch Dataset and DataLoader utility classes for batches
In chapter 5, you learned about the tensor, a core PyTorch data structure for n-dimensional arrays. The chapter illustrated the significant performance advantages of PyTorch tensors over native Python data structures for arrays and introduced PyTorch APIs for creating tensors as well as performing common operations on one or more tensors.
This chapter introduces another key feature of PyTorch tensors: support for calculating gradients using automatic differentiation (autodiff). Described as one of the major advances in scientific computing since 1970, autodiff is surprisingly simple; it was invented by Seppo Linnainmaa, then a master's student at the University of Helsinki.1 The first part of this chapter introduces you to the fundamentals of autodiff by showing how you can implement the core algorithm for a scalar tensor using basic Python, along the lines of the sketch below.
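To make the idea concrete before diving in, here is a minimal sketch of reverse-mode autodiff for a scalar value in plain Python. The `Scalar` class and its method names are illustrative assumptions for this preview, not the implementation developed later in the chapter:

```python
class Scalar:
    """Hypothetical minimal scalar that records operations for autodiff
    (an illustrative sketch, not the class built later in this chapter)."""
    def __init__(self, val, parents=()):
        self.val = val          # the value of this node
        self.grad = 0.0         # accumulated gradient, filled in by backward()
        self.parents = parents  # pairs of (parent node, local derivative)

    def __add__(self, other):
        # d(a + b)/da = 1 and d(a + b)/db = 1
        return Scalar(self.val + other.val, ((self, 1.0), (other, 1.0)))

    def __mul__(self, other):
        # d(a * b)/da = b and d(a * b)/db = a
        return Scalar(self.val * other.val,
                      ((self, other.val), (other, self.val)))

    def backward(self, grad=1.0):
        # Apply the chain rule, propagating gradients to parent nodes.
        self.grad += grad
        for parent, local_grad in self.parents:
            parent.backward(grad * local_grad)

x = Scalar(3.0)
y = x * x + x      # y = x**2 + x
y.backward()       # reverse-mode autodiff from y back to x
print(x.grad)      # 7.0, since dy/dx = 2*x + 1 = 7 at x = 3
```

PyTorch tensors provide the same capability through the `requires_grad` flag and the `backward()` method, as this short preview of the same computation shows:

```python
import torch

x = torch.tensor(3.0, requires_grad=True)  # ask PyTorch to track gradients of x
y = x * x + x                              # PyTorch records the operations on x
y.backward()                               # autodiff computes dy/dx at x = 3.0
print(x.grad)                              # tensor(7.), matching the sketch above
```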