chapter eight

8 Model Training and Validation: Part 1

This chapter covers

Developing the Model Training and Validation components
Capturing metrics and artifacts in tracking frameworks
Adding the Model Training and Validation components to pipelines
Different methods to access training and evaluation data

In the previous chapter, we've laid the groundwork to download the identity card dataset that includes the images and labels, process them into a format that YOLO expects, then divided them up into train, test and validation splits. Through that process, you've also learnt how to create your first Kubeflow components and pipelines.

The next step, and arguably the most fun, is model training and evaluation. In this chapter, we'll carry on extending the data preparation pipeline from the previous chapter and bolt on training and evaluation components. In your initial project, you'll develop an ID card object detection system utilizing the popular YOLO (You Only Look Once) algorithm. Having mastered this concept, you'll then apply similar techniques to design a movie recommendation system.

8.1 Training an Object Detection Model

When we left off at the previous chapter, we had six lists, containing the cross product of either train, test, validation splits of either file names of images or file names of lists. These lists should then be passed to the training component that we will outline in this chapter. The training component then:

8.1.1 Downloading with MinIO

8 Model Training and Validation: Part 1

This chapter covers

8.1 Training an Object Detection Model

8.1.1 Downloading with MinIO

8.1.2 Training YOLO on a Custom Dataset

8.1.3 Training the Model

8.1.4 Creating the Training Component

8.1.5 Creating the Validation Component

8.1.6 Creating the Pipeline

8.1.7 Executing the Pipeline

8.1.8 Trying Out the Weights Locally

8.2 Summary