7 Data Analysis & Preparation
This chapter covers
- Introducing the Capstone Projects
- Building and launching images for Kubeflow notebooks
- Using Kubeflow notebooks for data analysis
- Data passing in Kubeflow Pipelines
- Writing Kubeflow components that pass data
- Developing the Data Preparation pipeline for Object Detection, including downloading the dataset and splitting it into train, validation, and test sets
This chapter kicks off two capstone projects: one centered on detecting identity cards and the other on recommending movies. Upcoming chapters will explore the subsequent stages of the ML pipeline: model training, evaluation, and serving. Because our focus is on practical application, we'll apply each concept directly to these projects as we go. Working through concrete, real-world implementations should make the concepts easier to understand and show why they matter.
Capstone Project 1: Identity Card Detection
The landscape of machine learning is ever-evolving, with new developments surfacing every other week. When deep learning took center stage, innovations such as new versions of YOLO (You Only Look Once) and ResNet became the talk of the town.