This chapter covers
- Validating whether removing bad values improves the model performance
- Validating whether embeddings for the categorical columns improve the model performance
- Possible approaches to improving the performance of the model
- Comparing the performance of the deep learning model with a non-deep-learning model
In chapter 6, we trained the deep learning model and did a series of experiments to measure and improve its performance. In this chapter, we will go through a set of additional experiments to validate two key aspects of the model: removing bad values (a step that we took as part of the data preparation described in chapters 3 and 4) and including embeddings for the categorical columns (as described in chapter 5). Then we will describe an experiment to compare the deep learning solution using the streetcar delay prediction deep learning model with a solution that uses a non-deep-learning approach called XGBoost.
When you have cloned the GitHub repo (http://mng.bz/v95x) associated with this book, you’ll find the code related to the experiments in the notebooks subdirectory. The following listing shows the files used in the experiments described in this chapter.