This chapter covers
Chapter 1 highlighted the key role played by data in a machine learning project. As we saw, training the learning algorithm on a larger quantity of high-quality data increases the accuracy of the model more than fine tuning or replacing the algorithm itself. In an interview about big data [Coyle, 2016], Greg Linden, who invented the now widely used item-to-item collaborative filtering algorithm for Amazon, replied: