chapter four

4 Content-Based Recommendation

This chapter covers

An overview of the most common recommendation techniques
How to design proper graph models for a content-based recommendation engine
How to import existing (not-graph) datasets in the graph models designed
How to implement working content-based recommendation engines

Representation is one of the most complex and compelling tasks in machine learning, and computer science in general. Pedro Domingos, a CS professor at the University of Washington, published an article in 2012 [Domingos, 2012] in which he decomposes machine learning into three main components: representation, evaluation, and optimization.

Representation, specifically, affects three core aspects of a machine learning project’s lifecycle:

The formal language (or schema) in which a training dataset is expressed before passing it as input to the learning process
The way in which the result of the learning process—the predictive model—is stored
How, during the prediction phase, the training data and the prediction model are accessed during forecasting

All of these are influenced by the learning algorithm used to infer the generalization from the observed examples in the training dataset, and they affect the overall performance in terms of forecast accuracy and training and prediction performance (speed).

4.1 Recommendation engines—An introduction

4.2 Content-based recommendations

4.2.1 Representing item features

4 Content-Based Recommendation

This chapter covers

4.1 Recommendation engines—An introduction

4.2 Content-based recommendations

4.2.1 Representing item features

Exercises

4.2.2 User modeling

Exercises

4.2.3 Providing recommendations

Exercise

Exercises

Exercises

Exercise

4.2.4 Advantages of the graph approach

4.3 Summary

4.4 References