This chapter covers:
- Using machine learning to build generalized relevance ranking
- Ranking within the search engine using machine learning models
- How learning to rank is different from other machine learning methods
- Building a robust and generalizable ranking model
In this chapter, we’ll explore Learning to Rank (LTR): using machine learning to create a generalized ranking function. We’ll start by seeing where LTR compares to solutions in previous chapters. We’ll then begin our explorations with simple models using Solr’s LTR capabilities, walking through the steps of training and ranking search results with an LTR model. Finally, we’ll close with discussion of the different choices and options along the path to performing LTR.
We’ve seen from chapter 4 we can use collaborative filtering to predict which documents are likely to satisfy specific queries, based on similar queries. Consider the two red shoe
and scarlett shoes
queries in Table 10.1:
Table 10.1. Comparing the success of different products between the query red shoe
and the query scarlett shoes
Product | q=Red Shoe | q=Scarlett Shoes |
CTR=0.9 |
CTR=0.9 |
|
CTR=0.01 |
CTR=0.01 |
|
CTR=0.5 |
(Not Returned) |
|
CTR=0.01 |
CTR=0.01 |