This chapter covers
- Automated LTR (re)training from user behavioral signals (clicks, etc)
- Transforming user signals into implicit LTR training data using click models
- Why raw clicks alone don’t work well to build LTR training data
- Overcoming biases in how users interact with search results
In Chapter 10, we went step-by-step to train a Learning to Rank (LTR) model. Like walking through the mechanics of building a car, we saw the underlying nuts and bolts of LTR model training. In this chapter we treat the LTR training process as a black box. In other words, we step away from LTR internals, instead treating LTR more like a self-driving car, fine tuning its trip toward a final destination.
Recall that LTR relies on accurate training data in order to be effective. LTR training data describes how users expect search results to be optimally ranked. The training data provides the directions we input into our LTR self-driving car. As you’ll see, knowing what’s relevant based on user interactions comes with many challenges. If we can overcome these challenges and gain high confidence in our training data, though, then we can build Automated Learning to Rank: a system that regularly retrains LTR to capture the latest user relevance expectations.