1 Setting the stage for offline evaluations
This chapter covers
- An introduction to offline evaluations
- Exploring the AI development lifecycle
- Highlighting the impact on A/B testing
- Using offline evaluations in real world product applications
- Explaining why offline evaluations are incredibly valuable
Recommender systems. Search algorithms. Language models. Computer vision. Predictive analytics. These AI systems represent the backbone of modern digital products. They each carry their own impact, influence, engineering complexity, training methodology, and of course, evaluation strategy. They can appear anywhere—in a streaming app suggesting your next binge-watch, an e-commerce website ranking search results, a social media feed curating content, a chatbot answering customer questions, or a financial app detecting fraudulent transactions.