2 What is MLOps ?
This chapter covers
- Understanding MLOps and its role in production ML
- Key challenges in building reliable ML systems
- How MLOps differs from traditional DevOps
- Building confidence through structured ML processes
In Chapter 1, we introduced the ML lifecycle and the foundational skills needed to become an effective ML Engineer. Now, let's dig deeper into MLOps - the practices and principles that will help you reliably deliver value through ML systems. Machine learning and the models are often not the end product of an organization, but rather a means to an end.
The gap between business value generation, requirements and necessary infrastructure is the primary reason why ML and by extension MLOps is hard. Very few companies truly do research on model development and instead reuse architectures and train/adapt off the shelf models for specific domains and problem sets. Availability of comprehensive open source libraries like Huggingface also potentially make modeling trivial. After defining a problem and identifying an architecture to solve the problem statement, the hard questions come into focus. How would the model be trained? How would data get to the model, how would it interact with the other services, where would it be run and how do we make sure the model is accurate over time?