chapter eleven
11 Running tasks in containers
This chapter covers
- Identifying some challenges involved in managing Airflow deployments
- Examining how containerized approaches can help simplify Airflow deployments
- Running containerized tasks in Airflow on Docker
- Establishing a high-level overview of workflows in developing containerized DAGs
Previously, we implemented several DAGs using different Airflow operators, each specialized to perform a specific type of task. Although operators are powerful tools, they can also pose challenges for deploying and maintaining your DAGs if you use a wide variety of different operators across your pipelines. Here, we explore some of these challenges and look at how a containerized workflow using Docker and/or Kubernetes can simplify your workflow.
11.1 Challenges of many different operators
Operators are arguably one of the strong features of Airflow, as they provide great flexibility to coordinate jobs across many different types of systems. However, creating and managing DAGs with many different operators can be quite challenging due to the complexity involved.