chapter eleven
11 Running tasks in containers
This chapter covers
- Identifying challenges in managing Airflow deployments
- Examining how containerized approaches can help simplify Airflow deployments
- Running containerized tasks in Airflow on Docker
- Establishing a high-level overview of workflows in developing containerized DAGs
Previously, we implemented several DAGs using different Airflow operators, each specialized to perform a specific type of task. Although operators are powerful tools, they can also pose challenges in deploying and maintaining your DAGs if you use a wide variety of operators across your pipelines. This chapter explores some of these challenges and examines how a containerized workflow using Docker and/or Kubernetes can simplify your workflow.
11.1 Challenges of different operators
Operators are arguably some of the strongest features of Airflow because they give you great flexibility in coordinating jobs across different types of systems. But creating and managing DAGs with many operators can be quite challenging due to the complexity involved.