contents

 

preface

acknowledgments

about this book

about the authors

about the cover illustration

  

Part 1.  Getting started

  1  Meet Apache Airflow

  1.1  Introducing data pipelines

Data pipelines as graphs

Executing a pipeline graph

Pipeline graphs vs. sequential scripts

Running pipeline using workflow managers

  1.2  Introducing Airflow

Defining pipelines flexibly in (Python) code

Scheduling and executing pipelines

Monitoring and handling failures

Incremental loading and backfilling

  1.3  When to use Airflow

Reasons to choose Airflow

Reasons not to choose Airflow

  1.4  The rest of this book

  2  Anatomy of an Airflow DAG

  2.1  Collecting data from numerous sources

Exploring the data

  2.2  Writing your first Airflow DAG