In all previous chapters, we’ve focused on various aspects of writing Airflow code, mostly demonstrated with examples using generic operators such as the BashOperator
and PythonOperator
. While these operators can run arbitrary code and thus could run any workload, the Airflow project also holds other operators for more specific use cases, for example, running a query on a Postgres database. These operators have one and only one specific use case, such as running a query. As a result, they are easy to use by simply providing the query to the operator, and the operator internally handles the querying logic. With a PythonOperator
, you would have to write such querying logic yourself.
For the record, with the phrase external system we mean any technology other than Airflow and the machine Airflow is running on. This could be, for example, Microsoft Azure Blob Storage, an Apache Spark cluster, or a Google BigQuery data warehouse.