11 Building a causal inference workflow
This chapter covers
- Building a causal analysis workflow
- Estimating causal effects with DoWhy
- Estimating causal effects using machine learning methods
- Causal inference with causal latent variable models
Recall the causal inference workflow I introduced in Chapter 10, shows again in Figure 11.1
In this chapter, we’ll focus on building out this workflow in full. We’ll focus on one type of query in particular – causal effects. But the workflow generalizes to all causal queries.
We focus on causal effect inference, namely estimation of average treatment effects (ATEs) and conditional average treatment effects (CATEs) because they are the most popular causal query.
11.1 Step 1: Select the query
11.2 Step 2: Build the Model
11.3 Step 3: Identify the estimand
11.3.1 The backdoor adjustment estimand
11.3.2 The instrumental variable estimand
11.3.3 The front-door adjustment estimand
11.3.4 Choosing estimands and reducing “DAG anxiety”
11.3.5 When you don’t have identification
11.4 Step 4: Estimate the estimand
11.4.1 Linear regression estimation of the backdoor estimand
11.4.2 Propensity score estimators of the backdoor estimand
11.4.3 Backdoor estimation with machine learning
11.4.4 Front-door estimation
11.4.5 Instrumental variable methods
11.4.6 Comparing and selecting estimators
11.5 Step 5. Refutation
11.5.1 Data size reduction
11.5.2 Adding a dummy confounder
11.5.3 Replacing treatment with a dummy
11.5.4 Replacing outcome with a dummy outcome
11.5.5 Testing robustness to unmodeled confounders
11.6 Causal Inference with Causal Generative Models
11.6.1 Transformations for causal inference
11.6.2 Steps for inferring a causal query with a causal generative model
11.6.3 Extending inference to estimation
11.6.4 A VAE-inspired model for causal inference
11.7 Summary