Appendix D. Proof for doubly robust ATE^aipw estimator
In Chapter 8 we introduced the ATE^aipw estimator and saw that it is doubly robust. If you are curious about why ATE^aipw is doubly robust, you can find the proof here. We need to check the following two conditions:
- If the models from the T-learner are unbiased, f0(c) = E[Y | c, T=0] and f1(c) = E[Y | c, T=1], then ATE^aipw is also unbiased, that is, E[ATE^aipw] = ATE.
- If the propensity score is unbiased, s(c) = P(T=1 | c), then ATE^aipw is also unbiased, that is, E[ATE^aipw] = ATE.
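Before going into the proof, the two conditions can be checked numerically. The following is a minimal simulation sketch; the data-generating process (a uniform confounder, propensity s(c) = 0.25 + 0.5c, and a constant treatment effect of 1) is invented for illustration and is not taken from the book. It shows that the AIPW estimate stays close to the true ATE when either the outcome models or the propensity score is misspecified, as long as the other is correct:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000

# Hypothetical data-generating process (for illustration only):
# confounder C, true propensity s(c) = 0.25 + 0.5*c,
# potential outcomes Y(0) = 2C + noise, Y(1) = 2C + 1 + noise  =>  true ATE = 1.
C = rng.uniform(size=n)
s_true = 0.25 + 0.5 * C
T = rng.binomial(1, s_true)
Y = 2 * C + T * 1.0 + rng.normal(scale=0.1, size=n)

def aipw(f0, f1, s):
    """AIPW estimate of the ATE from outcome models f0, f1 and propensity s."""
    return np.mean(
        f1 - f0
        + T * (Y - f1) / s
        - (1 - T) * (Y - f0) / (1 - s)
    )

f0_true, f1_true = 2 * C, 2 * C + 1.0      # correct outcome models
f0_bad, f1_bad = np.zeros(n), np.zeros(n)  # badly misspecified outcome models
s_bad = np.full(n, 0.5)                    # misspecified propensity score

# Correct outcome models + wrong propensity: still close to ATE = 1.
est_1 = aipw(f0_true, f1_true, s_bad)
# Wrong outcome models + correct propensity: still close to ATE = 1.
est_2 = aipw(f0_bad, f1_bad, s_true)
print(est_1, est_2)
```

With either half of the model specified correctly, both estimates land within sampling error of the true ATE of 1, which is exactly the doubly robust behavior the proof below establishes.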
D.1 DR property with respect to the T-learner
We will first check the DR property for the T-learner. So, assume that the models from the T-learner are unbiased. First, notice that ATE^aipw can be expressed in terms of the T-learner estimator ATE^t. Consider the random variables Ti, Ci and Yi. Then
\begin{equation}
\widehat{ATE}_{aipw} = \frac{1}{n} \sum_i \left[f_1(C_i) - f_0(C_i)\right] + \frac{1}{n} \sum_i \left[\frac{(Y_i - f_1(C_i))\,T_i}{s(C_i)} - \frac{(Y_i - f_0(C_i))(1-T_i)}{1 - s(C_i)}\right] = \widehat{ATE}_t + \widehat{RES}_t,
\end{equation}
where the first sum is the T-learner estimate ATE^t and the second sum defines the residual term RES^t.
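This decomposition is purely algebraic, so it holds exactly for any inputs whatsoever. As a quick sanity check, the following sketch (with arbitrary placeholder data, not the book's code) verifies that the AIPW estimate equals the T-learner estimate plus the residual term:

```python
import numpy as np

# Arbitrary placeholder data: the identity below is purely algebraic,
# so these values need not come from any causal model.
rng = np.random.default_rng(1)
n = 1_000
C = rng.normal(size=n)                      # confounder
T = rng.binomial(1, 0.5, size=n)            # treatment indicator
Y = rng.normal(size=n)                      # outcome
f0, f1 = np.sin(C), np.cos(C)               # arbitrary outcome models
s = rng.uniform(0.1, 0.9, size=n)           # arbitrary propensity scores

ate_t = np.mean(f1 - f0)                             # T-learner estimate
res_t = np.mean(T * (Y - f1) / s
                - (1 - T) * (Y - f0) / (1 - s))      # residual term
ate_aipw = np.mean(f1 - f0
                   + T * (Y - f1) / s
                   - (1 - T) * (Y - f0) / (1 - s))   # AIPW estimate

assert np.isclose(ate_aipw, ate_t + res_t)
```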
If we can show that the residual term has expectation zero, E[RES^t] = 0, then we are done, because in that case
\begin{equation}
E[\widehat{ATE}_{aipw}] = E[\widehat{ATE}_t] + E[\widehat{RES}_t] = E[\widehat{ATE}_t] = ATE,
\end{equation}
where the last equality uses the assumption that the models from the T-learner are unbiased.
Let’s see that E[RES^t] = 0. For simplicity, we drop the index i and compute the expectation of a single summand. Using the law of total expectation, conditioning on C and T, we get
\begin{equation}
E\left[\frac{(Y - f_1(C))\,T}{s(C)}\right] = E\left[\frac{T}{s(C)}\,\big(E[Y \mid C, T] - f_1(C)\big)\right] = 0,
\end{equation}
since the term vanishes when T = 0, and when T = 1 we have E[Y | C, T=1] = f_1(C) by the unbiasedness of the T-learner. The same argument applied to the second term gives E[(Y - f_0(C))(1 - T)/(1 - s(C))] = 0, and therefore E[RES^t] = 0.