The Central Role of the Propensity Score

Paul Rosenbaum and Donald Rubin (1983)

Core Contribution

The propensity score is the treatment probability

\[ e(x)=\Pr(D=1\mid X=x). \]

Rosenbaum and Rubin showed that if treatment is ignorable given covariates,

\[ (Y(1),Y(0))\perp D\mid X, \]

then it is also ignorable given the scalar propensity score:

\[ (Y(1),Y(0))\perp D\mid e(X). \]

This turns a high-dimensional balancing problem into a one-dimensional balancing problem.

Minimal Implementation

Simulate treatment assignment from the propensity score \(e(X)=\Pr(D=1\mid X)\).

n = 700
x1, x2 = rng.normal(size=(2, n))
e = special.expit(-0.2 + 1.0 * x1 - 0.8 * x2)
d = rng.binomial(1, e)
tau = 1.4
y = tau * d + 1.2 * x1 + 0.7 * x2 + rng.normal(size=n)
X = np.c_[np.ones(n), x1, x2]
d.mean()

np.float64(0.44)

Estimate \(\hat e(X)\) by logistic regression.

def nll(b):
    xb = X @ b
    return -np.sum(d * xb - np.logaddexp(0, xb))

b = optimize.minimize(nll, np.zeros(3)).x
ehat = special.expit(X @ b)
ehat[:5]

array([0.69465984, 0.17941007, 0.33860013, 0.35399138, 0.09402754])

Use inverse-propensity weights to estimate the ATE and check covariate balance.

w = d / ehat + (1 - d) / (1 - ehat)
ate_ipw = np.mean(d * y / ehat - (1 - d) * y / (1 - ehat))

def smd(v, weights=None):
    if weights is None:
        mt, mc = v[d == 1].mean(), v[d == 0].mean()
    else:
        mt = np.average(v[d == 1], weights=weights[d == 1])
        mc = np.average(v[d == 0], weights=weights[d == 0])
    return (mt - mc) / v.std()

before = [smd(x1), smd(x2)]
after = [smd(x1, w), smd(x2, w)]
ate_ipw, before, after

(np.float64(1.247090179005577),
 [np.float64(0.7340805837422606), np.float64(-0.6405355858540289)],
 [np.float64(-0.011581466176673293), np.float64(-0.03605703076593948)])

Plot standardized mean differences before and after weighting.

fig, ax = plt.subplots(figsize=(6, 3.4))
loc = np.arange(2)
ax.bar(loc - 0.18, before, width=0.35, label="raw")
ax.bar(loc + 0.18, after, width=0.35, label="IPW")
ax.axhline(0, color="white", lw=0.8)
ax.set(xticks=loc, xticklabels=["x1", "x2"], ylabel="standardized mean difference",
       title=f"IPW ATE={ate_ipw:.2f}, truth={tau:.2f}")
ax.legend()
plt.show()

Inverse propensity weighting reduces covariate imbalance.

Implementations

DoWhy, EconML, MatchIt, WeightIt