Confounding
Classic confounding situation: X affects treatment A and affects outcome Y
If X is observed, we can analyze data using: Matching, Propensity Score Matching, and IPTW
Even if there are risk factors, V, it is valid to simply control for X
Suppose there are unmeasured/unobserved variables, U, that affect A and Y. Then we have unmeasured confounding.
Ignorability assumption violated:
Given X, we cannot obtain a valid matching sample (pseudo-population)
Instrument Variables
The instrument variables (IV) approach is an alternative causal inference method that does not rely on the ignorability assumption.
Encouragement Design
A: Smoking during Pregnancy (yes/no)
Y: Birth Weight
X: Parity, Mother's Age, Weight, etc.
Z: Randomize to either receive encouragement to stop smoking (Z=1) or receive usual care (Z=0)
The encouragement will not affect the baby's birth weight through any path other than stopping smoking.
Intention-to-treat Analysis: Focus on the causal effect of encouragement
Note it is NOT a causal effect of smoking.
Randomized Trials with Noncompliance
Randomize Trial
Z: randomization to treatment (1 if randomized to treatment, 0 otherwise)
A: treatment received (1 if receive treatment, 0 otherwise)
Y: outcome
Typically, NOT everyone assigned treatment will receive the treatment (e.g., Non-compliance)
Non-compliance makes a randomized trial like an observational study
Every subject will be assigned treatment Z and received treatment A.
Every subject has two potential values of treatment:
: It could be treatment (i.e., ) or no treatment (i.e., ). It is the value of treatment if randomized to Z=0. : It could be treatment or no treatment. It is the value of treatment if randomized to Z=1.
The average causal effect of treatment assignment on treatment received:
Estimable from the observed data:
The average causal effect of treatment assignment on the outcome:
This is the average value of the outcome if everyone had been assigned to receive treatment minus the average outcome if no one had been assigned to receive the treatment.
It is the intention-to-treat effect.
If perfect compliance, it would be equal to the causal effect of treatment.
Estimable from the observed data:
Compliance Classes
Label | Implication | ||
---|---|---|---|
0 | 0 | Never-takers | No variation in treatment received and no information about causality |
0 | 1 | Compliers | Treatment received is randomized by design. Hope we can learn something. |
1 | 0 | Defiers | Assume this subpopulation is small or does not exist. |
1 | 1 | Always-takers | No variation in treatment received and no information about causality |
Causal Effects
A motivation for using IV methods is concern about possible unmeasured confounding.
If there is unmeasured confounding, we cannot marginalize over alll confounders via matching, IPTW, etc.
Both matching and IPTW create a balanced sample or pseudo-population to measure the causal effect averaged over the whole population.
IV methods do not focus on the average causal effect for the population. They focus on a local average treatment effect (LATE) evaluated from a subpopulation of compliers.
- It is also known as the complier average causal effect (CACE).
Observed Data
Class | ||||
---|---|---|---|---|
0 | 0 | 0 | ? | Never-takers or Compliers |
0 | 1 | 1 | ? | Always-takers or Defiers |
1 | 0 | ? | 0 | Never-takers or Defiers |
1 | 1 | ? | 1 | Always-takers or Compliers |
Compliance classes are also known as principal strata, which are latent.
Without additional assumptions, we cannot identify each subject into one of four categories. We can only narrow it down to two categories, however.
Monotonicity Assumption
A variable is an instrumental variable (IV) if
It is associated with the treatment;
It affects the outcome only through its effect on treatment (also known as the exclusion restriction).
Randomization should not be affecting the outcome, except possibly through the impact that randomization had on what treatment somebody received.
A valid IV should not affect unmeasured confounders.
The monotonicity assumption is that there are no defiers.
The probability of treatment should increase with more encouragement.
No one consistently does the opposite of what they are told.
With this monotonicity assumption, now we have enough information to identify the causal effect among compliers. For observed data, we have
Class | ||||
---|---|---|---|---|
0 | 0 | 0 | ? | Never-takers or Compliers |
0 | 1 | 1 | Always-takers |
|
1 | 0 | 0 | Never-takers |
|
1 | 1 | ? | 1 | Always-takers or Compliers |
Causal Effect Identification and Estimation
Intention-to-treat effect:
where
Note: Among always takers and never takers, we can drop the condition on Z:
Then we can simplify
and
Therefore,
which implies
For the second line of the equation, because Z was randomized and these subjects are compliers, randomizing Z can be referred to as randomizing A.
Also,
: Proportion of people who are always takers or compliers : Proportion of people who are always takers The difference is the proportion of compliers
As a result, we have
where the numerator is the IIT and the denominator is the causal effect of treatment assignment on the treatment received.
If perfect compliance, CACE=ITT.
Denominator always between 0 and 1. Thus, CACE will be at least as large as ITT.
- ITT is an underestimate of CACE because some people assigned to treatment did not take it.
Summary
IV requires 2 key assumptions, the strongest of which is the exclusion restriction.
If one also makes the monotonicity assumption, then the complier average causal effect is identified.
Sensitivity Analysis
Sensitivity analysis is developed for each of the IV assumptions:
Exclusion Analysis: If Z does directly affect Y by an amount
, would my conclusions change? Monotonicity: If the proportion of defiers was
, would my conclusions change?
Reference: Baiocchi et al. (2014) "Tutorial in Biostatistics: Instrumental Variable Methods for Causal Inference."
Weak Instruments
The strength of an IV is how well it predicts treatment received.
A strong instrument is highly predictive of treatment.
- Encouragement greatly increases the probability of treatment
A weak instrument is weakly predictive of treatment.
- Encouragement barely increases the probability of treatment
Strength of IVs
Estimate the proportion of compliers:
If this number is close to 1, it has a strong instrument;
If close to 0, it has a weak instrument.
This proportion implies an effective sample size.
Suppose only 1% of the population are compliers. This leads to very large variance estimates, resulting in unstable estimates of causal effects. It would likely produce confidence intervals that are too wide to be useful.
Strengthen Instrument: Near/far Matching
Match so that covariates are similar but the instrument is very different
Reference: Baiocchi et al. (2012) "Near/far matching: a study design approach to instrumental variables."
View / Make Comments