Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference1 Gary King GaryKing.org April 10, 2015 1 c Copyright 2015 Gary King, All Rights Reserved. Gary King () Research Designs April 10, 2015 1 / 23 Reference Kosuke Imai, Gary King, and Elizabeth Stuart. Misunderstandings among Experimentalists and Observationalists: Balance Test Fallacies in Causal Inference Journal of the Royal Statistical Society, Series A Vol. 171, Part 2 (2008): Pp. 1-22 http://gking.harvard.edu/files/abs/matchse-abs.shtml Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 2 / 23 Notation Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 3 / 23 Notation Sample of n units from a finite population of N units (typically N >> n) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 3 / 23 Notation Sample of n units from a finite population of N units (typically N >> n) Sample selection: Ii is 1 for units selected, 0 otherwise Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 3 / 23 Notation Sample of n units from a finite population of N units (typically N >> n) Sample selection: Ii is 1 for units selected, 0 otherwise Treatment assignment: Ti is 1 for treated group, 0 control group Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 3 / 23 Notation Sample of n units from a finite population of N units (typically N >> n) Sample selection: Ii is 1 for units selected, 0 otherwise Treatment assignment: Ti is 1 for treated group, 0 control group (Assume: treated and control groups are each of size n/2) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 3 / 23 Notation Sample of n units from a finite population of N units (typically N >> n) Sample selection: Ii is 1 for units selected, 0 otherwise Treatment assignment: Ti is 1 for treated group, 0 control group (Assume: treated and control groups are each of size n/2) Observed outcome variable: Yi Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 3 / 23 Notation Sample of n units from a finite population of N units (typically N >> n) Sample selection: Ii is 1 for units selected, 0 otherwise Treatment assignment: Ti is 1 for treated group, 0 control group (Assume: treated and control groups are each of size n/2) Observed outcome variable: Yi Potential outcomes: Yi (1) and Yi (0), Yi potential values when Ti is 1 or 0 respectively. Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 3 / 23 Notation Sample of n units from a finite population of N units (typically N >> n) Sample selection: Ii is 1 for units selected, 0 otherwise Treatment assignment: Ti is 1 for treated group, 0 control group (Assume: treated and control groups are each of size n/2) Observed outcome variable: Yi Potential outcomes: Yi (1) and Yi (0), Yi potential values when Ti is 1 or 0 respectively. Fundamental problem of causal inference. Only one potential outcome is ever observed: If Ti = 0, Yi (0) = Yi Yi (1) = ? If Ti = 1, Yi (0) = ? Yi (1) = Yi Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 3 / 23 Notation Sample of n units from a finite population of N units (typically N >> n) Sample selection: Ii is 1 for units selected, 0 otherwise Treatment assignment: Ti is 1 for treated group, 0 control group (Assume: treated and control groups are each of size n/2) Observed outcome variable: Yi Potential outcomes: Yi (1) and Yi (0), Yi potential values when Ti is 1 or 0 respectively. Fundamental problem of causal inference. Only one potential outcome is ever observed: If Ti = 0, Yi (0) = Yi Yi (1) = ? If Ti = 1, Yi (0) = ? Yi (1) = Yi (Ii , Ti , Yi ) are random; Yi (1) and Yi (0) are fixed. Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 3 / 23 Quantities of Interest Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 4 / 23 Quantities of Interest Treatment Effect (for unit i): TEi Gary King (Harvard) ≡ Yi (1) − Yi (0) Research Designs for Causal Inference April 10, 2015 4 / 23 Quantities of Interest Treatment Effect (for unit i): TEi ≡ Yi (1) − Yi (0) Population Average Treatment Effect: PATE ≡ N 1 X TEi N i=1 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 4 / 23 Quantities of Interest Treatment Effect (for unit i): TEi ≡ Yi (1) − Yi (0) Population Average Treatment Effect: N 1 X TEi N PATE ≡ i=1 Sample Average Treatment Effect: SATE ≡ Gary King (Harvard) 1 n X TEi i∈{Ii =1} Research Designs for Causal Inference April 10, 2015 4 / 23 Decomposition of Causal Effect Estimation Error Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 5 / 23 Decomposition of Causal Effect Estimation Error Difference in means estimator: X 1 1 D ≡ Yi − n/2 n/2 i ∈{Ii =1,Ti =1} Gary King (Harvard) Research Designs for Causal Inference X Yi . i ∈{Ii =1,Ti =0} April 10, 2015 5 / 23 Decomposition of Causal Effect Estimation Error Difference in means estimator: X 1 1 D ≡ Yi − n/2 n/2 i ∈{Ii =1,Ti =1} X Yi . i ∈{Ii =1,Ti =0} Estimation Error: ∆ ≡ PATE − D Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 5 / 23 Decomposition of Causal Effect Estimation Error Difference in means estimator: X 1 1 D ≡ Yi − n/2 n/2 i ∈{Ii =1,Ti =1} X Yi . i ∈{Ii =1,Ti =0} Estimation Error: ∆ ≡ PATE − D Pretreatment confounders: X are observed and U are unobserved Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 5 / 23 Decomposition of Causal Effect Estimation Error Difference in means estimator: X 1 1 D ≡ Yi − n/2 n/2 i ∈{Ii =1,Ti =1} X Yi . i ∈{Ii =1,Ti =0} Estimation Error: ∆ ≡ PATE − D Pretreatment confounders: X are observed and U are unobserved Decomposition: ∆ = ∆ S + ∆T = (∆SX + ∆SU ) + (∆TX + ∆TU ) Error due to ∆S (sample selection), ∆T (treatment imbalance), and each due to observed (Xi ) and unobserved (Ui ) covariates Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 5 / 23 Selection Error Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 6 / 23 Selection Error Definition: ∆S ≡ PATE − SATE N −n = (NATE − SATE), N where NATE is the nonsample average treatment effect. Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 6 / 23 Selection Error Definition: ∆S ≡ PATE − SATE N −n = (NATE − SATE), N where NATE is the nonsample average treatment effect. ∆S vanishes if: Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 6 / 23 Selection Error Definition: ∆S ≡ PATE − SATE N −n = (NATE − SATE), N where NATE is the nonsample average treatment effect. ∆S vanishes if: 1 The sample is a census (Ii = 1 for all observations and n = N); Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 6 / 23 Selection Error Definition: ∆S ≡ PATE − SATE N −n = (NATE − SATE), N where NATE is the nonsample average treatment effect. ∆S vanishes if: 1 2 The sample is a census (Ii = 1 for all observations and n = N); SATE = NATE; or Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 6 / 23 Selection Error Definition: ∆S ≡ PATE − SATE N −n = (NATE − SATE), N where NATE is the nonsample average treatment effect. ∆S vanishes if: 1 2 3 The sample is a census (Ii = 1 for all observations and n = N); SATE = NATE; or Switch quantity of interest from PATE to SATE (recommended!) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 6 / 23 Decomposing Selection Error Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 7 / 23 Decomposing Selection Error Decomposition: ∆S = ∆SX + ∆SU Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 7 / 23 Decomposing Selection Error Decomposition: ∆S = ∆SX + ∆SU ∆SX = 0 when empirical distribution of (observed) X is identical in population and sample: Fe (X | I = 0) = Fe (X | I = 1). Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 7 / 23 Decomposing Selection Error Decomposition: ∆S = ∆SX + ∆SU ∆SX = 0 when empirical distribution of (observed) X is identical in population and sample: Fe (X | I = 0) = Fe (X | I = 1). ∆SU = 0 when empirical distribution of (unobserved) U is identical in population and sample: Fe (U | I = 0) = Fe (U | I = 1). Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 7 / 23 Decomposing Selection Error Decomposition: ∆S = ∆SX + ∆SU ∆SX = 0 when empirical distribution of (observed) X is identical in population and sample: Fe (X | I = 0) = Fe (X | I = 1). ∆SU = 0 when empirical distribution of (unobserved) U is identical in population and sample: Fe (U | I = 0) = Fe (U | I = 1). conditions are unverifiable: X is observed only in sample and U is not observed at all. Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 7 / 23 Decomposing Selection Error Decomposition: ∆S = ∆SX + ∆SU ∆SX = 0 when empirical distribution of (observed) X is identical in population and sample: Fe (X | I = 0) = Fe (X | I = 1). ∆SU = 0 when empirical distribution of (unobserved) U is identical in population and sample: Fe (U | I = 0) = Fe (U | I = 1). conditions are unverifiable: X is observed only in sample and U is not observed at all. ∆SX vanishes if weighting on X Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 7 / 23 Decomposing Selection Error Decomposition: ∆S = ∆SX + ∆SU ∆SX = 0 when empirical distribution of (observed) X is identical in population and sample: Fe (X | I = 0) = Fe (X | I = 1). ∆SU = 0 when empirical distribution of (unobserved) U is identical in population and sample: Fe (U | I = 0) = Fe (U | I = 1). conditions are unverifiable: X is observed only in sample and U is not observed at all. ∆SX vanishes if weighting on X ∆SU cannot be corrected after the fact Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 7 / 23 Decomposing Treatment Imbalance Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 8 / 23 Decomposing Treatment Imbalance Decomposition: ∆T = ∆TX + ∆TU Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 8 / 23 Decomposing Treatment Imbalance Decomposition: ∆T = ∆TX + ∆TU ∆TX = 0 when X is balanced between treateds and controls: Fe (X | T = 1, I = 1) = Fe (X | T = 0, I = 1). Verifiable from data; can be generated ex ante by blocking or enforced ex post via matching or parametric adjustment Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 8 / 23 Decomposing Treatment Imbalance Decomposition: ∆T = ∆TX + ∆TU ∆TX = 0 when X is balanced between treateds and controls: Fe (X | T = 1, I = 1) = Fe (X | T = 0, I = 1). Verifiable from data; can be generated ex ante by blocking or enforced ex post via matching or parametric adjustment ∆TU = 0 when U is balanced between treateds and controls: Fe (U | T = 1, I = 1) = Fe (U | T = 0, I = 1). Unverifiable. Achieved only by assumption or, on average, by random treatment assignment Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 8 / 23 Alternative Quantities of Interest Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 9 / 23 Alternative Quantities of Interest Sample average treatment effect on the treated: X 1 SATT ≡ TEi n/2 i∈{Ii =1,Ti =1} Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 9 / 23 Alternative Quantities of Interest Sample average treatment effect on the treated: X 1 SATT ≡ TEi n/2 i∈{Ii =1,Ti =1} Population average treatment effect on the treated: X 1 PATT ≡ ∗ TEi N i∈{Ti =1} (where N ∗ = population) Gary King (Harvard) PN i=1 Ti is the number of treated units in the Research Designs for Causal Inference April 10, 2015 9 / 23 Alternative Quantities of Interest Sample average treatment effect on the treated: X 1 SATT ≡ TEi n/2 i∈{Ii =1,Ti =1} Population average treatment effect on the treated: X 1 PATT ≡ ∗ TEi N i∈{Ti =1} PN (where N ∗ = i=1 Ti is the number of treated units in the population) When are these of more interest than PATE and SATE? Why never for randomized experiments? Why usually for matching? Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 9 / 23 Alternative Quantities of Interest Sample average treatment effect on the treated: X 1 SATT ≡ TEi n/2 i∈{Ii =1,Ti =1} Population average treatment effect on the treated: X 1 PATT ≡ ∗ TEi N i∈{Ti =1} PN (where N ∗ = i=1 Ti is the number of treated units in the population) When are these of more interest than PATE and SATE? Why never for randomized experiments? Why usually for matching? Analogous estimation error decomposition: ∆0 = PATT − D, holds: ∆0 = (∆0SX + ∆0SU ) + (∆0TX + ∆0TU ) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 9 / 23 Effects of Design Components on Estimation Error Design Choice Gary King (Harvard) ∆ SX Research Designs for Causal Inference ∆SU ∆ TX ∆ TU April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Gary King (Harvard) ∆ SX Research Designs for Causal Inference ∆SU ∆ TX ∆ TU April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Gary King (Harvard) ∆ SX avg = 0 Research Designs for Causal Inference ∆SU avg ∆ TX ∆ TU = 0 April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Complete stratified random sampling Gary King (Harvard) ∆ SX avg = 0 =0 Research Designs for Causal Inference ∆SU avg ∆ TX ∆ TU = 0 = 0 avg April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Complete stratified random sampling Focus on SATE rather than PATE Gary King (Harvard) ∆ SX avg = 0 =0 =0 Research Designs for Causal Inference ∆SU avg ∆ TX ∆ TU = 0 = 0 =0 avg April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Complete stratified random sampling Focus on SATE rather than PATE Weighting for nonrandom sampling Gary King (Harvard) ∆ SX avg = 0 =0 =0 =0 Research Designs for Causal Inference ∆SU avg ∆ TX ∆ TU = 0 = 0 =0 =? avg April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Complete stratified random sampling Focus on SATE rather than PATE Weighting for nonrandom sampling Large sample size Gary King (Harvard) ∆ SX avg = 0 =0 =0 =0 →? Research Designs for Causal Inference ∆SU avg = 0 = 0 =0 =? →? ∆ TX ∆ TU →? →? avg April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Complete stratified random sampling Focus on SATE rather than PATE Weighting for nonrandom sampling Large sample size Random treatment assignment Gary King (Harvard) ∆ SX avg = 0 =0 =0 =0 →? Research Designs for Causal Inference ∆SU avg = 0 = 0 =0 =? →? ∆ TX ∆ TU →? = 0 avg avg avg →? = 0 April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Complete stratified random sampling Focus on SATE rather than PATE Weighting for nonrandom sampling Large sample size Random treatment assignment Complete blocking Gary King (Harvard) ∆ SX avg = 0 =0 =0 =0 →? Research Designs for Causal Inference ∆SU avg = 0 = 0 =0 =? →? ∆ TX ∆ TU avg →? = 0 =0 avg →? = 0 =? avg April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Complete stratified random sampling Focus on SATE rather than PATE Weighting for nonrandom sampling Large sample size Random treatment assignment Complete blocking Exact matching Gary King (Harvard) ∆ SX avg = 0 =0 =0 =0 →? Research Designs for Causal Inference ∆SU avg = 0 = 0 =0 =? →? ∆ TX ∆ TU avg →? = 0 =0 =0 avg →? = 0 =? =? avg April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Complete stratified random sampling Focus on SATE rather than PATE Weighting for nonrandom sampling Large sample size Random treatment assignment Complete blocking Exact matching Assumption Gary King (Harvard) ∆ SX avg = 0 =0 =0 =0 →? Research Designs for Causal Inference ∆SU avg = 0 = 0 =0 =? →? ∆ TX ∆ TU avg →? = 0 =0 =0 avg →? = 0 =? =? avg April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Complete stratified random sampling Focus on SATE rather than PATE Weighting for nonrandom sampling Large sample size Random treatment assignment Complete blocking Exact matching Assumption No selection bias Gary King (Harvard) ∆ SX avg = 0 =0 =0 =0 →? ∆SU avg = 0 = 0 =0 =? →? ∆ TX avg →? = 0 =0 =0 avg avg = 0 Research Designs for Causal Inference ∆ TU →? = 0 =? =? avg avg = 0 April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Complete stratified random sampling Focus on SATE rather than PATE Weighting for nonrandom sampling Large sample size Random treatment assignment Complete blocking Exact matching Assumption No selection bias Ignorability Gary King (Harvard) ∆ SX avg = 0 =0 =0 =0 →? ∆SU avg = 0 = 0 =0 =? →? ∆ TX avg →? = 0 =0 =0 avg avg = 0 Research Designs for Causal Inference ∆ TU →? = 0 =? =? avg avg = 0 avg = 0 April 10, 2015 10 / 23 Effects of Design Components on Estimation Error Design Choice Random sampling Complete stratified random sampling Focus on SATE rather than PATE Weighting for nonrandom sampling Large sample size Random treatment assignment Complete blocking Exact matching Assumption No selection bias Ignorability No omitted variables Gary King (Harvard) ∆ SX avg = 0 =0 =0 =0 →? ∆SU avg = 0 = 0 =0 =? →? ∆ TX avg →? = 0 =0 =0 avg avg = 0 Research Designs for Causal Inference ∆ TU →? = 0 =? =? avg avg = 0 avg = 0 =0 April 10, 2015 10 / 23 Comparing Blocking and Matching Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 11 / 23 Comparing Blocking and Matching Adding blocking to random assignment is always as or more efficient, and never biased (blocking on pretreatment variables related to the outcome is more efficient) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 11 / 23 Comparing Blocking and Matching Adding blocking to random assignment is always as or more efficient, and never biased (blocking on pretreatment variables related to the outcome is more efficient) Blocking is like regression adjustment, where the functional form and the parameter values are known Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 11 / 23 Comparing Blocking and Matching Adding blocking to random assignment is always as or more efficient, and never biased (blocking on pretreatment variables related to the outcome is more efficient) Blocking is like regression adjustment, where the functional form and the parameter values are known Matching is like blocking, except: Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 11 / 23 Comparing Blocking and Matching Adding blocking to random assignment is always as or more efficient, and never biased (blocking on pretreatment variables related to the outcome is more efficient) Blocking is like regression adjustment, where the functional form and the parameter values are known Matching is like blocking, except: 1 to avoid selection error, must change quantity of interest from PATE to PATT or SATT, Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 11 / 23 Comparing Blocking and Matching Adding blocking to random assignment is always as or more efficient, and never biased (blocking on pretreatment variables related to the outcome is more efficient) Blocking is like regression adjustment, where the functional form and the parameter values are known Matching is like blocking, except: 1 2 to avoid selection error, must change quantity of interest from PATE to PATT or SATT, random treatment assignment following matching is impossible, Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 11 / 23 Comparing Blocking and Matching Adding blocking to random assignment is always as or more efficient, and never biased (blocking on pretreatment variables related to the outcome is more efficient) Blocking is like regression adjustment, where the functional form and the parameter values are known Matching is like blocking, except: 1 2 3 to avoid selection error, must change quantity of interest from PATE to PATT or SATT, random treatment assignment following matching is impossible, Exact matching, unlike blocking, is dependent on the already-collected data happening to contain sufficiently good matches. Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 11 / 23 Comparing Blocking and Matching Adding blocking to random assignment is always as or more efficient, and never biased (blocking on pretreatment variables related to the outcome is more efficient) Blocking is like regression adjustment, where the functional form and the parameter values are known Matching is like blocking, except: 1 2 3 4 to avoid selection error, must change quantity of interest from PATE to PATT or SATT, random treatment assignment following matching is impossible, Exact matching, unlike blocking, is dependent on the already-collected data happening to contain sufficiently good matches. In the worst case scenerio, matching (just like regression adjustment) can increase bias (this cannot occur with blocking plus random assignment) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 11 / 23 Comparing Blocking and Matching Adding blocking to random assignment is always as or more efficient, and never biased (blocking on pretreatment variables related to the outcome is more efficient) Blocking is like regression adjustment, where the functional form and the parameter values are known Matching is like blocking, except: 1 2 3 4 to avoid selection error, must change quantity of interest from PATE to PATT or SATT, random treatment assignment following matching is impossible, Exact matching, unlike blocking, is dependent on the already-collected data happening to contain sufficiently good matches. In the worst case scenerio, matching (just like regression adjustment) can increase bias (this cannot occur with blocking plus random assignment) Adding matching to a parametric model almost always reduces model dependence and bias, and sometimes variance too Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 11 / 23 The Benefits of Major Research Designs: Overview Ideal experiment Randomized clinicial trials (Limited or no blocking) Randomized clinicial trials (Full blocking) Social Science Field Experiment (Limited or no blocking) Survey Experiment (Limited or no blocking) Observational Study (Representative data set, Well-matched) Observational Study (Unrepresentative but partially, correctable data, well-matched) Observational Study (Unrepresentative data set, Well-matched) ∆SX →0 ∆ SU →0 ∆ TX =0 ∆ TU →0 6= 0 6= 0 avg avg 6= 0 6= 0 =0 6= 0 6= 0 →0 →0 →0 →0 →0 →0 ≈0 ≈0 ≈0 6= 0 ≈0 6= 0 ≈0 6= 0 6= 0 6= 0 ≈0 6= 0 = 0 = 0 avg = 0 For column Q, “→ 0” denotes E (Q) = 0 and limn→∞ Var(Q) = 0, avg whereas “ = 0” indicates zero on average, or E (Q) = 0, for a design with a small n. ∆S can be set to zero if we switch from PATE to SATE. Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 12 / 23 The Ideal Experiment (according to the paper) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 13 / 23 The Ideal Experiment (according to the paper) Random selection from well-defined population Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 13 / 23 The Ideal Experiment (according to the paper) Random selection from well-defined population large n Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 13 / 23 The Ideal Experiment (according to the paper) Random selection from well-defined population large n blocking on all known confounders Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 13 / 23 The Ideal Experiment (according to the paper) Random selection from well-defined population large n blocking on all known confounders random treatment assignment within blocks Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 13 / 23 The Ideal Experiment (according to the paper) Random selection from well-defined population large n blocking on all known confounders random treatment assignment within blocks E (∆SX ) = 0, limn→∞ V (∆SX ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 13 / 23 The Ideal Experiment (according to the paper) Random selection from well-defined population large n blocking on all known confounders random treatment assignment within blocks E (∆SX ) = 0, limn→∞ V (∆SX ) = 0 E (∆SU ) = 0, limn→∞ V (∆SU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 13 / 23 The Ideal Experiment (according to the paper) Random selection from well-defined population large n blocking on all known confounders random treatment assignment within blocks E (∆SX ) = 0, limn→∞ V (∆SX ) = 0 E (∆SU ) = 0, limn→∞ V (∆SU ) = 0 ∆ TX = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 13 / 23 The Ideal Experiment (according to the paper) Random selection from well-defined population large n blocking on all known confounders random treatment assignment within blocks E (∆SX ) = 0, limn→∞ V (∆SX ) = 0 E (∆SU ) = 0, limn→∞ V (∆SU ) = 0 ∆ TX = 0 E (∆TU ) = 0, limn→∞ V (∆TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 13 / 23 An Even More Ideal Experiment Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 14 / 23 An Even More Ideal Experiment Begin with a well-defined population Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 14 / 23 An Even More Ideal Experiment Begin with a well-defined population Define sampling strata based on cross-classification of all known confounders Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 14 / 23 An Even More Ideal Experiment Begin with a well-defined population Define sampling strata based on cross-classification of all known confounders Random sampling within strata (if strata sample is proportional to population fraction, no weights are needed) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 14 / 23 An Even More Ideal Experiment Begin with a well-defined population Define sampling strata based on cross-classification of all known confounders Random sampling within strata (if strata sample is proportional to population fraction, no weights are needed) large n Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 14 / 23 An Even More Ideal Experiment Begin with a well-defined population Define sampling strata based on cross-classification of all known confounders Random sampling within strata (if strata sample is proportional to population fraction, no weights are needed) large n blocking on all known confounders Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 14 / 23 An Even More Ideal Experiment Begin with a well-defined population Define sampling strata based on cross-classification of all known confounders Random sampling within strata (if strata sample is proportional to population fraction, no weights are needed) large n blocking on all known confounders random treatment assignment within blocks Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 14 / 23 An Even More Ideal Experiment Begin with a well-defined population Define sampling strata based on cross-classification of all known confounders Random sampling within strata (if strata sample is proportional to population fraction, no weights are needed) large n blocking on all known confounders random treatment assignment within blocks ∆ SX = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 14 / 23 An Even More Ideal Experiment Begin with a well-defined population Define sampling strata based on cross-classification of all known confounders Random sampling within strata (if strata sample is proportional to population fraction, no weights are needed) large n blocking on all known confounders random treatment assignment within blocks ∆ SX = 0 E (∆SU ) = 0, limn→∞ V (∆SU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 14 / 23 An Even More Ideal Experiment Begin with a well-defined population Define sampling strata based on cross-classification of all known confounders Random sampling within strata (if strata sample is proportional to population fraction, no weights are needed) large n blocking on all known confounders random treatment assignment within blocks ∆ SX = 0 E (∆SU ) = 0, limn→∞ V (∆SU ) = 0 ∆ TX = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 14 / 23 An Even More Ideal Experiment Begin with a well-defined population Define sampling strata based on cross-classification of all known confounders Random sampling within strata (if strata sample is proportional to population fraction, no weights are needed) large n blocking on all known confounders random treatment assignment within blocks ∆ SX = 0 E (∆SU ) = 0, limn→∞ V (∆SU ) = 0 ∆ TX = 0 E (∆TU ) = 0, limn→∞ V (∆TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 14 / 23 Randomized Clinical Trials (Little or no Blocking) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 15 / 23 Randomized Clinical Trials (Little or no Blocking) nonrandom selection Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 15 / 23 Randomized Clinical Trials (Little or no Blocking) nonrandom selection small n Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 15 / 23 Randomized Clinical Trials (Little or no Blocking) nonrandom selection small n little or no blocking Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 15 / 23 Randomized Clinical Trials (Little or no Blocking) nonrandom selection small n little or no blocking random treatment assignment Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 15 / 23 Randomized Clinical Trials (Little or no Blocking) nonrandom selection small n little or no blocking random treatment assignment ∆SX 6= 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 15 / 23 Randomized Clinical Trials (Little or no Blocking) nonrandom selection small n little or no blocking random treatment assignment ∆SX 6= 0 ∆SU 6= 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 15 / 23 Randomized Clinical Trials (Little or no Blocking) nonrandom selection small n little or no blocking random treatment assignment ∆SX 6= 0 ∆SU 6= 0 E (∆TX ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 15 / 23 Randomized Clinical Trials (Little or no Blocking) nonrandom selection small n little or no blocking random treatment assignment ∆SX 6= 0 ∆SU 6= 0 E (∆TX ) = 0 E (∆TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 15 / 23 Randomized Clinical Trials (Full Blocking) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 16 / 23 Randomized Clinical Trials (Full Blocking) nonrandom selection Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 16 / 23 Randomized Clinical Trials (Full Blocking) nonrandom selection small n Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 16 / 23 Randomized Clinical Trials (Full Blocking) nonrandom selection small n Full blocking Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 16 / 23 Randomized Clinical Trials (Full Blocking) nonrandom selection small n Full blocking random treatment assignment Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 16 / 23 Randomized Clinical Trials (Full Blocking) nonrandom selection small n Full blocking random treatment assignment ∆SX 6= 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 16 / 23 Randomized Clinical Trials (Full Blocking) nonrandom selection small n Full blocking random treatment assignment ∆SX 6= 0 ∆SU 6= 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 16 / 23 Randomized Clinical Trials (Full Blocking) nonrandom selection small n Full blocking random treatment assignment ∆SX 6= 0 ∆SU 6= 0 ∆ TX = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 16 / 23 Randomized Clinical Trials (Full Blocking) nonrandom selection small n Full blocking random treatment assignment ∆SX 6= 0 ∆SU 6= 0 ∆ TX = 0 E (∆TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 16 / 23 Social Science Field Experiment Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 17 / 23 Social Science Field Experiment nonrandom selection Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 17 / 23 Social Science Field Experiment nonrandom selection large n Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 17 / 23 Social Science Field Experiment nonrandom selection large n limited or no blocking Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 17 / 23 Social Science Field Experiment nonrandom selection large n limited or no blocking random treatment assignment Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 17 / 23 Social Science Field Experiment nonrandom selection large n limited or no blocking random treatment assignment ∆SX 6= 0 or change PATE to SATE and ∆SX = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 17 / 23 Social Science Field Experiment nonrandom selection large n limited or no blocking random treatment assignment ∆SX 6= 0 or change PATE to SATE and ∆SX = 0 ∆SU 6= 0 or change PATE to SATE and ∆SU = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 17 / 23 Social Science Field Experiment nonrandom selection large n limited or no blocking random treatment assignment ∆SX 6= 0 or change PATE to SATE and ∆SX = 0 ∆SU 6= 0 or change PATE to SATE and ∆SU = 0 E (∆TX ) = 0, limn→∞ V (∆TX ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 17 / 23 Social Science Field Experiment nonrandom selection large n limited or no blocking random treatment assignment ∆SX 6= 0 or change PATE to SATE and ∆SX = 0 ∆SU 6= 0 or change PATE to SATE and ∆SU = 0 E (∆TX ) = 0, limn→∞ V (∆TX ) = 0 E (∆TU ) = 0, limn→∞ V (∆TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 17 / 23 Survey Experiment Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 18 / 23 Survey Experiment random selection Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 18 / 23 Survey Experiment random selection large n Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 18 / 23 Survey Experiment random selection large n limited or no blocking Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 18 / 23 Survey Experiment random selection large n limited or no blocking random treatment assignment (only question wording treatments) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 18 / 23 Survey Experiment random selection large n limited or no blocking random treatment assignment (only question wording treatments) E (∆SX ) = 0, limn→∞ V (∆SX ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 18 / 23 Survey Experiment random selection large n limited or no blocking random treatment assignment (only question wording treatments) E (∆SX ) = 0, limn→∞ V (∆SX ) = 0 E (∆SU ) = 0, limn→∞ V (∆SU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 18 / 23 Survey Experiment random selection large n limited or no blocking random treatment assignment (only question wording treatments) E (∆SX ) = 0, limn→∞ V (∆SX ) = 0 E (∆SU ) = 0, limn→∞ V (∆SU ) = 0 E (∆TX ) = 0, limn→∞ V (∆TX ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 18 / 23 Survey Experiment random selection large n limited or no blocking random treatment assignment (only question wording treatments) E (∆SX ) = 0, limn→∞ V (∆SX ) = 0 E (∆SU ) = 0, limn→∞ V (∆SU ) = 0 E (∆TX ) = 0, limn→∞ V (∆TX ) = 0 E (∆TU ) = 0, limn→∞ V (∆TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 18 / 23 Observational Study, well-matched Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 19 / 23 Observational Study, well-matched nonrandom selection Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 19 / 23 Observational Study, well-matched nonrandom selection large n Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 19 / 23 Observational Study, well-matched nonrandom selection large n no blocking Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 19 / 23 Observational Study, well-matched nonrandom selection large n no blocking nonrandom treatment assignment Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 19 / 23 Observational Study, well-matched nonrandom selection large n no blocking nonrandom treatment assignment ∆SX ≈ 0 if representative, corrected by weighting, or for estimating SATE; or 6= 0 otherwise Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 19 / 23 Observational Study, well-matched nonrandom selection large n no blocking nonrandom treatment assignment ∆SX ≈ 0 if representative, corrected by weighting, or for estimating SATE; or 6= 0 otherwise ∆SU 6= 0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 19 / 23 Observational Study, well-matched nonrandom selection large n no blocking nonrandom treatment assignment ∆SX ≈ 0 if representative, corrected by weighting, or for estimating SATE; or 6= 0 otherwise ∆SU 6= 0 ∆TX ≈ 0 (due to matching well) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 19 / 23 Observational Study, well-matched nonrandom selection large n no blocking nonrandom treatment assignment ∆SX ≈ 0 if representative, corrected by weighting, or for estimating SATE; or 6= 0 otherwise ∆SU 6= 0 ∆TX ≈ 0 (due to matching well) ∆TU 6= 0 except by assumption Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 19 / 23 What is the Best Design? Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 20 / 23 What is the Best Design? The ideal design is rarely feasible Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 20 / 23 What is the Best Design? The ideal design is rarely feasible Effort in experimental studies: random assignment Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 20 / 23 What is the Best Design? The ideal design is rarely feasible Effort in experimental studies: random assignment Effort in observational studies: measuring and adjusting for X (via matching or modeling) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 20 / 23 What is the Best Design? The ideal design is rarely feasible Effort in experimental studies: random assignment Effort in observational studies: measuring and adjusting for X (via matching or modeling) the Achilles heal of experimental studies: ∆S and small n Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 20 / 23 What is the Best Design? The ideal design is rarely feasible Effort in experimental studies: random assignment Effort in observational studies: measuring and adjusting for X (via matching or modeling) the Achilles heal of experimental studies: ∆S and small n the Achilles heal of observational studies: ∆U Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 20 / 23 What is the Best Design? The ideal design is rarely feasible Effort in experimental studies: random assignment Effort in observational studies: measuring and adjusting for X (via matching or modeling) the Achilles heal of experimental studies: ∆S and small n the Achilles heal of observational studies: ∆U Each design accomodates best to the applications for which it was designed Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 20 / 23 Fallacies in Experimental Research Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 21 / 23 Fallacies in Experimental Research Failure to block on all known covariates Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 21 / 23 Fallacies in Experimental Research Failure to block on all known covariates seen incorrectly as requiring fewer assumptions (about what to block on) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 21 / 23 Fallacies in Experimental Research Failure to block on all known covariates seen incorrectly as requiring fewer assumptions (about what to block on) In fact, blocking helps (except in strange situations) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 21 / 23 Fallacies in Experimental Research Failure to block on all known covariates seen incorrectly as requiring fewer assumptions (about what to block on) In fact, blocking helps (except in strange situations) Blocking on relevant covariates is better, so choose carefully. Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 21 / 23 Fallacies in Experimental Research Failure to block on all known covariates seen incorrectly as requiring fewer assumptions (about what to block on) In fact, blocking helps (except in strange situations) Blocking on relevant covariates is better, so choose carefully. “Block what you can and randomize what you cannot” (Box et al.) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 21 / 23 Fallacies in Experimental Research Failure to block on all known covariates seen incorrectly as requiring fewer assumptions (about what to block on) In fact, blocking helps (except in strange situations) Blocking on relevant covariates is better, so choose carefully. “Block what you can and randomize what you cannot” (Box et al.) Conducting t-tests to check for balance after random treatment assignment Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 21 / 23 Fallacies in Experimental Research Failure to block on all known covariates seen incorrectly as requiring fewer assumptions (about what to block on) In fact, blocking helps (except in strange situations) Blocking on relevant covariates is better, so choose carefully. “Block what you can and randomize what you cannot” (Box et al.) Conducting t-tests to check for balance after random treatment assignment variables blocked on balance exactly after treatment assignment; so if you’re checking, you missed an opportunity to increase efficiency Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 21 / 23 Fallacies in Experimental Research Failure to block on all known covariates seen incorrectly as requiring fewer assumptions (about what to block on) In fact, blocking helps (except in strange situations) Blocking on relevant covariates is better, so choose carefully. “Block what you can and randomize what you cannot” (Box et al.) Conducting t-tests to check for balance after random treatment assignment variables blocked on balance exactly after treatment assignment; so if you’re checking, you missed an opportunity to increase efficiency if variables became available after treatment assignment, a t-test only checks for whether randomization was done appropriately Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 21 / 23 Fallacies in Experimental Research Failure to block on all known covariates seen incorrectly as requiring fewer assumptions (about what to block on) In fact, blocking helps (except in strange situations) Blocking on relevant covariates is better, so choose carefully. “Block what you can and randomize what you cannot” (Box et al.) Conducting t-tests to check for balance after random treatment assignment variables blocked on balance exactly after treatment assignment; so if you’re checking, you missed an opportunity to increase efficiency if variables became available after treatment assignment, a t-test only checks for whether randomization was done appropriately randomization balances on average; any one random assignment is not balanced exactly (which is why its better to block) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 21 / 23 60 20 1 Math test score "Statistical insignificance" region 40 2 t−statistic 3 80 4 100 The Balance Test Fallacy in Matching Research 0 0 QQ Plot Mean Deviation 0 100 200 300 400 Number of Controls Randomly Dropped Difference in Means 0 100 200 300 400 Number of Controls Randomly Dropped Randomly dropping observations “reduces” imbalance??? Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 22 / 23 The Balance Test Fallacy: Explanation Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 23 / 23 The Balance Test Fallacy: Explanation hypothesis tests are a function of balance AND power; we only want to measure balance Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 23 / 23 The Balance Test Fallacy: Explanation hypothesis tests are a function of balance AND power; we only want to measure balance for SATE, PATE, or SPATE, balance is a characteristic of the sample, not some superpopulation; so no need to make an inference Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 23 / 23 The Balance Test Fallacy: Explanation hypothesis tests are a function of balance AND power; we only want to measure balance for SATE, PATE, or SPATE, balance is a characteristic of the sample, not some superpopulation; so no need to make an inference Simple linear model (for intution): Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 23 / 23 The Balance Test Fallacy: Explanation hypothesis tests are a function of balance AND power; we only want to measure balance for SATE, PATE, or SPATE, balance is a characteristic of the sample, not some superpopulation; so no need to make an inference Simple linear model (for intution): Suppose E (Y | T , X ) = θ + T β + X γ Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 23 / 23 The Balance Test Fallacy: Explanation hypothesis tests are a function of balance AND power; we only want to measure balance for SATE, PATE, or SPATE, balance is a characteristic of the sample, not some superpopulation; so no need to make an inference Simple linear model (for intution): Suppose E (Y | T , X ) = θ + T β + X γ Bias in coefficient on T from regressing Y on T (without X ): E (βˆ − β | T , X ) = G γ (where G are coefficients from a regression X on a constant and T ) Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 23 / 23 The Balance Test Fallacy: Explanation hypothesis tests are a function of balance AND power; we only want to measure balance for SATE, PATE, or SPATE, balance is a characteristic of the sample, not some superpopulation; so no need to make an inference Simple linear model (for intution): Suppose E (Y | T , X ) = θ + T β + X γ Bias in coefficient on T from regressing Y on T (without X ): E (βˆ − β | T , X ) = G γ (where G are coefficients from a regression X on a constant and T ) Imbalance is due to G only. Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 23 / 23 The Balance Test Fallacy: Explanation hypothesis tests are a function of balance AND power; we only want to measure balance for SATE, PATE, or SPATE, balance is a characteristic of the sample, not some superpopulation; so no need to make an inference Simple linear model (for intution): Suppose E (Y | T , X ) = θ + T β + X γ Bias in coefficient on T from regressing Y on T (without X ): E (βˆ − β | T , X ) = G γ (where G are coefficients from a regression X on a constant and T ) Imbalance is due to G only. If G = 0, bias=0 Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 23 / 23 The Balance Test Fallacy: Explanation hypothesis tests are a function of balance AND power; we only want to measure balance for SATE, PATE, or SPATE, balance is a characteristic of the sample, not some superpopulation; so no need to make an inference Simple linear model (for intution): Suppose E (Y | T , X ) = θ + T β + X γ Bias in coefficient on T from regressing Y on T (without X ): E (βˆ − β | T , X ) = G γ (where G are coefficients from a regression X on a constant and T ) Imbalance is due to G only. If G = 0, bias=0 If G 6= 0, bias can be any size due to γ Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 23 / 23 The Balance Test Fallacy: Explanation hypothesis tests are a function of balance AND power; we only want to measure balance for SATE, PATE, or SPATE, balance is a characteristic of the sample, not some superpopulation; so no need to make an inference Simple linear model (for intution): Suppose E (Y | T , X ) = θ + T β + X γ Bias in coefficient on T from regressing Y on T (without X ): E (βˆ − β | T , X ) = G γ (where G are coefficients from a regression X on a constant and T ) Imbalance is due to G only. If G = 0, bias=0 If G 6= 0, bias can be any size due to γ If you cannot control G (and we don’t usually try to minimize γ so we don’t cook the books), then G must be reduced without limit. Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 23 / 23 The Balance Test Fallacy: Explanation hypothesis tests are a function of balance AND power; we only want to measure balance for SATE, PATE, or SPATE, balance is a characteristic of the sample, not some superpopulation; so no need to make an inference Simple linear model (for intution): Suppose E (Y | T , X ) = θ + T β + X γ Bias in coefficient on T from regressing Y on T (without X ): E (βˆ − β | T , X ) = G γ (where G are coefficients from a regression X on a constant and T ) Imbalance is due to G only. If G = 0, bias=0 If G 6= 0, bias can be any size due to γ If you cannot control G (and we don’t usually try to minimize γ so we don’t cook the books), then G must be reduced without limit. No threshold level is safe. Gary King (Harvard) Research Designs for Causal Inference April 10, 2015 23 / 23
© Copyright 2025