of statistical inference The machinery of causal calculus Probabilistic graphical models Causal conditional probabilities d-separation The 3 Rules of Causal Calculus Does smoking cause cancer? Predict or Explain? Summary Questions? Appendix
only a subset of interesting questions 2. Costly, take a lot of time 3. Can be trickier proceduraly than one thinks 4. Sometimes impossible practically or ethically…
Let denote the set of nodes in Z which are not ancestors of W. Z(W) Suppose . (Y ⊥ Z|W, X) G , X ⎯⎯ ⎯ Z(W) ⎯ ⎯ ⎯⎯⎯⎯⎯⎯⎯⎯ Then: . p(y|w, do(x), do(z)) = p(y|w, do(x))
tools and the 3 rules, we can now: 1. Collect Smoker / non-smoker observational data Look Ma' no RCT data! 2. Draw the graphical causal model 3. Derive p(cancer|do(smoke))
type of data Anti-smoking lobby: 1. tar increases cancer in both groups (+5 p. points) 2. smoking increases tar (380/400 vs. 20/380) p(X|Y) Summary: Smoking is BAD for you!
r t f u r i o u s h a n d - w a v i n g > > , where p(y|do(x)) = p(y| , z)p(z|x)p( ) ∑ z x ′ x′ x′ p(y|do(x)) = p(cancer|do(smoke)) The result is: smoking causes tar deposits and those increase cancer! (note: data is made up) Summary: Smoking is BAD for you!
to solve the prob. graphical models There are also completely different approaches Neyman-Rubin Causality Granger Causation coef cient The "econometrics toolkit": Structural Models / instrumental variables etc.