Randomized experiments (Causal inference: What if, Chapter 2)

Chapter 2 RANDOMIZED EXPERIMENTS komuRa Twitter: @hizakayu 1

Table of contents • 2.1 Randomization • 2.2 Conditional randomization
• 2.3 Standardization • 2.4 Inverse probability weighting • Fine Points • 2.1 Crossover experiments • 2.2 Risk period • Technical Points • 2.1 Full exchangeability and mean exchangeability • 2.2 Formal definition of IP weights • 2.3 Equivalence of IP weighting and standardization 2

2.1 Randomization 3

Randomized experiment 理想的なデータ(Table1.1) Effect measureが計算できる。現実的なデータ(Table2.1) Association measureしか計算できない。 4 Randomized
experimentsでは、Table2のようなmissing valueを含むデータしか手に入らないにもかかわらず、effect measureを計算できる。 ∵missing valueがランダムに起きることが保証されている。

“exchangeable”な2群 A=１ Pr[Y=1|A=1]=0.3 A=０ Pr[Y=1|A=0]=0.6 assoc/ risk ratio = 0.3/0.6
= 0.5 assoc/ risk diff = 0.3 – 0.6 = -0.3 A=１ Pr[Y=1|A=1]=0.3 A=０ Pr[Y=1|A=0]=0.6 assoc/ risk ratio = 0.3/0.6 = 0.5 assoc/ risk diff = 0.3 – 0.6 = -0.3 5 別の群が治療に割り当てられてもassociation measureが同じ。このような場合に、2群は“exchangeable”という。

Exchangeabilityの表し方 • Exchangeabilityとは、 Pr Ya = 1 A = 1
= Pr Ya = 1 A = 0 = Pr Ya = 1 , for both a=0 and a=1 • また、Exchangeabilityは , for all values a 6

Exchangeabilityの意義 • Exchangeabilityの下では、それぞれ、 Pr Y=1 = 1 = Pr Y
= 1 A = 1 , Pr Y=0 = 1 = Pr Y = 1 A = 0 が成り立つ。 • 現実の値から counterfactual risk が計算できる。 • 現実の値から causal risk ratio / diff が計算できる。 • 理想的なランダム化試験では、“Association is causation.” 7

Couterfactual outcome, Ya について • Yaは、治療が割り付けられる前から決まっているもの。 • Yaは、治療aに割り当てられたときの起こるであろうアウトカムのことをエンコードするもの。 •
Yaは、実際に割り当てられる治療とは独立。 • そういう点で、 Yaは遺伝子構成のようなもの。 • Yaと遺伝子構成と違うのは、 Yaが治療が割り当てられた後にしか判明せず、かつ、治療Aがaであったときにしか判明しないこと。 8

との違い • は、治療群と非治療群を入れ替えても同じリスクになること。 • が成り立つからといって、とは限らない。例えば、 •
ランダム化試験ではである。 • しかし、治療に効果がある場合、治療はアウトカムと関連するため、とはならない。 9

Table2.1ではExchangeabilityは成立？理想的なデータ(Table1.1) 現実的なデータ(Table2.1) • a = 0について Pr Y=0 =
1| = 1 = 7 13 ≓ 0.54 Pr Y=0 = 1| = 0 = 3 7 ≓ 0.43 • a = 1について Pr Y=0 = 1| = 1 = 7 13 ≓ 0.54 Pr Y=0 = 1| = 0 = 3 7 ≓ 0.43 以上から、は成立しない。 10

Exchangeabilityが成立しなければ、 Randomized experimentでないといえる？仮に理想的なデータ（Table1.1）があったとしても、二つの理由から結論付けられない。 • サンプルサイズが小さい。 • Exchangeabilityが成立していなくても、randomized experimentであり得る。
• 二つ以上のコインを使う場合→conditional randomization 11

2.2 Conditional randomization 12

Marginal experiments と conditional randomized experiments 予後因子Lを加えたデータ（Table2.2） Design1 [marginally
randomized experiments] • Design2 [conditionally randomized experiments] 13

Design 1 marginal experiments • Marginally randomized experiment なら、 Pr
Ya = 1 A = 1 = Pr Ya = 1 A = 0 (= Pr Ya = 1 ) or , for all a が成り立つはず。 • しかし、今回は治療群と非治療群とで予後因子Lの分布が違う（69%(=9/13) vs 43%(=3/7)）。 • 治療AがYaを予測しているため、が成立しない。 14

Design 2 conditionally randomized experiments① • Critical condition かどうかで分けた後、subset ごとにmarginally
randomized experiments を行っている。 • Critical condition(L=1)である人に限れば、Yaのリスクは同じになるので、 Pr Ya = 1 A = 1, L = 1 = Pr Ya = 1 A = 0, L = 1 or |L=1 が成り立つ。 15

Design 2 conditionally randomized experiments② • Non critical condition（L=0）の subset
でも同様なので、 |L=l, for all l. が成り立つ。 • シンプルに表すと |L • これをconditional exchangeabilityという。 16

Conditinally randomized experimentsでの二つの計算。 Causal effectsの計算にあたって、選択肢が二つある。 • Subset ごとにcausal effect計算（=
Stratification） L=1のとき Pr Ya = 1 A = 1, L = 1 / Pr Ya = 1 A = 0, L = 1 L=0のとき Pr Ya = 1 A = 1, L = 0 / Pr Ya = 1 A = 0, L = 0 • 平均のcaucal effectを計算 • Standardization • Inverse probability weighting 17

2.3 Standardization 18

Standardization① ∵ |L • 今知りたいのは、 Pr[Ya=1 = 1] / Pr[Ya=0
= 1] ⋯ ★ 19 Pr Y = 1 A = 1, L = 0 = 1 4 = Pr[Y=1 = 1|L = 0] Pr Y = 1 A = 0, L = 0 = 1 4 = Pr[Y=0 = 1|L = 0] Pr Y = 1 A = 1, L = 1 = 2 3 = Pr[Y=1 = 1|L = 1] Pr Y = 1 A = 0, L = 1 = 2 3 = Pr[Y=0 = 1|L = 1]

Standardization② ★の分子= Pr[Ya=1 = 1] = 全員が治療された（A=1）ときのリスク = subsetのサイズに比例した重みをもつ加重平均 =
Pr Y=1 = 1 L = 0 ∗ 0.4 + Pr Y=1 = 1 L = 1 ∗ 0.6 = 1 4 ∗ 0.4 + 2 3 ∗ 0.6 = 0.5 • 同様にして ★の分母= Pr[Ya=0 = 1] = 0.5 • よって、 Pr[Ya=1 = 1] / Pr[Ya=0 = 1] = 1 20

Standardization③ • Marginal counterfactural riskというのは、 Pr[Ya = 1] = Pr
= 1 = 0 Pr = 0 + Pr = 1 = 1 Pr = 1 = Pr = 1 = Pr = = Pr = 1 = , = Pr = • 観察データの分布の関数でcounterfactural quarityが表せられるとき、”identified” または “identifiable”という。 21

2.4 Inverse probability weighting 22

Fully randomized causally interpreted structured tree graph FRCISTG Table2.2 23
これを使って、Pr[Ya=1 = 1] / Pr[Ya=0 = 1] ⋯ ★ を算出する

仮に全員がA=0だったとしたもの分母（Pr[=0 = 1]）の計算観察したもの 24 ∴Pr Ya=0 = 1
= 2+8 20 = 0.5 ⋯ ☽

仮に全員がA=1だったとしたもの分子（Pr[=1 = 1]）の計算観察したもの 25 ∴Pr Ya=1 = 1
= 2+8 20 = 0.5 ⋯ ☀

Pseudo-population Pseudo-population と Inverse probability weighting 26 |L のもとでは、Pseudo-population の
associational risk ratio ともともとの populationの causal risk ratio は等しくなる。

Standardization と Inverse probability weighting • どちらの過程も、すべての人が治療 a を受けたときのシミュレーションを行っているとみなせる。
• Inverse probability weightingでは、共変量Lで条件づけたときの治療Aの条件付確率を使っている。 • Pr = | = • Standardizationでは、共変量Lの確率とAとLで条件づけたときのYの条件付確率を使っている。 • Pr = と Pr = 1 = , = 27

Fine Points 28

Fine Point 2.1 Crossover experiment • ライトニングボルトと血圧 • 同一の個人の異なるtreatment下でのアウトカムを比較するような研究デザインを
crossover experiment という。 29 yesterday today

Fine Point 2.1 Crossover experiment 個人の因果効果を同定するためには3つの仮定が必要。 1) Carryover effect がない。
=1 0 , 1 = =1 1 2) Individual causal effect が時間に寄らない。 =1 − =0 = , for t = 0, 1 3) 治療しない場合の counterfactual outcome が時間に寄らない。 =0 = , for t = 0, 1 30

Fine Point 2.1 Crossover experiment 1 − 0 = 1
1 =1 − 0 0 =0 ∵consistensy = 1 1 =1 − 1 1 =0 + 1 1 =0 − 0 0 =0 = + − ∵(2), (3) = 31

Fine Point 2.2 Risk Period • リスク：一定期間内にアウトカムを発生した人の割合。 • 期間が重要。例えば、
• 同じ研究データに基づいていても、 RR of 1 year = 0.05, RR of 100 year = 1 となることはあり得る。 • 治療が死亡に関して因果的効果を持つというのは、死亡を防ぐことを意味するのではなくて、死亡を遅らせることを意味する。 32

Technical Points 33

Technical Point 2.1 Full exchangeability and mean exchangeability • Full
exchangeability: The set of all treatment values: The set of all counterfactual outcome: • Mean exchangeability: E = = E[] • Exchangeability ならば mean exchangeability は成り立つが、逆は成り立たない（例えば、分散などの平均以外のパラメータが treatment と独立にならないため）。 34

Technical Point 2.1 Full exchangeability and mean exchangeability • 因みに、E
= E[| = ]を証明するにはmean exchangeabilityで十分。右辺 = E[| = ] = E | = ∵consistensy = E ∵mean exchangeability = 左辺 • アウトカムが二値変数の場合、exch/とmean exch/ は同じになる。 35

Technical Point 2.2 Formal definition of IP weights • IP
weights: = 1 • 確率ではなくて、確率密度関数で定義される。 • 離散変数では、 = Pr[ = | = ] • 各個人に対する重みがそれぞれの個人のA, Lの値で決まるため、ある特定の値a, lではなくて確率変数A, Lで表す。 • Pr[ = | = ]という表現が不適切なため、確率密度関数で定義する。 36

Technical Point 2.3 Equivalence of IP weighting and standardization Standardized
mean = Pr = 1 = , = Pr = IP weighted mean = E[ = [|] ] Standardized mean = IP weighted mean の証明。証明） IP weighted mean = E[ = [|] ] IP weighted mean = 1 [|] {E = , = Pr[ = ]} IP weighted mean = Pr = 1 = , = Pr = IP weighted mean = Standardized mean 37

Technical Point 2.3 Equivalence of IP weighting and standardization 更にconditional
exchangeabilityを仮定すれば IPW mean と Std/ mean が counterfactual outcome と同じになる。 Standardized meanについて E = Pr = Pr = E = Pr = , = Pr = ∵conditional exchangeability, positivity E = Pr = , = Pr = ∵consistency E = Standardized mean 38

Technical Point 2.3 Equivalence of IP weighting and standardization IPW
meanについて IP weighted mean = E[ = [|] ] IP weighted mean = E[ = [|] ] ∵consistensy IP weighted mean = E{E = | } ∵条件付期待値の公式 IP weighted mean = E{E = | E[|]} ∵condiditional exchangeability IP weighted mean = E{E | } ∵E = | = 1 IP weighted mean = E IP weighted mean = counterfactual outcome 39

Reference • Hernán MA, Robins JM (2020). Causal Inference: What
If. Boca Raton: Chapman & Hall/CRC. • 条件付期待値, 分散の意味と有名公式 from https://mathtrain.jp/condexpectation. • Icons made by Freepik, Those Icons and Smashicons from www.flaticon.com. 40

Randomized experiments (Causal inference: What ...

Randomized experiments (Causal inference: What if, Chapter 2)

More Decks by Shuntaro Sato

Other Decks in Science

Featured

Transcript