Slide 11
Slide 11 text
• Luedtke, A. R. and van der Laan, M. J. Statistical inference for the mean outcome under a
possibly non-unique optimal treatment strategy. Annals of statistics, 2016.
• Balsubramani, A. and Ramdas, A. Sequential non- parametric testing with the law of the
iterated logarithm. In UAI, 2016.
• Kaufmann, E., Cappé, O., and Garivier, A. On the complexity of best-arm identification in
multi-armed bandit models. JMLR, 2016.
• Zhao, S., Zhou, E., Sabharwal, A., and Ermon, S. Adaptive concentration inequalities for
sequential decision problems. In NeurIPS, pp. 1343–1351. Cur- ran Associates, Inc., 2016.
• Chernozhukov, V., Chetverikov, D., Demirer, M., Duflo, E., Hansen, C., Newey, W., and Robins,
J. Double/debiased machine learning for treatment and structural parameters. Econometrics
Journal, 21: C1–C68, 2018.
• Hadad, V., Hirshberg, D. A., Zhan, R., Wager, S., and Athey, S. Confidence intervals for policy
evaluation in adaptive experiments. arXiv preprint arXiv:1911.02768, 2019.
Reference 11