References

• Kato, M., Ariu, K., Imaizumi, M., Nomura, M., and Qin, C. (2022), “Best Arm Identification with a Fixed Budget under a Small Gap”

• Carpentier, A. and Locatelli, A. Tight (lower) bounds for the fixed budget best arm identification bandit problem. In COLT, 2016

• Glynn, P. and Juneja, S. A large deviations perspective on ordinal optimization. In Proceedings of the 2004 Winter Simulation Conference, volume 1. IEEE, 2004

• Kasy, M. and Sautmann, A. Adaptive treatment assignment in experiments for policy choice. Econometrica.

• Kaufmann, E., Cappé, O., and Garivier, A. (2016), “On the Complexity of Best-Arm Identification in Multi-Armed Bandit Models,” Journal of Machine Learning Research.

• Fan, X., Grama, I., and Liu, Q. (2013), “Cramér large deviation expansions for martingales under Bernstein’s condition,” Stochastic Processes and their Applications.

• Lai, T. and Robbins, H. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 1985

• Hirano, K. and Porter, J. R. Asymptotics for statistical treatment rules. Econometrica, 2009

• van der Vaart, A. Asymptotic Statistics. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 1998
