and Qin, C. (2022), โBest Arm Identification with a Fixed Budget under a Small Gapโ โข Carpentier, A. and Locatelli, A. Tight (lower) bounds for the fixed budget best arm identification388 bandit problem. In COLT, 2016 โข Glynn, P. and Juneja, S. A large deviations perspective on ordinal optimization. In Proceedings of the 2004 Winter Simulation Conference, volume 1. IEEE, 2004 โข Kasy, M. and Sautmann, A. Adaptive treatment assignment in experiments for policy choice. Econometrica. โข Kaufmann, E., Cappe, O., and Garivier, A. (2016), โOn the Complexity of Best-Arm Identification in Multi-Armed ยด Bandit Models,โ Journal of Machine Learning Research. โข Fan, X., Grama, I., and Liu, Q. (2013), โCramer large deviation expansions for martingales under Bernsteinโs condition,โ Stochastic Processes and their Applications. โข Lai, T. and Robbins, H. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics, 1985 โข Hirano, K. and Porter, J. R. Asymptotics for statistical treatment rules. Econometrica, 2009 โข van der Vaart, A. Asymptotic Statistics. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 1998 25