Gupta, and A. Zisserman, “Adaptive Text Recognition Through Visual Matching,” in Computer Vision – ECCV 2020, 2020, pp. 51–67.

[B. Shi+ TPAMI2017] B. Shi, X. Bai, and C. Yao, “An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 11, pp. 2298–2304, Nov. 2017.

[B. Shi+ CVPR2016] B. Shi, X. Wang, P. Lyu, C. Yao, and X. Bai, “Robust scene text recognition with automatic rectification,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 4168–4176.

[B. Shi+ TPAMI2019] B. Shi, M. Yang, X. Wang, P. Lyu, C. Yao, and X. Bai, “ASTER: An Attentional Scene Text Recognizer with Flexible Rectification,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 41, no. 9, pp. 2035–2048, Sep. 2019.

[J. Baek+ CVPR2019] J. Baek et al., “What is wrong with scene text recognition model comparisons? Dataset and model analysis,” in Proceedings of the IEEE International Conference on Computer Vision, 2019, pp. 4715–4723.

[M. Jaderberg+ NeurIPSW2014] M. Jaderberg, K. Simonyan, A. Vedaldi, and A. Zisserman, “Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition,” arXiv [cs.CV], Jun. 09, 2014.

[A. Gupta+ CVPR2016] A. Gupta, A. Vedaldi, and A. Zisserman, “Synthetic data for text localisation in natural images,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016, pp. 2315–2324.

[L. Vincent + ICDAR2007] L. Vincent, “Google Book Search: Document understanding on a massive scale,” in Proc. Ninth International Conference on Document Analysis and Recognition (ICDAR), Washington, DC, 2007, pp. 819–823.

[JS Chung+ ACCV2016] J. S. Chung and A. Zisserman, “Lip Reading in the Wild,” in Computer Vision – ACCV 2016, 2017, pp. 87–103.

[B. M. Lake + ACCV2016] B. M. Lake, R. Salakhutdinov, and J. B. Tenenbaum, “Human-level concept learning through probabilistic program induction,” Science, vol. 350, no. 6266, pp. 1332–1338, Dec. 2015.