Text Recognition Model Comparisons? Dataset and Model Analysis,” in Proc. ICCV, 2019, pp. 4714–4722. [Chen+ 2020] Chen et al., “A Simple Framework for Contrastive Learning of Visual Representations,” in Proc. ICML, PMLR 119, 2020, pp. 1597–1607. [Aberdam+ 2021] Aberdam et al., “Sequence-to-Sequence Contrastive Learning for Text Recognition,” in Proc. CVPR, 2021, pp. 15302–15312. [Chen+ 2021] Chen et al., “Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical Study,” arXiv:2112.15093, 2021. [He+ 2022] He et al., “Masked Autoencoders Are Scalable Vision Learners,” in Proc. CVPR, 2022, pp. 16000–16009. [Yang+ 2022] Yang et al., “Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition,” in Proc. ACM MM, 2022, pp. 4214–4223. 参考文献