Slide 31
Slide 31 text
Donut
Geewook Kim, Teakgyu Hong, Moonbin Yim, Jeongyeon Nam, Jinyoung Park, Jinyeong Yim,
Wonseok Hwang, Sangdoo Yun, Dongyoon Han, Seunghyun Park (2021). OCR-free
Document Understanding Transformer. https://arxiv.org/abs/2111.15664
SegFormer
Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, Jose M. Alvarez, Ping Luo (2021).
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers.
https://arxiv.org/abs/2105.15203
DocILE
Štěpán Šimsa, Milan Šulc, Michal Uřičář, Yash Patel, Ahmed Hamdi, Matěj Kocián, Matyáš
Skalický, Jiří Matas, Antoine Doucet, Mickaël Coustaty, Dimosthenis Karatzas (2023). DocILE
Benchmark for Document Information Localization and Extraction.
https://arxiv.org/abs/2302.05658
Appendix: Citation