Slide 33
© 2023 LayerX Inc.
References
[1] EasyOCR, https://github.com/JaidedAI/EasyOCR
[2] PubLayNet: largest dataset ever for document layout analysis, https://arxiv.org/abs/1908.07836
[3] Document AI: Benchmarks, Models and Applications, https://arxiv.org/abs/2111.08609
[4] LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking, https://arxiv.org/abs/2204.08387
[5] Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer, https://arxiv.org/abs/1910.10683
[6] InfographicVQA, https://arxiv.org/abs/2104.12756