摘要: DocKylin: A Large Multimodal Model for Visual Document Understanding with Efficient Visual Slimming http://arxiv.org/abs/2406.19101 现存的文档理解多模态模型面临3个主要 阅读全文
posted @ 2024-08-27 17:29 Big-Yellow-J 阅读(120) 评论(0) 推荐(0)
levels of contents