You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
某些情况下,layout模型没有将tablecaption和tablefootnote识别成正确的类型
比如你提供的两个例子:
在这个例子中
Table 6: Experimental results for MRC task. 被识别为table capiton
Table 7: Experimental results for PI task. 被识别为table footnote
后处理逻辑仅根据table的坐标和对应的描述区块坐标进行就近关联,且遵循一般通用逻辑,table caption在table body上方,table footnote在table body下方,因此将这两条文本都与下方table进行关联
在这个例子中
Table 8: The effect of different data augmentation ways for QQP in terms of F1-score. 被识别为table caption
而 Table 9真正的caption被识别成text block
因此后处理逻辑会将table8的capiton和下方的图片进行关联,而table9的capiton则被合进了正文
Description of the bug | 错误描述
--问题概述--
当一个图像上下同时出现图像名称标注时几乎一定会错误判断对应的caption和footnote,从而影响后续caption和footnote的结果。经过测试这一问题并不只针对单一文档,在有类似排版的文档中都出现了。这个问题相当影响caption和footnote的可用性,希望在后续版本中修正这一问题,谢谢!
--详细信息--
返回结果的相关部分如下:
{
"type": "table",
"img_path": "images/51c5210ce9b4d027f9d58183d87117e381a24066f79d6f188828ba2a83738947.jpg",
"table_caption": [
"Table 6: Experimental results for MRC task. "
],
"table_footnote": [
"Table 7: Experimental results for PI task. "
],
"page_idx": 6
},
原始pdf的截图:
返回结果的相关部分如下:
{
"type": "table",
"img_path": "images/8b29d8e85fd2e2b5aaa96cfd93c72fd4c3c071b22723f0d00a387549c8b321b2.jpg",
"table_caption": [],
"table_footnote": [],
"page_idx": 7
},
{
"type": "table",
"img_path": "images/eb6762be7264abc475e3515060df93271dde0f89e19aea59c74da698416c6b7a.jpg",
"table_caption": [
"Table 8: The effect of different data augmentation ways for QQP in terms of F1-score. "
],
"table_footnote": [],
"page_idx": 7
}
原始pdf的截图:
How to reproduce the bug | 如何复现
正常运行即可。这样排版的论文很多,这里只提供两个文件作为参考。
文件1
2020.acl-main.45.pdf
文件2
P19-1416.pdf
Operating system | 操作系统
Linux
Python version | Python 版本
3.10
Software version | 软件版本 (magic-pdf --version)
0.9.x
Device mode | 设备模式
cuda
The text was updated successfully, but these errors were encountered: