版面检测模型微调后结果与标注不匹配
#16051
Replies: 1 comment
-
![]() |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
标注如下
label2coco后格式如下
{
"images": [
{
"height": 1191,
"width": 1684,
"id": 1,
"file_name": "jianhangliushui_5.png"
},
{
"height": 1191,
"width": 1684,
"id": 2,
"file_name": "jianhangliushui_20.png"
},
{
"height": 1191,
"width": 1684,
"id": 3,
"file_name": "jianhangliushui_8.png"
}
],
"annotations": [
{
"iscrowd": 0,
"image_id": 1,
"bbox": [
597.4794520547944,
22.897260273972567,
438.3561643835619,
32.87671232876713
],
......
}],
"categories": [
{
"id": 1,
"name": "figure_title",
"supercategory": "figure_title"
},
{
"id": 2,
"name": "text",
"supercategory": "text"
},
{
"id": 3,
"name": "table",
"supercategory": "table"
},
{
"id": 4,
"name": "seal",
"supercategory": "seal"
}
]
}
训练完成后结果如下
输出json如下
{
"input_path": "/home/mft/quan_train/banmianjiance/data_test/images/jianhangliushui_5.png",
"page_index": null,
"boxes": [
{
"cls_id": 5,
"label": "Table",
"score": 0.9108839631080627,
"coordinate": [
46.182682037353516,
100.04399108886719,
1604.726806640625,
1042.6273193359375
]
},
{
"cls_id": 4,
"label": "Figure caption",
"score": 0.5901358127593994,
"coordinate": [
596.763671875,
26.531038284301758,
1029.5452880859375,
48.09413146972656
]
}
]
}
标签文档label.txt如下

只使用了需要的标签
问题
输出结果中没有学到任何信息,甚至标签映射都发生了改变。
对PP-DocLayout-L进行微调,原始数据是通过labelme标注后通过labelme2coco转换得到。coco数据通过了数据检测。30张表格图训练40轮测试。
Beta Was this translation helpful? Give feedback.
All reactions