PPOCRLABEL generate gt.txt for table recognition training dataset #12382
Replies: 2 comments
-
The annotation of table recognition consists of two parts: box and html structures.
|
Beta Was this translation helpful? Give feedback.
-
@tink2123 Thank you in advance. Can we train a model to recognize table structure(html format) in images like the ones above? If so, how do we annotate the data when a cell has multiple vertical alignments? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
hello thank you for making this beautiful repo. i have a question related PPOCRlabel.

i already annoted some table and when i see the result from auto generated gt.txt, i got some html tag its not right. especially for the merge cell.
i'm annoted the data like this:
it is right ?
i got the result like this:

This is not similar to the table in the picture.
i expect the result should be like this:

Did I make a mistake during annotation? Or do you have to correct it manually even though you are already using PPOCRLABEL?
thank you
Beta Was this translation helpful? Give feedback.
All reactions