How does Docling assign <loc_xxx> tags in export_to_doctags()? Method or formula? #2033
Replies: 1 comment
-
The If two elements have identical or overlapping bounding boxes, they may get the same loc tag—this is common for text blocks and tables that share provenance or spatial coordinates. You can influence how loc tags are computed by modifying the provenance bounding boxes ( The tagging logic is implemented in:
You can see the exact logic and adjust parameters as needed in these files. To reply, just mention @dosu. How did I do? Good | Irrelevant | Incorrect | Verbose | Hallucination | Report 🐛 | Other |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi team,
When using 'DocumentConverter' --> 'export_to_doctags()', I get tags like '<loc_80>', '<loc_220>', etc.
Could you please explain how these loc numbers are assigned?
– Are they based on row numbers, text order, coordinates, or something else?
– Why do some text blocks and tables get the same loc?
– Is there a way to influence or read how they are computed?
Also, is there a link or file in the docling codebase where this tagging happens?
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions