Text Layout / Reading Order detection #13016
-
Hello, I was wondering if anyone tried to detect text document layout. Think a newspaper article or ACL paper copied/pasted in an ASCII file. Example:
When simply read as a text, line by line, the document will be interpreted as
Obviously, the desired text is:
There are a lot of papers describing this task as part of OCR, i.e. detecting text layout on images. I am looking for resources on how to approach this on the text data. Any insights will be really appreciated. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
Hi, that's a fairly general question about NLP methods - we try to keep this support forum focused on specific questions about spaCy and its related libraries. |
Beta Was this translation helpful? Give feedback.
Hi, that's a fairly general question about NLP methods - we try to keep this support forum focused on specific questions about spaCy and its related libraries.