Skip to content

Bold Tokens not being Detected #3940

@rudra0713

Description

@rudra0713

Hi, thanks again for the beautiful work on
Sample_tables_rh.pdf
PDF documents.
I am facing an issue regarding the detection of bold tokens in the following document. Please check page 3 of the document, you will see tokens like Assets, Liabilities, etc in the first table in bold. The document was created in Latex and the \textbf{} command was used to mark bold tokens. However, when I check the spans and the flags property all tokens are detected as not bold. I used the following code to determine such properties:

is_bold = bool(span['flags'] & 2) 
is_italic = bool(span['flags'] & 1)  

Please, let me know if this is an issue with the document or my code.
Thanks in advance.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions