-
Notifications
You must be signed in to change notification settings - Fork 5
Open
Description
How should I hadnle the situations where pdf_to_text has removed the square braces from references?
e.g.: 2004-46-NATURE-C01-mention:
The molecule is rich in proline residues (13%) and analysis of its amino acid sequence with PONDR21 indicates that in the absence of other viral components at least the N-terminal half of the subunit would be disordered.
Should I do anything to indicate that here the 21 was actually a reference? I don't know that stemming is going to work because some software does end in numbers. I wonder whether some tweak to the pdf_to_txt code might help us here?
Metadata
Metadata
Assignees
Labels
No labels