-
Notifications
You must be signed in to change notification settings - Fork 1
Description
Need to define the best compromise between Line and Zone extraction in order to better annotate my manuscript(s).
3 of my pages pose a significant layout problem where the order of the text is perturbated and blocks of text need to be read vertically instead of horizontally.
Currently, I am assigning a new MainZone#n to bits of texts whose reading order I want to establish/redirect such as columns. It is important to keep this information as it can be useful for the transmission of the text.
Examples:
Leyden,VossLat041,folios 4v and 5r
The problem occurs when I try to extract the main text that lies in the DefaultLines of the multiple ordered MainZones and the glosses that correspond to each line of each zone.
I need the text to appear in that specific order, given that in my fiche de récollement I give for each gloss the exact line (after ordering) of the lemma associated with it.
I need a text_extraction code that extracts the DefaultLines of every MainZone in an ordered fashion (MainZone, MainZone#1-6) and then does the same for the InterlinearLines, in order to (at some point, somehow) tag automatically and associate the lemmas and the glosses following the info in my fiche.


