-
Notifications
You must be signed in to change notification settings - Fork 2
Open
Description
Losing the original text? Is it the right thing to do?
((kind "wf")
(form . "two")
(lemmas "two")
(tag . "ignore")
(meta
(sep . "-")
(type . "num")))we do have the sep for produze the original text. Question is:
- is it easier to have the text tokenized in the buffer?
- should we not distinguish between spaces and other separators?
Remember that default sep is space, so when a token doesn't have sep it is assumed sep=" ". See confusing explanation in https://github.com/own-pt/glosstag/blob/princeton/dtd/glosstag.dtd#L158-L161 for the glosstag corpus !!
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels