-
Notifications
You must be signed in to change notification settings - Fork 157
Issues with whitespace definition #361
Copy link
Copy link
Open
Description
Neither Java's Character.isWhitespace(int), or Character.isSpaceChar(int), or the unicode [:White_Space:] specification treats \u180E (MONGOLIAN VOWEL SEPARATOR) as a whitespace.
Yet the openCypher grammar considers this a whitespace character, why?
openCypher/grammar/basic-grammar.xml
Line 781 in 346aa0d
| <literal value="᠎"/> <!-- MONGOLIAN VOWEL SEPARATOR --> |
Furthermore the definition of whitespace in the openCypher grammar does not consider \u0085 (NEXT LINE) to be whitespace, while it is part of the unicode [:White_Space:] specification. Perhaps that should be added? (it is not considered a whitespace by either Character.isWhitespace(int) or Character.isSpaceChar(int), which explains why it is not in the grammar).
Reactions are currently unavailable