Lexing Unicode characters

Hi,

I am trying to lex Unicode characters.

Let's say I have this lexer rule:
```
\\u[0-9a-fA-F][0-9a-fA-F][0-9a-fA-F][0-9a-fA-F] "UNICODE-4"
```

This rule only matches an escaped Unicode sequence, e.g. `\u2081`, and not the raw (decoded) character itself, such as `₁`.
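
To make the distinction concrete, here is a minimal standalone Rust sketch of what I mean. It does not use grmtools at all; `parse_unicode4` is a hypothetical helper I made up for illustration. It accepts only the six-character escaped form and decodes it to the character it stands for, while the raw character is rejected:

```rust
/// Hypothetical helper (not part of grmtools): returns the decoded character
/// if `s` is exactly a backslash, a `u`, and four hex digits.
fn parse_unicode4(s: &str) -> Option<char> {
    // The escaped form starts with a literal backslash followed by `u`.
    let hex = s.strip_prefix("\\u")?;
    // Exactly four ASCII hex digits must follow; `|` is not one of them.
    if hex.len() != 4 || !hex.chars().all(|c| c.is_ascii_hexdigit()) {
        return None;
    }
    // Convert the hex digits to a code point, then to a `char`
    // (this yields `None` for surrogate code points).
    let code_point = u32::from_str_radix(hex, 16).ok()?;
    char::from_u32(code_point)
}
```

With this, `parse_unicode4("\\u2081")` gives `Some('₁')`, whereas `parse_unicode4("₁")` gives `None`, because the raw input is a single character (U+2081) rather than a backslash escape.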

Does grmtools have any support for lexing raw Unicode characters?