To investigate: do grammars that preclude leading spaces harm models that use space-prepending tokenizers?

When a user passes a grammar whose language is a subset of /\w+.*/ (that is, every accepted string starts with a word character), could this knock a model off distribution if it expects a space-prepending preprocessing step in the tokenizer?
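
A minimal sketch of the concern, assuming a SentencePiece-style vocabulary in which `▁` marks a leading space. The vocabulary and the names `TOY_VOCAB`, `decode`, and `allowed_first_tokens` are hypothetical and only for illustration; this is not a claim about how any particular library builds its token mask.

```python
import re

# Hypothetical toy vocabulary. "▁" stands for a leading space, as in
# SentencePiece-style tokenizers; real vocabularies are far larger.
TOY_VOCAB = ["▁Hello", "Hello", "▁world", "world", "▁", "!", "He", "llo"]

WORD_CHAR = re.compile(r"\w")


def decode(token: str) -> str:
    """Map the toy space marker back to a literal space."""
    return token.replace("▁", " ")


def allowed_first_tokens(vocab: list[str]) -> list[str]:
    """Return the tokens that may open a string accepted by the grammar.

    Every accepted string starts with a word character, so it is enough
    to check the first decoded character of each token.
    """
    return [tok for tok in vocab if WORD_CHAR.match(decode(tok))]


allowed = allowed_first_tokens(TOY_VOCAB)
masked = [tok for tok in TOY_VOCAB if tok not in allowed]

print(allowed)  # ['Hello', 'world', 'He', 'llo']
print(masked)   # ['▁Hello', '▁world', '▁', '!']
```

In this toy setup every space-prefixed token is masked at the first step. If a model usually opens its output with tokens like `▁Hello`, the grammar would force it onto a less familiar tokenization, which is the possible distribution shift in question.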

Came up in the course of #272