-The general philosophy is "you show us what the output should look like, and the Speech service figures out how people say it." But this doesn't always work because some scenarios might have quirky unpredictable ways of saying things, or the Speech service background rules might have gaps. For example, there can be colloquial pronunciations for initials and acronyms. For example, `ZPI` might be spoken as `zippy`. In this case, a pattern like `ZPI-\d\d` is unlikely to work if a user says `zippy twenty two`. For this sort of situation, there's a display text format notation `{spoken>written}`. This particular case could be written out `{zippy>ZPI}-\d\d`.
0 commit comments