Pre-filter words whose diacritic forms are not in the dictionary #15

Before asking the model to do automatic diacritic restoration (ADR), pre-filter words whose non-diacritized word-forms are not in the dictionary. This way we get more predictable results and clearer error messages for out-of-vocabulary (OOV) words.

If the model sees a word like `elerindodo`, validate that this word's diacritic form exists in the dictionary and return an error message if it doesn't. Since the model doesn't know about `elerindodo`, it can then just say so, rather than confusing users by returning the "top probability word", which may be something unrelated like `aláǹtakùn`.
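
One way the pre-filter could look is sketched below. This is only an illustration under some assumptions: the dictionary is available as a plain list of diacritized words, input is whitespace-tokenized, and `restore` stands in for whatever function actually calls the model. All names here (`strip_diacritics`, `build_index`, `find_oov`, `restore_or_reject`, `OutOfVocabularyError`) are hypothetical and not part of the existing code.

```python
import unicodedata


class OutOfVocabularyError(ValueError):
    """Raised when the input contains words the dictionary does not cover."""


def strip_diacritics(word: str) -> str:
    # Decompose to NFD so tone marks and under-dots become separate
    # combining characters, then drop every combining character.
    decomposed = unicodedata.normalize("NFD", word)
    base = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return unicodedata.normalize("NFC", base).lower()


def build_index(dictionary_words):
    # Set of non-diacritized forms of every dictionary entry.
    return {strip_diacritics(w) for w in dictionary_words}


def find_oov(text: str, index) -> list:
    # Assumption: simple whitespace tokenization, no punctuation handling.
    return [tok for tok in text.split() if strip_diacritics(tok) not in index]


def restore_or_reject(text: str, index, restore):
    # Check the dictionary first; only call the model when every word is known.
    oov = find_oov(text, index)
    if oov:
        raise OutOfVocabularyError(
            "Not in dictionary: " + ", ".join(oov)
        )
    return restore(text)
```

With an index built from a dictionary that contains `aláǹtakùn` but nothing matching `elerindodo`, calling `restore_or_reject("elerindodo", index, restore)` would raise `OutOfVocabularyError` before the model is ever asked, so the user gets an explicit message instead of an unrelated prediction.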

