@larissakl
Contributor

Adds a treebuilder for the HMM topology that, in contrast to the old treebuilders ("minimized-hmm" and "classic-hmm"), is compatible with the new TreeTimesyncBeamSearch.

For monophones, the tree is very simple:
[Image: tree_monophone_1_noctx]

To get this tree, the variation of the phonemes in the lexicon needs to be set to "none". If it is set to "context", the treebuilder still works, but the resulting tree has unnecessary root states:
[Image: tree_monophone_1]
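
For reference, here is a minimal sketch of how the phoneme variation could be switched to "none" with a small script. It assumes that the lexicon is an XML file in which each `<phoneme>` element carries a `<symbol>` and a `<variation>` child. The miniature lexicon and the helper `set_variation_none` are hypothetical and only serve as illustration, so adapt them to your actual lexicon file.

```python
import xml.etree.ElementTree as ET

# Hypothetical miniature lexicon. The element layout (phoneme-inventory,
# phoneme, symbol, variation) is an assumption made for this sketch.
LEXICON_XML = """<lexicon>
  <phoneme-inventory>
    <phoneme><symbol>a</symbol><variation>context</variation></phoneme>
    <phoneme><symbol>b</symbol><variation>context</variation></phoneme>
    <phoneme><symbol>[SILENCE]</symbol><variation>none</variation></phoneme>
  </phoneme-inventory>
</lexicon>"""


def set_variation_none(root):
    """Set the variation of every phoneme to "none"; return how many changed."""
    changed = 0
    for phoneme in root.iter("phoneme"):
        variation = phoneme.find("variation")
        if variation is None:
            # No variation given: add the element explicitly.
            variation = ET.SubElement(phoneme, "variation")
        if variation.text != "none":
            variation.text = "none"
            changed += 1
    return changed


root = ET.fromstring(LEXICON_XML)
num_changed = set_variation_none(root)
variations = [p.findtext("variation") for p in root.iter("phoneme")]
print(num_changed, variations)
```

For a lexicon on disk, `ET.parse(path)` followed by `tree.write(path)` would replace the in-memory string used here.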

For diphones (diphone-dense state tying), the tree looks as follows:
[Image: tree_diphone_1]

Triphones are not supported. This PR depends on #127, since it uses the shared base class as well.

I did one test run with the TreeTimesyncBeamSearch and saw no obvious errors. However, I still observe a degradation from 5.8% to 6.2%, which could have multiple causes. Therefore, some considerations for future work:
To get competitive results, some details may still need to be implemented. Most importantly, we need an exit penalty. That might be easy to add to the search, but we should probably allow different exit penalties for words and non-words (silence/unknown). Furthermore, the TransitionLabelScorer makes it easy to define TDPs, but here too we should be able to set different TDPs for non-words. That would require more transition types (especially silence transitions) and small adjustments to the search algorithms.
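
To make the idea concrete, here is a rough sketch of how separate penalties for words and non-words could be looked up. It is not code from this PR or from the search implementation. All names (`ExitPenalties`, `NON_WORD_LEMMAS`, `score_word_end`, `transition_penalty`, the transition kinds "loop" and "forward") and all numeric values are hypothetical. Scores are assumed to be negative log-probabilities, so penalties are added.

```python
from dataclasses import dataclass

# Hypothetical set of non-word lemmas (silence/unknown).
NON_WORD_LEMMAS = frozenset({"[SILENCE]", "[UNKNOWN]"})


@dataclass(frozen=True)
class ExitPenalties:
    """Exit penalties in negative-log space (assumed convention)."""
    word: float
    non_word: float


def exit_penalty(lemma, penalties):
    """Pick the exit penalty depending on whether the lemma is a non-word."""
    return penalties.non_word if lemma in NON_WORD_LEMMAS else penalties.word


def score_word_end(hyp_score, lemma, penalties):
    """Add the matching exit penalty when a hypothesis leaves a word end."""
    return hyp_score + exit_penalty(lemma, penalties)


# Hypothetical TDP table keyed by (transition kind, is_silence), so that
# silence states can use different values than regular states.
TDP_TABLE = {
    ("loop", False): 1.0,
    ("forward", False): 0.0,
    ("loop", True): 0.25,
    ("forward", True): 3.0,
}


def transition_penalty(kind, is_silence):
    """Look up the TDP for a transition kind, separately for silence."""
    return TDP_TABLE[(kind, is_silence)]


penalties = ExitPenalties(word=2.0, non_word=0.5)
word_score = score_word_end(10.0, "hello", penalties)
silence_score = score_word_end(10.0, "[SILENCE]", penalties)
print(word_score, silence_score, transition_penalty("loop", True))
```

In the actual search, the silence distinction would probably have to be encoded as additional transition types rather than as a boolean flag, which is where the small adjustments to the search algorithms come in.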

2 participants