Add HmmTreeBuilder #154

larissakl · 2025-10-20T06:54:59Z

Adds a tree builder for the HMM topology that, unlike the old tree builders ("minimized-hmm" and "classic-hmm"), is compatible with the new TreeTimesyncBeamSearch.

For monophones, the tree is very simple:

To get this tree, the phoneme variation in the lexicon needs to be set to "none". Setting it to "context" works as well, but produces unnecessary root states:

For diphones (diphone-dense state tying), the tree looks as follows:

Triphones are not supported. Depends on #127 as I am using the shared base class here as well.

I did one test run with the TreeTimesyncBeamSearch and there are no obvious errors. However, I still observe a degradation from 5.8% to 6.2%, which could have multiple reasons. Therefore, some considerations for future work:
To get competitive results, there may still be some details we need to implement. Most importantly, we need an exit penalty. This might be easy to add to the search, but we should probably allow different exit penalties for words and non-words (silence/unknown). Furthermore, the TransitionLabelScorer makes it easy to define TDPs, but again we should be able to set different TDPs for non-words, so we would need more transition types (especially silence transitions) and small adjustments in the search algorithms.
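
To make the idea of separate penalties concrete, here is a rough sketch in Python rather than in the code base's own language. It is not part of this PR and does not reflect the actual search or TransitionLabelScorer interfaces. The names `TransitionType`, `PenaltyConfig` and `path_score` are hypothetical, and the split into four transition types is only an assumption about what "more transition types" could look like.

```python
from dataclasses import dataclass
from enum import Enum, auto


class TransitionType(Enum):
    # Hypothetical transition types, named for illustration only.
    LOOP = auto()
    FORWARD = auto()
    SILENCE_LOOP = auto()
    SILENCE_FORWARD = auto()


@dataclass(frozen=True)
class PenaltyConfig:
    # Costs are additive penalties; all concrete values are up to the user.
    tdps: dict            # maps TransitionType -> penalty
    word_exit: float      # exit penalty for regular words
    non_word_exit: float  # exit penalty for silence/unknown

    def transition_penalty(self, transition: TransitionType) -> float:
        return self.tdps[transition]

    def exit_penalty(self, is_non_word: bool) -> float:
        return self.non_word_exit if is_non_word else self.word_exit


def path_score(config: PenaltyConfig, transitions, is_non_word: bool) -> float:
    # Sum the TDPs along one lemma's state path, then add the exit
    # penalty that matches the lemma type.
    tdp_sum = sum(config.transition_penalty(t) for t in transitions)
    return tdp_sum + config.exit_penalty(is_non_word)
```

With such a table, a silence path would use `SILENCE_LOOP` and `SILENCE_FORWARD` plus `non_word_exit`, so its penalties can be tuned independently of those for words.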

Add HmmTreeBuilder

3466fb5

larissakl requested review from SimBe195 and curufinwe October 20, 2025 06:55
