Skip to content
Discussion options

You must be logged in to vote

min_silence_duration_ms

One approach would be to try increasing this.
If you domain has quite long utterances separated by long silences, it can achieve your goal.

Or is post-processing (manually merging adjacent segments) the recommended approach in this case?

Probably yes.
You see, if there is a small silence between some speech, and we merge it - we kind of lose information, hence we do not do it.
If we enforce minimal speech length and there is no proper speech of such length, we will be either deleting information or introducing bias.

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@NenarTu
Comment options

Answer selected by NenarTu
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants