Commit 75db936
whisper-cli : align token timestamps with VAD ts
This commit aligns the token timestamps with the VAD timestamps when VAD
is enabled.
The motivation is that the token timestamps currently reported in the
full JSON output are the timestamps that whisper sees after VAD has
processed the audio. Whisper only sees the (possibly filtered) audio, so
the token timestamps are relative to the filtered audio, not the
original audio. The segment timestamps are already mapped back to the
original timestamps, but this is not currently done for the token
timestamps, which is what this commit addresses.
Resolves: #31741
parent e05af24, commit 75db936
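
To illustrate the kind of mapping described above, here is a minimal sketch of translating a timestamp on the VAD-filtered timeline back to the original audio. It is not the code from this commit: the `SpeechSpan` struct and the `map_to_original` function are hypothetical names made up for this example, and it assumes that each retained speech span records its position in both the original and the filtered audio, with all times in the same integer unit.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical record of one speech span kept by VAD (not a whisper.cpp type).
// All times share one integer unit (assumed here, e.g. 10 ms ticks).
struct SpeechSpan {
    std::int64_t orig_start; // start of the span in the original audio
    std::int64_t orig_end;   // end of the span in the original audio
    std::int64_t vad_start;  // start of the span in the filtered audio
    std::int64_t vad_end;    // end of the span in the filtered audio
};

// Map a timestamp on the filtered timeline back to the original timeline.
// Spans are assumed to be sorted by vad_start and non-overlapping.
std::int64_t map_to_original(const std::vector<SpeechSpan> & spans, std::int64_t t_vad) {
    if (spans.empty()) {
        return t_vad; // nothing was filtered, timelines are identical
    }
    for (const auto & s : spans) {
        if (t_vad < s.vad_start) {
            // Timestamp falls before this span on the filtered timeline:
            // snap to the start of the span in the original audio.
            return s.orig_start;
        }
        if (t_vad <= s.vad_end) {
            const std::int64_t vad_len  = s.vad_end  - s.vad_start;
            const std::int64_t orig_len = s.orig_end - s.orig_start;
            if (vad_len <= 0) {
                return s.orig_start; // degenerate span, avoid division by zero
            }
            // Linear interpolation inside the span.
            return s.orig_start + (t_vad - s.vad_start) * orig_len / vad_len;
        }
    }
    // Past the last span: clamp to the end of the last speech span.
    return spans.back().orig_end;
}
```

With two spans covering 100..200 and 500..650 in the original audio, stored back to back as 0..100 and 100..250 in the filtered audio, a token at filtered time 150 would map to 550 on the original timeline. Applying the same function to both the start and end time of each token would keep token timestamps consistent with the already-aligned segment timestamps.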
3 files changed, +13 -1 lines changed

Changed files are listed in order of appearance; file names and diff line contents are not available.

| File | Original lines | New lines | Added | Removed |
|---|---|---|---|---|
| 1 | 727-733 | 727-739 | 7 | 1 |
| 2 | 712-717 | 712-719 | 2 | 0 |
| 3 | 7967-7972 | 7967-7976 | 4 | 0 |