Commit df7a3d8
Fix whisper timestamp extraction for tokenizers with added tokens (#804)
* support user defined tokens by bounding timestamp token if statement
* Update src/tokenizers.js
Co-authored-by: Joshua Lochner <[email protected]>
* calculate timestamp_end instead of hardcoding
* Update tokenizers.js
* Merge conflict resolution
---------
Co-authored-by: Joshua Lochner <[email protected]>1 parent 7dffb9a commit df7a3d8
1 file changed
+6
-1
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3583 | 3583 | | |
3584 | 3584 | | |
3585 | 3585 | | |
| 3586 | + | |
| 3587 | + | |
| 3588 | + | |
| 3589 | + | |
| 3590 | + | |
3586 | 3591 | | |
3587 | 3592 | | |
3588 | 3593 | | |
| |||
3670 | 3675 | | |
3671 | 3676 | | |
3672 | 3677 | | |
3673 | | - | |
| 3678 | + | |
3674 | 3679 | | |
3675 | 3680 | | |
3676 | 3681 | | |
| |||
0 commit comments