Possible issue with segment timestamps in prompt #1723
Unanswered · funboarder13920 asked this question in Q&A
Replies: 1 comment
-
From the answer provided here (#838 (comment)), I guess the timestamps in the prompt during inference do not correspond to the process applied during training. During inference:
-
Hello,
I am wondering if the way the prompt is built during the inference is aligned with the prompt from the training.
During inference, segment timestamps from the decoding are propagated into the prompt: the decoded tokens, including timestamp tokens, are appended to the all_tokens history. Once appended, those timestamps no longer mean much; for example, they can end up out of order, which the model may never have seen during training and which could be an issue. It is also possible that the prompts in the training data contain no timestamps at all, since they are not really necessary there.
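To make the concern concrete, here is a minimal sketch (not the actual openai/whisper code; the token format and the `all_tokens` name are only modeled on it) showing how timestamps that restart at 0 in every 30 s window become non-monotonic once decoded tokens are appended to the running history used as the next prompt:

```python
def timestamp_token(seconds: float) -> str:
    # Whisper encodes timestamps as special tokens like <|4.20|>,
    # relative to the start of the current 30 s window.
    return f"<|{seconds:.2f}|>"

all_tokens = []  # running history, analogous to `all_tokens` in transcribe()

# Two consecutive 30 s windows; timestamps restart at 0.00 in each window.
for window in range(2):
    decoded = [timestamp_token(0.0), "hello", "world", timestamp_token(4.2)]
    all_tokens.extend(decoded)

# The prompt for the next window is a suffix of this history, so it can
# contain out-of-order timestamps (<|4.20|> followed by <|0.00|>).
print(all_tokens)
```

The printed history contains `<|4.20|>` immediately followed by `<|0.00|>`, which is the out-of-order pattern described above.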
I didn't find anything in the code that would strip the timestamps from the prompt.
The OpenAI Whisper paper does not go into details about the training data.
Do you have any insights into the format of the training data, especially regarding the prompt?
Best,