Hi,
Thanks a lot for the code. I have a question: decoder_input_ids is built by removing the last value from the output list, which is the end token. But for outputs shorter than the maximum length, you pad with a pad ID of 1, so the last value of such an output is the pad ID and not the end token. In that case, truncating the last value only removes a pad ID and leaves the end token in place. Is that how it should be? I hope I have understood this correctly.
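To make the question concrete, here is a minimal sketch of what I think happens. It is not taken from the repository: the start and end token IDs (0 and 2), the helper names `pad_to` and `shift`, and the maximum length of 5 are all assumptions for illustration. Only the pad ID of 1 comes from the code.

```python
PAD_ID = 1  # pad ID used in the code
BOS_ID = 0  # assumed start-token ID (hypothetical)
EOS_ID = 2  # assumed end-token ID (hypothetical)


def pad_to(tokens, max_len, pad_id=PAD_ID):
    """Right-pad a token list with pad_id up to max_len (hypothetical helper)."""
    return tokens + [pad_id] * (max_len - len(tokens))


def shift(seq):
    """Drop the last position for the decoder input, the first for the labels."""
    return seq[:-1], seq[1:]


# An output that fills the maximum length: its last value is the end token.
full = [BOS_ID, 11, 12, 13, EOS_ID]
# A shorter output: its last value after padding is the pad ID.
short = pad_to([BOS_ID, 21, EOS_ID], max_len=5)

full_in, full_labels = shift(full)
short_in, short_labels = shift(short)

print(full_in)   # [0, 11, 12, 13]  -> end token removed
print(short_in)  # [0, 21, 2, 1]    -> only one pad ID removed, end token kept
```

So for the padded example the decoder input still contains the end token, followed by padding, while the full-length example loses it.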
Thanks a lot.