The reported performance cannot be reproduced unless I use the pre-trained model. If I train the model from scratch, the performance never reaches the reported ones.
Can you share random seeds/exact setting that you chose to produce the reported performance?