I think the resnet50 baseline from torchvision (23.85% top-1 error) is trained for 100 epochs instead of 90.