In my test, mobilevit_xs process one image will cost 113ms, which is much larger than the value in the paper.