Hi there, do you know how I can I use one of the two techniques above to do image classification on "Stanford Dogs Dataset"?
I've already tried the "B_16_imagenet1k" model but the accuracy obtained on 4.160 images isn't that good.
I saw that the difference between B_16 and L_16 is in the model parameters so even in the structure of the network.
I didn't focus on it: can you explain it? Do you know where can I read about it?