Thanks for your excellent work. I found that the data load speed is very slow when training the shuffleNetV2-0.5X. The GPU is basically not working. I use one SSD and 10 workers to load data.
Normally the data load for a batch of 256 requires less than 0.1s, but it becomes more than 10s now. It seems that GPU is much more fast than the data loading process. I wonder if you have solution for this problem? Thanks.