In the function "train" line 120, when creating a moving mnist factory, the parameter ctx_num is omitted, while the parameter batch_size is still divided by the number of gpus. This will cause dimension mismatch when running the file with multiple gpus.