Multi-GPU training for object detection #7527
Unanswered
Rajesh-ParaxialTech
asked this question in
Q&A
Replies: 1 comment 2 replies
-
Hi @Rajesh-ParaxialTech, thanks for your interest here.
I didn't fully get your question, what do you want to incorparate? Do you mean how to ensure your pipeline can using DDP?
Hope it helps, thanks. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello
Thanks a lot for the nice GPU training concept and support. Currently I have been training an object detection model using the "Single Node multi-gpu " mode. I have tried to incorporate the features mentioned in the Brats segmentation (in my case I am trying the object detection model training) in my code. But couldnt find a way to incorporate the following:
Please let me know where i can incorporate in the code. My feeling is that this is to be incorporated somewhere in this file ".local/lib/python3.10/site-packages/monai/bundle/reference_resolver.py" of the pipeline used in the object detection tasks. At this point the config_items are evaluated and my object detection model (i.e the retinanet detector training) is exceuted with data, I think.
Without using the statement (mentioned in the figure above) in my code, while running the code, i am getting this error shown in figure below
Also my question is whether this line "model = DistributedDataParallel(model, device_ids=[device]) " mentioned in the figure above (first figure) is required for single Node multi-GPU training mode. Or is it required only for multi-node, multi-GPU training ?
It would be really helpful, If i could be guided in this regard.
Thanking you again
Rajesh
Beta Was this translation helpful? Give feedback.
All reactions