Predict on GPU and calculate metrics on CPU, DDP mode #6433
Unanswered
frapercan asked this question in DDP / multi-GPU / multi-node
Replies: 0 comments
Hello, I'm interested in calculating PRC, ROC, and other metrics for a multilabel segmentation problem (tons of pixels). The problem is the amount of VRAM those metrics require, and I would like to keep everything in the same pipeline.
So, during training, I would like the evaluation loop to calculate those metrics without running out of memory. I have seen that the metric calculation is slower when no GPU is selected, but that is mainly because the model takes longer to predict on CPU; still, running entirely on CPU is the only way I have managed to calculate the metrics in system RAM.
I have tried setting up and feeding the PrecisionRecallCurve metric with CPU tensors while inference runs on the GPU, but I have not found a way to make such a hybrid mode work: the update step runs fine, but when I compute the metric it tells me that GPU tensors are required.
I tried something like this, along with a lot of variants:
Initialization (one GPU and DDP are configured):

```python
precision_recall_curve = PrecisionRecallCurve(pos_label=1, num_classes=1, compute_on_step=False).cpu()
```

step_end:

```python
precision_recall_curve.update(patch_pred[:, defect_index].cpu(), patch_mask[:, defect_index].cpu())
```

epoch_end:

```python
precision, recall, threshold = precision_recall_curve.compute()
```
The error, as mentioned above, comes from the compute() call and complains that GPU tensors are required.
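To make this easier to reproduce, here is a minimal, self-contained sketch of the pattern I am attempting. Only the three metric calls are taken from my actual code; the module name, the stand-in Conv2d model, the hook names, and the batch layout are illustrative assumptions on my part.

```python
import torch
import pytorch_lightning as pl
from torchmetrics import PrecisionRecallCurve


class SegmentationModule(pl.LightningModule):
    """Illustrative module: only the three metric calls mirror my real code."""

    def __init__(self, defect_index: int = 0):
        super().__init__()
        self.defect_index = defect_index
        # Stand-in for my real segmentation network.
        self.model = torch.nn.Conv2d(3, 1, kernel_size=1)
        # The metric is created on CPU and meant to stay there.
        # (Note: since it is a submodule, Lightning may move it to the GPU
        # together with the rest of the module when devices are set up.)
        self.precision_recall_curve = PrecisionRecallCurve(
            pos_label=1, num_classes=1, compute_on_step=False
        ).cpu()

    def forward(self, x):
        return self.model(x)

    def validation_step(self, batch, batch_idx):
        patch, patch_mask = batch
        patch_pred = torch.sigmoid(self(patch))  # inference stays on the GPU
        return {"pred": patch_pred, "mask": patch_mask}

    def validation_step_end(self, outputs):
        # update() with CPU tensors works fine for me...
        self.precision_recall_curve.update(
            outputs["pred"][:, self.defect_index].cpu(),
            outputs["mask"][:, self.defect_index].cpu(),
        )

    def validation_epoch_end(self, outputs):
        # ...but this compute() call is where the error about GPU tensors appears.
        precision, recall, threshold = self.precision_recall_curve.compute()
        self.precision_recall_curve.reset()
```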
I would like to discuss this kind of problem and how it can be solved. I can't find much information about CPU metric usage in the documentation, and I'm still learning.
I have seen people using the scikit-learn library; it would probably fit my requirements, but I would rather not add more pieces to the stack.
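For completeness, this is roughly how I picture the scikit-learn alternative: buffer predictions and masks on the CPU during the evaluation loop and call sklearn.metrics.precision_recall_curve once per epoch. Everything here except that function (the buffer names and helper functions) is my own assumption, just to make the idea concrete.

```python
import numpy as np
from sklearn.metrics import precision_recall_curve

# Buffers filled during the evaluation loop, one entry per batch,
# with tensors moved to CPU and converted to numpy right away.
pred_buffer, mask_buffer = [], []

def accumulate(patch_pred, patch_mask, defect_index):
    pred_buffer.append(patch_pred[:, defect_index].detach().cpu().numpy().ravel())
    mask_buffer.append(patch_mask[:, defect_index].detach().cpu().numpy().ravel())

def compute_epoch_metrics():
    # Flatten all pixels from the whole epoch into one 1-D array each.
    preds = np.concatenate(pred_buffer)
    targets = np.concatenate(mask_buffer)
    precision, recall, thresholds = precision_recall_curve(targets, preds)
    pred_buffer.clear()
    mask_buffer.clear()
    return precision, recall, thresholds
```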
Thanks for your work, this framework is amazing :)