UNet2DS tensorflow non-deterministic training

Just making a note for future reference that training the UNet2DS model on the GPU with Tensorflow backend results in non-deterministic gradient updates, which results in non-deterministic final results. The final submission are typically within 2% of each other in terms of mean F1 score, but still this adds a confounding factor when trying to compare changes to the architecture or training strategy.

There is a lot of material online about TF's non-determinism. Most of it points to the fact that the underlying CuDNN implementation uses non-deterministic reductions for convolutions (i.e. floating point operations are not necessarily associative). The best, most recent insight I could find was in [this pull-request](https://github.com/tensorflow/tensorflow/pull/10636), with comments indicating there is supposedly a forthcoming fix to address this issue.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UNet2DS tensorflow non-deterministic training #1

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

UNet2DS tensorflow non-deterministic training #1

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions