The A100 uses the TF32 format for computation by default, which is already faster than FP32 and comparable in speed to FP16. Therefore, switching to FP16 might not bring a further speedup in some cases. In MMDetection, because of the complexity of the models, many parts are explicitly designed to run in FP32 to avoid numerical failures.
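To make the format comparison concrete, here is a rough sketch using only the Python standard library. It relies on the fact that TF32 keeps FP32's 8-bit exponent but only 10 mantissa bits, the same mantissa width as FP16. The helper names `round_to_tf32` and `to_fp16` are made up for this illustration, and the truncation used here is a simplification; it is not meant to reproduce the exact rounding done by the hardware.

```python
import struct


def round_to_tf32(x: float) -> float:
    """Emulate TF32 precision (hypothetical helper, simplified).

    TF32 has the FP32 exponent (8 bits) and a 10-bit mantissa, so we take
    the FP32 bit pattern and clear the 13 lowest mantissa bits (truncation).
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    bits &= 0xFFFFE000  # keep sign, 8 exponent bits, top 10 mantissa bits
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def to_fp16(x: float) -> float:
    """Round-trip a value through IEEE half precision (hypothetical helper).

    struct raises OverflowError when the value does not fit in FP16;
    we map that to infinity, which is what an FP16 overflow produces.
    """
    try:
        return struct.unpack("<e", struct.pack("<e", x))[0]
    except OverflowError:
        return float("inf") if x > 0 else float("-inf")


if __name__ == "__main__":
    for value in (0.1, 1.0, 70000.0):
        print(value, "tf32:", round_to_tf32(value), "fp16:", to_fp16(value))
```

For small values the two formats give similar precision, but a value such as 70000.0 exceeds the FP16 range while TF32 still represents it approximately. This kind of overflow is one reason some parts of a model are kept in FP32. In PyTorch, TF32 usage is controlled by `torch.backends.cuda.matmul.allow_tf32` and `torch.backends.cudnn.allow_tf32`, so turning both off is one way to measure a true FP32 baseline when comparing against FP16.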

Answer selected by ZwwWayne
This discussion was converted from issue #9022 on November 08, 2022 08:07.