File tree Expand file tree Collapse file tree 1 file changed +4
-3
lines changed
Expand file tree Collapse file tree 1 file changed +4
-3
lines changed Original file line number Diff line number Diff line change @@ -948,9 +948,10 @@ def generate_gpu_compiler_spec(
948948
949949 Args:
950950 precision:
951- kGpuPerfHintHigh - best inference latency at the expense of power consumption.
952- kGpuPerfHintNormal - balanced performance dependent upon power management.
953- kGpuPerfHintLow - lowest power consumption at the expense of inference latency.
951+ kGpuPrecisionFp32 - Sets the precision mode to floating point 32-bit (FP32).
952+ kGpuPrecisionFp16 - Sets the precision mode to floating point 16-bit (FP16).
953+ kGpuPrecisionHybrid - Sets the precision mode to FP16 for storage and FP32 for calculations.
954+ kGpuPrecisionUserProvided - Uses the tensor data type provided by the user.
954955 use_memory_optimizations: If true, backend will share NATIVE tensor memory
955956 based upon analysis of the network topology.
956957 use_node_optimizations: If true, backend will fuse compatible operations into
You can’t perform that action at this time.
0 commit comments