Skip to content

Commit 2b29216

Browse files
okakarpadnikolaev-amd
authored andcommitted
[AUTOGENERATED] [release/2.8] remove extra transposes in NHWC convolutions on MIOpen (#2410)
Cherry-pick of #2405 Co-authored-by: Dmitry Nikolaev <[email protected]>
1 parent f7921f4 commit 2b29216

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

aten/src/ATen/native/miopen/Conv_miopen.cpp

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1200,7 +1200,7 @@ std::tuple<at::Tensor,at::Tensor,at::Tensor> miopen_convolution_transpose_backwa
12001200
IntArrayRef padding, IntArrayRef output_padding, IntArrayRef stride, IntArrayRef dilation, int64_t groups,
12011201
bool benchmark, bool deterministic, std::array<bool,3> output_mask) {
12021202

1203-
Tensor grad_output = grad_output_t.contiguous();
1203+
Tensor grad_output = grad_output_t.contiguous(input.suggest_memory_format());
12041204

12051205
Tensor grad_input, grad_weight, grad_bias;
12061206
if (output_mask[0]) {
@@ -1452,7 +1452,7 @@ std::tuple<at::Tensor,at::Tensor,at::Tensor> miopen_depthwise_convolution_backwa
14521452
IntArrayRef padding, IntArrayRef stride, IntArrayRef dilation, int64_t groups,
14531453
bool benchmark, bool deterministic, std::array<bool,3> output_mask) {
14541454

1455-
Tensor grad_output = grad_output_t.contiguous();
1455+
Tensor grad_output = grad_output_t.contiguous(input.suggest_memory_format());
14561456

14571457
Tensor grad_input, grad_weight, grad_bias;
14581458
if (output_mask[0]) {

0 commit comments

Comments
 (0)