[GPU] Optimize copy tensor with padding #32371
Closed
susbhere wants to merge 14 commits into openvinotoolkit:master from susbhere:copy_padded_optimization
+76 −8
Conversation
Tensor layout related properties are calculated once and the cached values are used during per-element offset calculation.
[ONNX] Introduce GraphIterator interface in frontend (openvinotoolkit#32325): Introduce GraphIterator for ONNX (CVS-156050). Signed-off-by: Maxim Vafin <[email protected]>
convert_and_copy_padded_source()
Force-pushed from e130e73 to 7d2720c
mklimenk reviewed Oct 14, 2025
Two minor comments, but otherwise looks good.
Could you add some data on the E2E influence of this change? You've mentioned a significant improvement for this function; it'd be nice to see it in the context of a model compilation.
Force-pushed from 174e8ba to f0bd92e
Force-pushed from 99c5e9f to 8c8331b
Force-pushed from 8c8331b to 300a281
Force-pushed from 300a281 to 1b0f98e
yeonbok reviewed Oct 16, 2025 (4 reviews)
…e/openvino into copy_padded_optimization
Force-pushed from 1b0f98e to e41a06d
…e/openvino into copy_padded_optimization
Force-pushed from 6d6398e to 088c447
…ffset calculation to common_utils.cpp
Force-pushed from 088c447 to 113dd78
This PR got messed up with multiple merge commits. Closing this one and opening a new one:
#32461
Details:
Tensor layout related properties are calculated once and the cached values are used during per-element offset calculation. This brings a ~200x improvement in wait time between two queries for the PhiSlica model: a user now has to wait only 0.36 sec (instead of 74 sec!) between two queries. These numbers are from LNL.
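As a sanity check on the numbers above, 74 s / 0.36 s ≈ 205x, consistent with the quoted ~200x. The snippet below is a minimal, self-contained sketch of the idea in plain C++, assuming a simple row-major padded layout; the PaddedLayout struct and the copy_padded() helper are illustrative assumptions, not the actual OpenVINO GPU plugin code behind convert_and_copy_padded_source(). The point is only that the layout-derived pitches are computed once, outside the per-element loop, and those cached values are reused for every element's offset.

```cpp
#include <cstddef>
#include <iostream>
#include <vector>

// Illustrative layout description (an assumption, not an OpenVINO type):
// logical dims plus per-dimension padding before/after the data.
struct PaddedLayout {
    std::vector<size_t> dims;
    std::vector<size_t> pad_before;
    std::vector<size_t> pad_after;
};

// Layout-related properties (padded extents and row-major pitches) are
// computed once per copy here, instead of being re-derived per element.
static std::vector<size_t> compute_pitches(const PaddedLayout& l) {
    const size_t rank = l.dims.size();
    std::vector<size_t> padded(rank), pitches(rank);
    for (size_t i = 0; i < rank; ++i)
        padded[i] = l.pad_before[i] + l.dims[i] + l.pad_after[i];
    size_t stride = 1;
    for (size_t i = rank; i-- > 0;) {
        pitches[i] = stride;
        stride *= padded[i];
    }
    return pitches;
}

// Copy only the logical (unpadded) elements of src into a dense dst buffer.
void copy_padded(const float* src, float* dst, const PaddedLayout& l) {
    const auto pitches = compute_pitches(l);  // cached once for the whole copy
    const size_t rank = l.dims.size();
    size_t total = 1;
    for (size_t d : l.dims) total *= d;
    std::vector<size_t> idx(rank, 0);
    for (size_t e = 0; e < total; ++e) {
        // The per-element offset uses only the cached pitches and pads.
        size_t off = 0;
        for (size_t i = 0; i < rank; ++i)
            off += (idx[i] + l.pad_before[i]) * pitches[i];
        dst[e] = src[off];
        // Advance the multi-dimensional index in row-major order.
        for (size_t i = rank; i-- > 0;) {
            if (++idx[i] < l.dims[i]) break;
            idx[i] = 0;
        }
    }
}

int main() {
    // Example: a 2x3 tensor with one element of padding on each side of the
    // innermost dimension, so each padded row holds 1 + 3 + 1 = 5 floats.
    PaddedLayout l{{2, 3}, {0, 1}, {0, 1}};
    std::vector<float> src = {0, 1, 2, 3, 0,
                              0, 4, 5, 6, 0};
    std::vector<float> dst(6);
    copy_padded(src.data(), dst.data(), l);
    for (float v : dst) std::cout << v << ' ';  // prints: 1 2 3 4 5 6
    std::cout << '\n';
    return 0;
}
```

Built with any C++17 compiler, this prints only the logical elements (1 2 3 4 5 6) while skipping the padding; on the real GPU plugin side the same hoisting idea would additionally avoid repeated layout queries per element, which is where the reported savings come from.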
Tickets: