Skip to content

Conversation

susbhere
Copy link
Contributor

@susbhere susbhere commented Oct 13, 2025

This PR got messed up with multiple merge commits. Closing this and opening new one:
#32461

Details:

  • Tensor layout related properties are calculated once and used those
    cached values during per element offset calculation. This brings ~200x improvement in wait time between two queries for PhiSlica model. That means a user has to wait only for 0.36 sec (instead of 74 sec !!!) between two queries. These numbers are from LNL.

Tickets:

Tensor layout related properties are calculated once and used those
cached values during per element offset calculation.

[ONNX] Introduce GraphIterator interface in frontend (openvinotoolkit#32325)

 - *Introduce GraphIterator for ONNX*

 - *CVS-156050*

---------

Signed-off-by: Maxim Vafin <[email protected]>
@github-actions github-actions bot added the category: GPU OpenVINO GPU plugin label Oct 13, 2025
@susbhere susbhere force-pushed the copy_padded_optimization branch from e130e73 to 7d2720c Compare October 14, 2025 06:22
Copy link

@mklimenk mklimenk left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Two minor comments, but otherwise looks good.
Could you add some data on the E2E influence of his change? You've mentioned a significant improvement for this function, it'd be nice to see the in a context of a model compilation.

@susbhere susbhere force-pushed the copy_padded_optimization branch from 174e8ba to f0bd92e Compare October 14, 2025 13:03
@susbhere susbhere marked this pull request as ready for review October 14, 2025 13:04
@susbhere susbhere requested review from a team as code owners October 14, 2025 13:04
@susbhere susbhere force-pushed the copy_padded_optimization branch 3 times, most recently from 99c5e9f to 8c8331b Compare October 16, 2025 04:42
@susbhere
Copy link
Contributor Author

@susbhere susbhere force-pushed the copy_padded_optimization branch from 8c8331b to 300a281 Compare October 16, 2025 09:05
mvafin and others added 3 commits October 16, 2025 18:52
Tensor layout related properties are calculated once and used those
cached values during per element offset calculation.

[ONNX] Introduce GraphIterator interface in frontend (openvinotoolkit#32325)

 - *Introduce GraphIterator for ONNX*

 - *CVS-156050*

---------

Signed-off-by: Maxim Vafin <[email protected]>
@susbhere susbhere force-pushed the copy_padded_optimization branch from 300a281 to 1b0f98e Compare October 16, 2025 13:22
@yeonbok yeonbok changed the title [OV] Optimze copy tensor with padding [GPU] Optimze copy tensor with padding Oct 16, 2025
@p-durandin p-durandin added this to the 2025.4 milestone Oct 17, 2025
susbhere and others added 4 commits October 17, 2025 14:43
Tensor layout related properties are calculated once and used those
cached values during per element offset calculation.

[ONNX] Introduce GraphIterator interface in frontend (openvinotoolkit#32325)

 - *Introduce GraphIterator for ONNX*

 - *CVS-156050*

---------

Signed-off-by: Maxim Vafin <[email protected]>
@susbhere susbhere force-pushed the copy_padded_optimization branch from 1b0f98e to e41a06d Compare October 17, 2025 09:18
@susbhere susbhere force-pushed the copy_padded_optimization branch from 6d6398e to 088c447 Compare October 17, 2025 09:33
@susbhere susbhere force-pushed the copy_padded_optimization branch from 088c447 to 113dd78 Compare October 17, 2025 09:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: GPU OpenVINO GPU plugin

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants