Contributor

@susbhere susbhere commented Jan 8, 2026

This optimization targets discrete GPUs (dGPU) only. Removing the CPU->CPU copy that precedes the CPU->GPU copy reduces load_weights() time by ~30% across various models, which in turn reduces onnxruntime session creation time by 3-5%.
Details can be found here: https://jira.devtools.intel.com/projects/AGIJ/issues/AGIJ-27
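
To illustrate the idea, here is a minimal sketch of the two load paths. It is not the plugin's actual code: `CopyStats`, `upload_to_device`, `load_weights_staged`, and `load_weights_direct` are hypothetical names, and a `std::vector` stands in for device memory, where a real plugin would enqueue a device write.

```cpp
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical counters tracking how many bytes each kind of copy moves.
struct CopyStats {
    std::size_t host_to_host = 0;
    std::size_t host_to_device = 0;
};

// Simulated CPU->GPU transfer. Assumption: a std::vector stands in for
// device memory; a real plugin would enqueue a device write here.
void upload_to_device(std::vector<unsigned char>& device,
                      const unsigned char* src, std::size_t n,
                      CopyStats& stats) {
    device.assign(src, src + n);
    stats.host_to_device += n;
}

// Before: weights are duplicated into a temporary host buffer (CPU->CPU)
// and only then uploaded (CPU->GPU).
void load_weights_staged(const std::vector<unsigned char>& weights,
                         std::vector<unsigned char>& device,
                         CopyStats& stats) {
    std::vector<unsigned char> staging(weights.size());
    if (!weights.empty()) {
        std::memcpy(staging.data(), weights.data(), weights.size());
    }
    stats.host_to_host += weights.size();
    upload_to_device(device, staging.data(), staging.size(), stats);
}

// After: the upload reads straight from the source buffer, so the
// intermediate CPU->CPU copy disappears.
void load_weights_direct(const std::vector<unsigned char>& weights,
                         std::vector<unsigned char>& device,
                         CopyStats& stats) {
    upload_to_device(device, weights.data(), weights.size(), stats);
}
```

Both functions leave the same bytes in the simulated device buffer; the direct path simply skips one full pass over the weights on the host, which is where the saving would come from.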

@github-actions github-actions bot added the "category: Core" (OpenVINO Core, aka ngraph) and "category: GPU" (OpenVINO GPU plugin) labels Jan 8, 2026
@e-ddykim e-ddykim marked this pull request as ready for review January 8, 2026 08:48
@e-ddykim e-ddykim requested review from a team as code owners January 8, 2026 08:48
@susbhere susbhere force-pushed the OptimizeLoadWeights branch 2 times, most recently from 1317b22 to d94dd40 Compare January 8, 2026 10:56
@susbhere susbhere force-pushed the OptimizeLoadWeights branch from d94dd40 to 43b6c53 Compare January 8, 2026 14:15
@susbhere susbhere changed the title from "CPU->CPU copy removed during load weights for DGfx" to "CPU->CPU copy removed during load weights for DGPU" Jan 8, 2026

Labels

category: Core (OpenVINO Core, aka ngraph)
category: GPU (OpenVINO GPU plugin)
do not merge (do_not_merge)

2 participants