[GPU] Support fsv16 Shape agnostic convolution. #25020

hyunback · 2024-06-14T05:47:44Z

Details:

Stable Diffusion on dpas platforms has poor first-inference latency because all oneDNN convolutions are compiled during the first inference. A shape-agnostic kernel removes this bottleneck. The target kernel is convolution_fsv16_1x1.
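
To illustrate why a shape-agnostic kernel helps first-inference latency, here is a minimal sketch, not the plugin's implementation. Every name in it (`Shape`, `Dispatch`, `StaticKernelCache`, `ShapeAgnosticKernel`, `compute_dispatch`) is hypothetical, and the work-size layout is an assumption chosen only to show fsv16 feature alignment. The idea it models: a static-shape kernel pays one compile per distinct shape, while a shape-agnostic kernel is compiled once and only its dispatch data is recomputed when the shape changes.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <tuple>

// All names below are hypothetical and for illustration only;
// they are not the intel_gpu kernel_selector API.
struct Shape {
    std::size_t b, f, y, x;
};

struct Dispatch {
    std::size_t gws[3];  // global work sizes (layout is an assumption)
};

constexpr std::size_t kFsv = 16;  // fsv16: features are stored in slices of 16

// Cheap per-shape step: recompute work sizes from the runtime output shape.
Dispatch compute_dispatch(const Shape& out) {
    assert(out.b > 0 && out.f > 0 && out.y > 0 && out.x > 0);
    const std::size_t f_aligned = (out.f + kFsv - 1) / kFsv * kFsv;
    return Dispatch{{out.x * out.y, f_aligned, out.b}};
}

// Static-shape strategy: one (expensive) compile per distinct shape.
struct StaticKernelCache {
    using Key = std::tuple<std::size_t, std::size_t, std::size_t, std::size_t>;
    std::map<Key, Dispatch> compiled;
    int compile_count = 0;

    Dispatch get(const Shape& s) {
        const Key key = std::make_tuple(s.b, s.f, s.y, s.x);
        auto it = compiled.find(key);
        if (it == compiled.end()) {
            ++compile_count;  // stands in for a compile with the shape baked in
            it = compiled.emplace(key, compute_dispatch(s)).first;
        }
        return it->second;
    }
};

// Shape-agnostic strategy: compile once, then only update dispatch data.
struct ShapeAgnosticKernel {
    int compile_count = 0;
    bool compiled = false;

    Dispatch get(const Shape& s) {
        if (!compiled) {
            ++compile_count;  // single compile; shape arrives as runtime arguments
            compiled = true;
        }
        return compute_dispatch(s);
    }
};
```

Feeding three different shapes through both objects leaves `StaticKernelCache::compile_count` at 3 and `ShapeAgnosticKernel::compile_count` at 1, which is the saving this PR is after for the first inference.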

Tickets:

143317

Signed-off-by: hyunback <[email protected]>

src/plugins/intel_gpu/src/graph/graph_optimizer/compile_graph.cpp

e-ddykim

Looks good to me

sshlyapn · 2024-06-26T09:19:20Z

...s/intel_gpu/src/kernel_selector/kernels/convolution/convolution_kernel_b_fs_yx_fsv16_1x1.cpp

+            kd.internalBufferSizes.clear();
+            kd.internalBufferSizes.push_back(prim_params.inputs[0].PhysicalSizeInBytes());
+            kd.internalBufferDataType = prim_params.inputs[0].GetDType();


Why do we need this internal buffer?

hyunback · 2024-06-27

Applied. Frankly, it is not needed; it was carried over from convolution_kernel_base.

hyunback added the "category: GPU" and "WIP" labels Jun 14, 2024

hyunback requested review from a team as code owners June 14, 2024 05:47

hyunback force-pushed the sa_conv_fsv16_poc branch 3 times, most recently from 36192b8 to d5da6d5 Compare June 20, 2024 00:28

hyunback added 4 commits June 20, 2024 10:00

[GPU] Support fsv16 Shape agnostic convolution.

9bcc99a

Fixing accuracy issue

1a7c7b6

Find the root cause and fix it.

Temporarily enable fusions, SA Conv fsv16_1x1 and add debugging env.

7a2895f

Parameterize hardcoded values.

01e85b5

hyunback force-pushed the sa_conv_fsv16_poc branch from d5da6d5 to 01e85b5 Compare June 20, 2024 01:03

hyunback added 2 commits June 21, 2024 21:22

Add test case.

ae79d77

Fix unit-test failure

f3fa3d3

hyunback force-pushed the sa_conv_fsv16_poc branch 2 times, most recently from 1100ab5 to f3fa3d3 Compare June 21, 2024 14:05

e-ddykim reviewed Jun 24, 2024

...s/intel_gpu/src/kernel_selector/kernels/convolution/convolution_kernel_b_fs_yx_fsv16_1x1.cpp

Apply code-review comment.

90a64fa

e-ddykim reviewed Jun 24, 2024

hyunback added 3 commits June 25, 2024 19:15

Update for code-review comment.

bebb819

Update.

a7ea15c

Remove dummy code.

4c3b532

hyunback removed the "WIP" label Jun 26, 2024

e-ddykim approved these changes Jun 26, 2024

...s/intel_gpu/src/kernel_selector/kernels/convolution/convolution_kernel_b_fs_yx_fsv16_1x1.cpp Outdated

...s/intel_gpu/src/kernel_selector/kernels/convolution/convolution_kernel_b_fs_yx_fsv16_1x1.cpp Outdated

Update

02d92f0

sshlyapn reviewed Jun 26, 2024

Remove dummy internal buffer.

57e4133

yeonbok approved these changes Jun 28, 2024

yeonbok added this pull request to the merge queue Jun 28, 2024

Merged via the queue into openvinotoolkit:master with commit a0d195d Jun 28, 2024

hyunback deleted the sa_conv_fsv16_poc branch April 8, 2025 08:36
