[CPU] Reduce node supports fp16 precision #18227
Conversation
@dmitry-gorokhov Hi Dmitry, could you please take a look?

@chenhu-wang @maxnick Could you please review? Thanks!
Force-pushed from 4c6b4c5 to 9427afb
```cpp
if (!InferenceEngine::with_cpu_x86_fp16()) {
    // Skip fp16 tests for platforms that don't support fp16 precision
    retVector.emplace_back(R"(.*INFERENCE_PRECISION_HINT=(F|f)16.*)");
}
```
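For context, the skip list works by matching each pattern against the full test name. A minimal standalone sketch of that matching, using a hypothetical test name (the real names are generated by the test framework):

```cpp
// Sketch only: shows how the skip pattern above filters tests whose names
// carry INFERENCE_PRECISION_HINT=f16. The test name below is hypothetical.
#include <iostream>
#include <regex>
#include <string>

int main() {
    const std::regex skipPattern(R"(.*INFERENCE_PRECISION_HINT=(F|f)16.*)");
    const std::string testName =
        "smoke_Reduce_CPU/ReduceCPULayerTest.CompareWithRefs/"
        "INFERENCE_PRECISION_HINT=f16";  // hypothetical name for illustration
    std::cout << (std::regex_search(testName, skipPattern) ? "skipped" : "run")
              << std::endl;
}
```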
You added with_cpu_x86_fp16() to detect avx512_fp16 capability, and use it here:
https://github.com/openvinotoolkit/openvino/pull/18227/files#diff-2ec2dbc37b12dc344e3d56d476bebb0a795777886906b5598ec2c91873404c0dR76
- There is no reason to limit f16 support to avx512_fp16. On avx2 it can be supported directly; without avx2 it should still be supported, with an internal fallback to f32.
- The function name is ambiguous; maybe we can remove it, since it is not needed.
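As an aside, a runtime check for avx512_fp16 can be sketched outside the plugin as well. This is only an illustration assuming a GCC/Clang toolchain; `__builtin_cpu_supports("avx512fp16")` needs a recent compiler (GCC 12+), and OpenVINO itself detects ISA features through its own internal layer, not this builtin:

```cpp
// Minimal sketch of an avx512_fp16 runtime check; not the plugin's mechanism.
#include <iostream>

bool cpu_has_avx512_fp16() {
#if defined(__GNUC__) && !defined(_MSC_VER)
    // Requires GCC 12+ / recent Clang; feature string is an assumption here.
    return __builtin_cpu_supports("avx512fp16");
#else
    return false;  // conservatively report no native fp16 on other toolchains
#endif
}

int main() {
    std::cout << "native fp16: " << std::boolalpha << cpu_has_avx512_fp16() << "\n";
}
```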
Agreed and applied. Thanks Chenhu! To make this happen, the config here is also revised to require only avx2 for fp16. Please feel free to comment if there are any concerns.
cc @dmitry-gorokhov @usstq
For the change in the common part, should we have a performance test? f16 does not always outperform f32: Reduce may be fine since its computational complexity is low, but I am not sure about other compute-bound nodes and shapes, since precision-conversion overhead is introduced. Given that not all nodes support f16, the f16 hint actually means mixed-precision execution.
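A minimal sketch of the kind of A/B measurement this asks for, using the OpenVINO 2.0 C++ API. The model path and iteration count are placeholders, and inputs are left at their default-allocated tensors for brevity:

```cpp
// Sketch: compare average latency under fp32 vs fp16 inference precision
// on the CPU plugin. "model.xml" is a placeholder path.
#include <chrono>
#include <iostream>
#include <string>
#include <openvino/openvino.hpp>

static double avg_latency_ms(ov::Core& core, const std::string& path,
                             ov::element::Type prec) {
    auto compiled = core.compile_model(path, "CPU",
                                       ov::hint::inference_precision(prec));
    auto request = compiled.create_infer_request();
    request.infer();  // warm-up run
    const int iters = 100;  // arbitrary for this sketch
    auto start = std::chrono::steady_clock::now();
    for (int i = 0; i < iters; ++i)
        request.infer();
    auto end = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(end - start).count() / iters;
}

int main() {
    ov::Core core;
    std::cout << "f32: " << avg_latency_ms(core, "model.xml", ov::element::f32) << " ms\n";
    std::cout << "f16: " << avg_latency_ms(core, "model.xml", ov::element::f16) << " ms\n";
}
```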
According to our discussion, for now we only activate FP16 test cases for platforms with AVX512_FP16 instructions. FP16 test cases for AVX2 will be activated after the follow-ups in #16500 (comment) are applied. Thanks Chenhu!
Resolved review threads:
- src/plugins/intel_cpu/tests/functional/shared_tests_instances/skip_tests_config.cpp
- src/plugins/intel_cpu/tests/functional/single_layer_tests/instances/x64/reduce.cpp
- src/plugins/intel_cpu/tests/functional/single_layer_tests/classes/reduce.cpp
Force-pushed from 58ae9cc to d16b8b4
Force-pushed from 7eec8d5 to bd90799
Details:
Tickets: