
[AMDGPU] Extend permlane16, permlanex16 and permlane64 intrinsic lowering for generic types #92725


Merged
13 commits merged into llvm:main from permlane_generic on Jun 26, 2024

Conversation

@vikramRH (Contributor) commented May 20, 2024

Kindly review only the top commits here (i.e. the commits except the first). These are incremental changes over #89217, with the core logic being the same. The only reason to split these up into a separate PR is ease of review.
This patch, along with #89217 and #91190, should get us ready to enable 64-bit optimizations in the atomic optimizer.
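For context, a minimal usage sketch of what the extended lowering enables, assuming the intrinsics become overloaded on their value type as this patch proposes. The helper name below is illustrative and not code from this PR:

// Hypothetical helper (not from this PR): request llvm.amdgcn.permlane64
// on a 64-bit value and let the backend legalize it into 32-bit pieces.
#include "llvm/IR/IRBuilder.h"
#include "llvm/IR/IntrinsicsAMDGPU.h"

using namespace llvm;

static Value *emitPermlane64(IRBuilder<> &B, Value *V) {
  // The intrinsic is overloaded on the operand type (e.g. i64 or double).
  return B.CreateIntrinsic(Intrinsic::amdgcn_permlane64, {V->getType()}, {V});
}

The same pattern would apply to llvm.amdgcn.permlane16 and llvm.amdgcn.permlanex16 with their additional lane-select and control operands.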

github-actions bot commented May 20, 2024

✅ With the latest revision this PR passed the C/C++ code formatter.

@vikramRH force-pushed the permlane_generic branch from db19330 to 881e116 on May 20, 2024 10:02
@arsenm (Contributor) left a comment


On this and the previous patch, can you add a section to AMDGPUUsage for the intrinsics and what types they support?

Register Src1Cast =
    MRI.getType(Src1).isScalar()
        ? Src1
        : B.buildBitcast(LLT::scalar(Size), Src1).getReg(0);
Contributor

Like the other patch, shouldn't need any bitcasts
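For illustration only, a rough sketch of a bitcast-free direction along these lines: instead of bitcasting a wide value to a scalar, unmerge it into 32-bit pieces, apply the lane op per piece, and merge the results back. The function and the EmitPieceOp callback are hypothetical, kept only to make the sketch self-contained; this is not the PR's code:

#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallVector.h"
#include "llvm/CodeGen/GlobalISel/MachineIRBuilder.h"

using namespace llvm;

// Hypothetical sketch: handle s64, v2s32, etc. by splitting into 32-bit
// pieces with G_UNMERGE_VALUES, so no G_BITCAST is required for these cases.
static Register lowerWideLaneOp(
    MachineIRBuilder &B, Register DstReg, Register SrcReg,
    function_ref<Register(MachineIRBuilder &, Register)> EmitPieceOp) {
  const LLT S32 = LLT::scalar(32);
  auto Unmerge = B.buildUnmerge(S32, SrcReg);
  SmallVector<Register, 4> Pieces;
  for (unsigned I = 0, E = Unmerge->getNumDefs(); I != E; ++I)
    Pieces.push_back(EmitPieceOp(B, Unmerge.getReg(I)));
  // Reassemble the per-piece results into DstReg's original wide type.
  B.buildMergeLikeInstr(DstReg, Pieces);
  return DstReg;
}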

Contributor Author

Yes, I will take over the changes from #89217 once they are finalized.

@vikramRH (Contributor Author)

  1. Added/updated tests for permlanex16, permlane64
  2. This needs #89217 ([AMDGPU] Extend readlane, writelane and readfirstlane intrinsic lowering for generic types) to land first so that only the incremental changes need to be reviewed.

@vikramRH (Contributor Author) commented Jun 17, 2024

Updated this PR to be in sync with #89217. However, the plan is still to land this only after the changes in #89217 are accepted.

@vikramRH changed the title from "[AMDGPU][WIP] Extend permlane16, permlanex16 and permlane64 intrinsic lowering for generic types" to "[AMDGPU] Extend permlane16, permlanex16 and permlane64 intrinsic lowering for generic types" on Jun 23, 2024
@vikramRH marked this pull request as ready for review on June 23, 2024 17:05
@vikramRH merged commit 35f7b60 into llvm:main on Jun 26, 2024
8 checks passed
AlexisPerry pushed a commit to llvm-project-tlp/llvm-project that referenced this pull request Jul 9, 2024
[AMDGPU] Extend permlane16, permlanex16 and permlane64 intrinsic lowering for generic types (llvm#92725)

These are incremental changes over llvm#89217 , with core logic being the
same. This patch along with llvm#89217 and llvm#91190 should get us ready to enable 64
bit optimizations in atomic optimizer.
jrbyrnes pushed a commit to jrbyrnes/llvm-project that referenced this pull request Aug 16, 2024
searlmc1 pushed a commit to ROCm/llvm-project that referenced this pull request Sep 11, 2024