-
Notifications
You must be signed in to change notification settings - Fork 13.4k
[AMDGPU] Extend permlane16, permlanex16 and permlane64 intrinsic lowering for generic types #92725
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
…ing for generic types
✅ With the latest revision this PR passed the C/C++ code formatter. |
db19330
to
881e116
Compare
…ring for generic types
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
On this and the previous, can you add a section to AMDGPUUsage for the intrinsics and what types they support
Register Src1Cast = | ||
MRI.getType(Src1).isScalar() | ||
? Src1 | ||
: B.buildBitcast(LLT::scalar(Size), Src2).getReg(0); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Like the other patch, shouldn't need any bitcasts
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, I will take over the changes from #89217 once finalized,
|
…ring for generic types (llvm#92725) These are incremental changes over llvm#89217 , with core logic being the same. This patch along with llvm#89217 and llvm#91190 should get us ready to enable 64 bit optimizations in atomic optimizer.
…ring for generic types (llvm#92725) These are incremental changes over llvm#89217 , with core logic being the same. This patch along with llvm#89217 and llvm#91190 should get us ready to enable 64 bit optimizations in atomic optimizer. Change-Id: Ief70422a47461606c29134b217f40204ee4a198b
…ring for generic types (llvm#92725) These are incremental changes over llvm#89217 , with core logic being the same. This patch along with llvm#89217 and llvm#91190 should get us ready to enable 64 bit optimizations in atomic optimizer. Change-Id: Ief70422a47461606c29134b217f40204ee4a198b
Kindly review only top commits here (i.e commits except the first). These are incremental changes over #89217 , with core logic being the same. Only reason to split these up into separate PR is for ease of review.
This patch along with #89217 and #91190 should get us ready to enable 64 bit optimizations in atomic optimizer.