Handle scalable store size in MemCpyOptimizer #118957

momchil-velikov · 2024-12-06T11:52:30Z

The compiler crashes with an ICE when it tries to create a memset with scalable size.

The compiler crashes with an ICE when it tries to create a `memset` with scalable size.

llvmbot · 2024-12-06T11:53:03Z

@llvm/pr-subscribers-llvm-transforms

@llvm/pr-subscribers-backend-aarch64

Author: Momchil Velikov (momchil-velikov)

Changes

The compiler crashes with an ICE when it tries to create a memset with scalable size.

Full diff: https://github.com/llvm/llvm-project/pull/118957.diff

2 Files Affected:

(modified) llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp (+2-1)
(added) llvm/test/CodeGen/AArch64/memset-scalable-size.ll (+56)

diff --git a/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp b/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp
index 0cba5d077da62b..fc5f6ff2b7f377 100644
--- a/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp
+++ b/llvm/lib/Transforms/Scalar/MemCpyOptimizer.cpp
@@ -800,8 +800,9 @@ bool MemCpyOptPass::processStore(StoreInst *SI, BasicBlock::iterator &BBI) {
     // in subsequent passes.
     auto *T = V->getType();
     if (T->isAggregateType()) {
-      uint64_t Size = DL.getTypeStoreSize(T);
       IRBuilder<> Builder(SI);
+      Value *Size =
+          Builder.CreateTypeSize(Builder.getInt64Ty(), DL.getTypeStoreSize(T));
       auto *M = Builder.CreateMemSet(SI->getPointerOperand(), ByteVal, Size,
                                      SI->getAlign());
       M->copyMetadata(*SI, LLVMContext::MD_DIAssignID);
diff --git a/llvm/test/CodeGen/AArch64/memset-scalable-size.ll b/llvm/test/CodeGen/AArch64/memset-scalable-size.ll
new file mode 100644
index 00000000000000..8ea6330f235a69
--- /dev/null
+++ b/llvm/test/CodeGen/AArch64/memset-scalable-size.ll
@@ -0,0 +1,56 @@
+; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --version 5
+; RUN: opt -S --passes=memcpyopt < %s | FileCheck %s
+target triple = "aarch64-unknown-linux"
+
+define void @f0() {
+; CHECK-LABEL: define void @f0() {
+; CHECK-NEXT:  [[ENTRY:.*:]]
+; CHECK-NEXT:    [[P:%.*]] = alloca { <vscale x 16 x i1>, <vscale x 16 x i1> }, align 2
+; CHECK-NEXT:    [[TMP0:%.*]] = call i64 @llvm.vscale.i64()
+; CHECK-NEXT:    [[TMP1:%.*]] = mul i64 [[TMP0]], 4
+; CHECK-NEXT:    call void @llvm.memset.p0.i64(ptr align 2 [[P]], i8 0, i64 [[TMP1]], i1 false)
+; CHECK-NEXT:    call void @g(ptr [[P]])
+; CHECK-NEXT:    ret void
+;
+entry:
+  %p = alloca { <vscale x 16 x i1>, <vscale x 16 x i1>}, align 2
+  store { <vscale x 16 x i1>, <vscale x 16 x i1> } zeroinitializer, ptr %p, align 2
+  call void @g(ptr %p)
+  ret void
+}
+
+define void @f1() {
+; CHECK-LABEL: define void @f1() {
+; CHECK-NEXT:  [[ENTRY:.*:]]
+; CHECK-NEXT:    [[P:%.*]] = alloca { <vscale x 4 x float>, <vscale x 4 x float>, <vscale x 4 x float> }, align 16
+; CHECK-NEXT:    [[TMP0:%.*]] = call i64 @llvm.vscale.i64()
+; CHECK-NEXT:    [[TMP1:%.*]] = mul i64 [[TMP0]], 48
+; CHECK-NEXT:    call void @llvm.memset.p0.i64(ptr align 16 [[P]], i8 0, i64 [[TMP1]], i1 false)
+; CHECK-NEXT:    call void @g(ptr [[P]])
+; CHECK-NEXT:    ret void
+;
+entry:
+  %p = alloca {<vscale x 4 x float>, <vscale x 4 x float>, <vscale x 4 x float> }, align 16
+  store {<vscale x 4 x float>, <vscale x 4 x float>, <vscale x 4 x float> } zeroinitializer, ptr %p, align 16
+  call void @g(ptr %p)
+  ret void
+}
+
+define void @f2() {
+; CHECK-LABEL: define void @f2() {
+; CHECK-NEXT:  [[ENTRY:.*:]]
+; CHECK-NEXT:    [[P:%.*]] = alloca { <vscale x 8 x double>, <vscale x 8 x double>, <vscale x 8 x double> }, align 16
+; CHECK-NEXT:    [[TMP0:%.*]] = call i64 @llvm.vscale.i64()
+; CHECK-NEXT:    [[TMP1:%.*]] = mul i64 [[TMP0]], 192
+; CHECK-NEXT:    call void @llvm.memset.p0.i64(ptr align 16 [[P]], i8 0, i64 [[TMP1]], i1 false)
+; CHECK-NEXT:    call void @g(ptr [[P]])
+; CHECK-NEXT:    ret void
+;
+entry:
+  %p = alloca {<vscale x 8 x double>, <vscale x 8 x double>, <vscale x 8 x double> }, align 16
+  store {<vscale x 8 x double>, <vscale x 8 x double>, <vscale x 8 x double> } zeroinitializer, ptr %p, align 16
+  call void @g(ptr %p)
+  ret void
+}
+
+declare void @g(ptr)

jthackray

Thanks for the fix. LGTM.

nikic

Is it actually desirable to convert scalable load/store into a memset?

llvm/test/CodeGen/AArch64/memset-scalable-size.ll

nikic

.

momchil-velikov · 2024-12-06T12:08:39Z

Is it actually desirable to convert scalable load/store into a memset?

Ah, good point!

nikic

LGTM

Handle scalable store size in MemCpyOptimizer

7e2d603

The compiler crashes with an ICE when it tries to create a `memset` with scalable size.

momchil-velikov requested review from CarolineConcatto, Lukacma, SpencerAbson, artagnon, jthackray, nikic, rgwott and tmatheson-arm December 6, 2024 11:52

llvmbot added backend:AArch64 llvm:transforms labels Dec 6, 2024

jthackray approved these changes Dec 6, 2024

View reviewed changes

nikic reviewed Dec 6, 2024

View reviewed changes

llvm/test/CodeGen/AArch64/memset-scalable-size.ll Outdated Show resolved Hide resolved

nikic requested changes Dec 6, 2024

View reviewed changes

[fixup] Don't create a call to memset and move test to vscale-memset.ll

eca59d2

SpencerAbson mentioned this pull request Dec 6, 2024

[AArch64] Refactor implementation of FP8 types (NFC) #118969

Closed

nikic approved these changes Dec 6, 2024

View reviewed changes

momchil-velikov merged commit 5d9c321 into llvm:main Dec 6, 2024
8 checks passed

momchil-velikov deleted the fix-memset branch January 29, 2025 10:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Handle scalable store size in MemCpyOptimizer #118957

Handle scalable store size in MemCpyOptimizer #118957

Uh oh!

momchil-velikov commented Dec 6, 2024

Uh oh!

llvmbot commented Dec 6, 2024 •

edited

Loading

Uh oh!

jthackray left a comment

Uh oh!

nikic left a comment

Uh oh!

Uh oh!

nikic left a comment

Uh oh!

momchil-velikov commented Dec 6, 2024

Uh oh!

nikic left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Handle scalable store size in MemCpyOptimizer #118957

Handle scalable store size in MemCpyOptimizer #118957

Uh oh!

Conversation

momchil-velikov commented Dec 6, 2024

Uh oh!

llvmbot commented Dec 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jthackray left a comment

Choose a reason for hiding this comment

Uh oh!

nikic left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

nikic left a comment

Choose a reason for hiding this comment

Uh oh!

momchil-velikov commented Dec 6, 2024

Uh oh!

nikic left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

llvmbot commented Dec 6, 2024 •

edited

Loading