Carry ExtractMostSignificantBits through to LIR and add constant folding support #117673

tannergooding · 2025-07-15T17:37:51Z

This doesn't update the IR to take advantage of any special patterns yet.

It does, however, simplify the codegen for V128<byte>.ExtractMostSignificantBits.

The logic was previously:

    op1 = op1 & Vector128.Create<ulong>(0x8080808080808080).AsByte();
    op1 = AdvSimd.ShiftLogical(op1, Vector128.Create<ulong>(0x00FFFEFDFCFBFAF9).AsSByte());

    return (Vector128.Sum(op1.GetUpper()) << 8) | Vector128.Sum(op1.GetLower());

The updated logic is:

    op1 = op1 & Vector128.Create<ulong>(0x8080808080808080).AsByte();
    op1 = AdvSimd.ShiftLogical(op1, Vector128.Create<ulong>(0x00FFFEFDFCFBFAF9).AsSByte());

    var tmp = AdvSimd.ZeroExtendWideningUpper(op1);
    tmp = AdvSimd.ShiftLeftLogical(tmp, 8);
    tmp = AdvSimd.AddWideningLower(tmp, op1.GetLower());

    return Vector128.Sum(tmp);

The original logic would generate:

            movi    v16.16b, #0x80
            and     v16.16b, v0.16b, v16.16b
            ldr     q17, [@RWD00]
            ushl    v16.16b, v16.16b, v17.16b
            mov     v17.16b, v16.16b
            addv    b17, v17.8b
            umov    w0, v17.b[0]
            ext     v16.16b, v16.16b, v16.16b, #8
            addv    b16, v16.8b
            umov    w1, v16.b[0]
            orr     w0, w0, w1,  LSL #8

While the newer logic is a bit smaller and avoids a second more expensive addv instruction:

            movi    v16.16b, #0x80
            and     v16.16b, v0.16b, v16.16b
            ldr     q17, [@RWD00]
            ushl    v16.16b, v16.16b, v17.16b
            uxtl2   v17.8h, v16.16b
            shl     v17.8h, v17.8h, #8
            uaddw   v16.8h, v17.8h, v16.8b
            addv    h16, v16.8h
            umov    w0, v16.h[0]

…ing support

dotnet-policy-service · 2025-07-15T17:38:55Z

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

tannergooding · 2025-07-15T18:39:53Z

Diffs look positive and it's a nice throughput improvement as well.

The few regressions are from places we have multiple ExtractMostSignificantBits() calls and the relevant constants are no longer able to be CSE'd, which is just another variant of #70182.

We should get even bigger wins if we add some optimizations for particular x.ExtractMostSignificantBits() patterns (i.e. cases like x.ExtractMSB() == 0 and such).

tannergooding · 2025-07-15T18:40:04Z

/azp run Fuzzlyn

azure-pipelines · 2025-07-15T18:40:18Z

Azure Pipelines successfully started running 1 pipeline(s).

Copilot

Pull Request Overview

This PR carries the ExtractMostSignificantBits intrinsic through to the LIR (Low-level Intermediate Representation) and adds constant folding support. The main goal is to enable better codegen optimization for SIMD operations that extract the most significant bits from vector elements.

Removes early expansion of ExtractMostSignificantBits intrinsics during import phase
Adds constant folding capabilities for ExtractMostSignificantBits in both value numbering and expression folding
Implements LIR-level rewriting for ExtractMostSignificantBits with platform-specific optimizations

Reviewed Changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated no comments.

Show a summary per file

File	Description
src/coreclr/jit/valuenum.cpp	Adds constant folding support for ExtractMostSignificantBits in value numbering
src/coreclr/jit/simd.h	Implements EvaluateExtractMSB template functions for constant evaluation
src/coreclr/jit/rationalize.h	Declares RewriteHWIntrinsicExtractMsb method for LIR rewriting
src/coreclr/jit/rationalize.cpp	Implements platform-specific LIR rewriting for ExtractMostSignificantBits
src/coreclr/jit/hwintrinsicxarch.cpp	Removes early expansion logic for x86/x64 short/ushort cases
src/coreclr/jit/hwintrinsiclistxarch.h	Updates intrinsic flags to enable special import and disable early codegen
src/coreclr/jit/hwintrinsiclistarm64.h	Updates intrinsic flags to enable special import and disable early codegen
src/coreclr/jit/hwintrinsicarm64.cpp	Removes early expansion logic for ARM64 ExtractMostSignificantBits
src/coreclr/jit/gentree.cpp	Adds constant folding support for ExtractMostSignificantBits in expression folding

src/coreclr/jit/gentree.cpp

EgorBo

Nice!

…get valid data

tannergooding · 2025-07-16T14:37:45Z

/ba-g unrelated android timeout and image acquisition failure that passed on last run.

Carry ExtractMostSignificantBits through to LIR and add constant fold…

a1e8723

…ing support

github-actions bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Jul 15, 2025

dotnet-policy-service bot assigned tannergooding Jul 15, 2025

tannergooding marked this pull request as ready for review July 15, 2025 18:40

tannergooding requested review from Copilot and EgorBo July 15, 2025 18:40

Copilot AI reviewed Jul 15, 2025

View reviewed changes

EgorBo reviewed Jul 15, 2025

View reviewed changes

src/coreclr/jit/gentree.cpp Show resolved Hide resolved

EgorBo approved these changes Jul 15, 2025

View reviewed changes

Ensure 64-bit masks create 64-bit constants when folded

ac33bfe

tannergooding force-pushed the arm64-extractmsb branch from 45e74a4 to ac33bfe Compare July 15, 2025 20:20

tannergooding added 2 commits July 15, 2025 15:04

Handle the fact that V512.EMSB always returns TYP_LONG

c9ad82d

Expose a GetRawBits and GetBitMask helper on simdmask_t to ensure we …

16db5f5

…get valid data

This was referenced Jul 16, 2025

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

Android tests timing out #117669

Closed

tannergooding merged commit c0d4efe into dotnet:main Jul 16, 2025
105 of 111 checks passed

tannergooding deleted the arm64-extractmsb branch July 16, 2025 14:38

build-analysis bot mentioned this pull request Jul 16, 2025

X509Certificates.Tests.ChainTests.BuildChainCustomTrustStore failure on linux musl #117723

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Carry ExtractMostSignificantBits through to LIR and add constant folding support #117673

Carry ExtractMostSignificantBits through to LIR and add constant folding support #117673

Uh oh!

tannergooding commented Jul 15, 2025 •

edited

Loading

Uh oh!

dotnet-policy-service bot commented Jul 15, 2025

Uh oh!

tannergooding commented Jul 15, 2025

Uh oh!

tannergooding commented Jul 15, 2025

Uh oh!

azure-pipelines bot commented Jul 15, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

EgorBo left a comment

Uh oh!

tannergooding commented Jul 16, 2025

Uh oh!

Uh oh!

Uh oh!

Carry ExtractMostSignificantBits through to LIR and add constant folding support #117673

Carry ExtractMostSignificantBits through to LIR and add constant folding support #117673

Uh oh!

Conversation

tannergooding commented Jul 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dotnet-policy-service bot commented Jul 15, 2025

Uh oh!

tannergooding commented Jul 15, 2025

Uh oh!

tannergooding commented Jul 15, 2025

Uh oh!

azure-pipelines bot commented Jul 15, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

EgorBo left a comment

Choose a reason for hiding this comment

Uh oh!

tannergooding commented Jul 16, 2025

Uh oh!

Uh oh!

Uh oh!

tannergooding commented Jul 15, 2025 •

edited

Loading