JIT: Allow more containment opts in Tier0 #117622

saucecontrol · 2025-07-14T19:40:55Z

This enables embedded broadcast of non-const values in Tier0

Diffs are a net improvement, although there are a few regressions where an extra temp ends up being introduced due to arg swapping.

There are also a few 1- or 2-byte regressions where we swapped from containing a full vector load arg to containing a broadcast arg, which then forces EVEX encoding. It would be interesting to look at optimizing around that (separately -- it would impact FullOpts as well)

saucecontrol · 2025-07-15T18:19:04Z

cc @tannergooding

tannergooding · 2025-07-15T18:22:01Z

There are also a few 1- or 2-byte regressions where we swapped from containing a full vector load arg to containing a broadcast arg

We view this as an explicit improvement and the real "issue" is more that SPMI doesn't surface any size savings in the data section size. -- That is, while the codegen is 1-2 bytes bigger, we save 8-60 bytes of data section size and improve cache locality.

src/coreclr/jit/lowerxarch.cpp

saucecontrol · 2025-07-15T18:30:36Z

We view this as an explicit improvement and the real "issue" is more that SPMI doesn't surface any size savings in the data section size. -- That is, while the codegen is 1-2 bytes bigger, we save 8-60 bytes of data section size and improve cache locality.

The cases I'm referring to are like this:

where it's a broadcast either way, and we can contain either the broadcast or the full vector. It's always 2 instructions because they can't both be contained. Switching from containing the full vector to containing the broadcast means you have to switch to EVEX, so it's a net increase in size.

This particular regression only applies to instructions where we swap operands in order to be able to contain one, so I think we could simply give lower preference to CnsVec operands that might be turned into broadcast. Or something like that?

src/coreclr/jit/lowerxarch.cpp

tannergooding · 2025-07-15T18:47:11Z

This particular regression only applies to instructions where we swap operands in order to be able to contain one, so I think we could simply give lower preference to CnsVec operands that might be turned into broadcast. Or something like that?

Ah, I see.

Yeah, in general we want to prefer loads from arbitrary memory, then broadcastable constants, then regular constants.

saucecontrol · 2025-07-15T22:40:16Z

Disabled the aligned load containment. Diffs are smaller but still a net improvement.

saucecontrol · 2025-07-16T07:17:05Z

I've split the TryFoldCnsVecForEmbeddedBroadcast changes out into to #117700

tannergooding

LGTM. CC. @dotnet/jit-contrib for secondary review

tannergooding · 2025-07-22T03:55:21Z

/ba-g unrelated arm64 timeouts

github-actions bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Jul 14, 2025

dotnet-policy-service bot added the community-contribution Indicates that the PR has been added by a community member label Jul 14, 2025

allow more containment opts in tier0

b042369

saucecontrol force-pushed the more-t0-opts branch from 4b78536 to b042369 Compare July 14, 2025 22:56

build-analysis bot mentioned this pull request Jul 15, 2025

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

3 tasks

saucecontrol marked this pull request as ready for review July 15, 2025 18:18

saucecontrol commented Jul 15, 2025

View reviewed changes

src/coreclr/jit/lowerxarch.cpp Outdated Show resolved Hide resolved

tannergooding reviewed Jul 15, 2025

View reviewed changes

src/coreclr/jit/lowerxarch.cpp Outdated Show resolved Hide resolved

tannergooding reviewed Jul 15, 2025

View reviewed changes

src/coreclr/jit/lowerxarch.cpp Show resolved Hide resolved

disable aligned load containment when opts disabled

f0bef79

This was referenced Jul 16, 2025

STATUS_UNSUCCESSFUL in RsaCryptRoundtrip_OaepSHA1 #29683

Open

Android tests timing out #117669

Closed

revert TryFoldCnsVecForEmbeddedBroadcast changes

e16f00e

This was referenced Jul 16, 2025

X509Certificates.Tests.ChainTests.BuildChainCustomTrustStore failure on linux musl #117723

Open

System.Security.Cryptography.X509Certificates.Tests fails on linux #117724

Closed

tannergooding requested a review from EgorBo July 16, 2025 16:10

tannergooding approved these changes Jul 16, 2025

View reviewed changes

EgorBo approved these changes Jul 21, 2025

View reviewed changes

Merge branch 'main' into more-t0-opts

00a3644

build-analysis bot mentioned this pull request Jul 21, 2025

System.Diagnostics.Tests.ProcessTests.TestCheckChildProcessUserAndGroupIds fails on Alpine jobs with "Operation not permitted" #117811

Open

tannergooding merged commit 0b2f272 into dotnet:main Jul 22, 2025
102 of 110 checks passed

saucecontrol deleted the more-t0-opts branch July 22, 2025 04:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

JIT: Allow more containment opts in Tier0 #117622

JIT: Allow more containment opts in Tier0 #117622

saucecontrol commented Jul 14, 2025 •

edited

Loading

Uh oh!

saucecontrol commented Jul 15, 2025

Uh oh!

tannergooding commented Jul 15, 2025

Uh oh!

Uh oh!

Uh oh!

saucecontrol commented Jul 15, 2025

Uh oh!

Uh oh!

tannergooding commented Jul 15, 2025

Uh oh!

saucecontrol commented Jul 15, 2025

Uh oh!

saucecontrol commented Jul 16, 2025

Uh oh!

tannergooding left a comment

Uh oh!

tannergooding commented Jul 22, 2025

Uh oh!

Uh oh!

Uh oh!

JIT: Allow more containment opts in Tier0 #117622

JIT: Allow more containment opts in Tier0 #117622

Conversation

saucecontrol commented Jul 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

saucecontrol commented Jul 15, 2025

Uh oh!

tannergooding commented Jul 15, 2025

Uh oh!

Uh oh!

Uh oh!

saucecontrol commented Jul 15, 2025

Uh oh!

Uh oh!

tannergooding commented Jul 15, 2025

Uh oh!

saucecontrol commented Jul 15, 2025

Uh oh!

saucecontrol commented Jul 16, 2025

Uh oh!

tannergooding left a comment

Choose a reason for hiding this comment

Uh oh!

tannergooding commented Jul 22, 2025

Uh oh!

Uh oh!

Uh oh!

saucecontrol commented Jul 14, 2025 •

edited

Loading