Skip to content

[release/9.0-staging] Backport #113095: Better ARM intrinsics implementation for dn_simdhash #113234

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 1 commit into from

Conversation

kg
Copy link
Member

@kg kg commented Mar 6, 2025

See #113074.

Customer impact

Multiple customers have reported seeing significantly slower iOS/MacCatalyst AOT compilation times in .NET 9 compared to .NET 8. We found the slowdown was specific to ARM macs and was due to ARM specific inefficiencies with a hashtable we introduced early on in the .NET 9 cycle.

Both internal and external customers are affected by this regression. This PR delivers performance on par with what customers experienced in .NET 8.

Regression

Yes, 9.0 is significantly slower than 8.0 for this scenario

Testing

We tested on Mac arm64/x64 and Linux arm64 for completeness. Performance was acceptable on all three.

Risk

Low as the change is fairly narrow in scope.

Restrict neon implementation to arm64 because arm32 neon doesn't have wide vector registers
Add missing vreinterpret
arm64 build fixes
Ampere ARM64 optimizations
Cleanup simdhash arch header and remove redundant assert
Better define name
@ghost ghost added the needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners label Mar 6, 2025
@kg kg changed the title Backport #113095: Better ARM intrinsics implementation for dn_simdhash [release/9.0-staging] Backport #113095: Better ARM intrinsics implementation for dn_simdhash Mar 6, 2025
@kg kg added the Servicing-consider Issue for next servicing release review label Mar 6, 2025
@kg
Copy link
Member Author

kg commented Mar 6, 2025

cc @kotlarmilos @steveisok

@steveisok steveisok marked this pull request as ready for review March 6, 2025 22:01
@Copilot Copilot AI review requested due to automatic review settings March 6, 2025 22:01
Copy link
Contributor

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

Copy link
Member

@jeffschwMSFT jeffschwMSFT left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm. please get a code review. we will take for consideration in 9.0.x

@kotlarmilos
Copy link
Member

I suggest to wait until #113241 is merged and then backport the final changes. Also, let's standup the build measurements in the perf lab to validate the improvements.

@jeffschwMSFT jeffschwMSFT removed the Servicing-consider Issue for next servicing release review label Mar 7, 2025
@steveisok
Copy link
Member

Closing in favor of #113316

@steveisok steveisok closed this Mar 10, 2025
@github-actions github-actions bot locked and limited conversation to collaborators Apr 10, 2025
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
needs-area-label An area label is needed to ensure this gets routed to the appropriate area owners
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants