Check Avx512BW.IsSupported in BitArray.CopyTo() #114818

tfenise · 2025-04-18T16:33:32Z

runtime/src/libraries/System.Collections/src/System/Collections/BitArray.cs

Line 855 in 7e297e6

Vector512<byte> shuffled = Avx512BW.Shuffle(scalar.AsByte(), shuffleMask);

runtime/src/libraries/System.Collections/src/System/Collections/BitArray.cs

Line 860 in 7e297e6

Vector512<byte> normalized = Avx512BW.Min(extracted, ones);

It's using Avx512BW, so it should check Avx512BW.IsSupported first. Checking Avx512F.IsSupported is not enough, as there are (old) CPUs supporting Avx512F but not Avx512BW according to https://en.wikipedia.org/w/index.php?title=AVX-512&oldid=1281313848#CPUs_with_AVX-512.

dotnet-policy-service · 2025-04-18T16:34:06Z

Tagging subscribers to this area: @dotnet/area-system-collections
See info in area-owners.md if you want to be subscribed.

tannergooding · 2025-04-18T16:45:48Z

src/libraries/System.Collections/src/System/Collections/BitArray.cs

@@ -837,7 +837,7 @@ public unsafe void CopyTo(Array array, int index)
                Vector128<byte> lowerShuffleMask_CopyToBoolArray = Vector128.Create(0, 0x01010101_01010101).AsByte();
                Vector128<byte> upperShuffleMask_CopyToBoolArray = Vector128.Create(0x02020202_02020202, 0x03030303_03030303).AsByte();

-                if (Avx512F.IsSupported && (uint)m_length >= Vector512<byte>.Count)
+                if (Avx512BW.IsSupported && (uint)m_length >= Vector512<byte>.Count)


Since you're touching this anyways, would you want to update this to use the xplat APIs instead?

That is, swap these out:

Avx512BW.IsSupported -> Vector512.IsHardwareAccelerated

Avx512BW.Shuffle(x, y) -> Vector512.Shuffle(x, y)

Avx512F.And(x, y) -> x & y

Avx512BW.Min(x, y) -> Vector512.Min(x, y)

Avx512F.Store(x, y) -> y.Store(x)

Similar changes could also be made to the Avx2 path (using Vector256) and so on.

Checking Avx512BW is "more correct" than checking Avx512F, but the existing check is not technically broken today, since we require AVX512F+BW+CD+DQ+VL to be provided for any AVX512 support to exist. It's still good to fix, but moving away from the platform-specific APIs altogether would be even better.
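As a rough illustration of the substitutions listed above, the following sketch expands 64 bits into 64 bytes using only the cross-platform `Vector512` APIs. It is not the code in BitArray.cs: the class and method names are hypothetical, it assumes .NET 8 or later, and it writes the result with `CopyTo(Span<byte>)` rather than the pointer-based `Store` mentioned in the list, so that no `unsafe` context is needed.

```csharp
using System;
using System.Runtime.Intrinsics;

static class BitExpansionSketch
{
    // Hypothetical helper (not from BitArray.cs): expands the 64 bits of `bits`
    // into 64 bytes, each 0 or 1, lowest bit first.
    public static void ExpandBits(ulong bits, Span<byte> destination)
    {
        if (destination.Length < Vector512<byte>.Count)
            throw new ArgumentException("Need room for 64 bytes.", nameof(destination));

        if (Vector512.IsHardwareAccelerated && BitConverter.IsLittleEndian)
        {
            // Output bytes 8k..8k+7 all select source byte k (k = 0..7).
            Vector512<byte> shuffleMask = Vector512.Create(
                0x00000000_00000000UL, 0x01010101_01010101UL,
                0x02020202_02020202UL, 0x03030303_03030303UL,
                0x04040404_04040404UL, 0x05050505_05050505UL,
                0x06060606_06060606UL, 0x07070707_07070707UL).AsByte();

            // Byte j of each 8-byte group isolates bit j of the selected source byte.
            Vector512<byte> bitMask = Vector512.Create(0x80402010_08040201UL).AsByte();
            Vector512<byte> ones = Vector512.Create((byte)1);

            Vector512<byte> scalar = Vector512.Create(bits).AsByte();
            Vector512<byte> shuffled = Vector512.Shuffle(scalar, shuffleMask);
            Vector512<byte> extracted = shuffled & bitMask;

            // Clamp any nonzero byte to 1 so the result is a valid 0/1 value.
            Vector512<byte> normalized = Vector512.Min(extracted, ones);
            normalized.CopyTo(destination);
        }
        else
        {
            // Scalar fallback with the same observable result.
            for (int i = 0; i < 64; i++)
                destination[i] = (byte)((bits >> i) & 1);
        }
    }
}
```

Because the 64-bit value is broadcast to every element before the shuffle, indices 0 through 7 pick the same bytes whether the shuffle works per 128-bit lane or across the full vector, which is why the swap from `Avx512BW.Shuffle` to `Vector512.Shuffle` should not change the result in this pattern.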

About Vector512.Shuffle(x, y)

runtime/src/libraries/System.Collections/src/System/Collections/BitArray.cs

Lines 928 to 930 in 1a51788

// Same logic as SSSE3 path, except we do not have Shuffle instruction.

// (TableVectorLookup could be an alternative - dotnet/runtime#1277)

// Instead we use chained ZIP1/2 instructions:

It seems that Arm64 might not have the proper instruction to support VectorXXX.Shuffle<byte>(x, y). Could it happen that VectorXXX.IsHardwareAccelerated is true but VectorXXX.Shuffle<byte>(x, y) has poor performance on some platforms?

It seems that Arm64 might not have the proper instruction to support VectorXXX.Shuffle<byte>(x, y).

That comment is very outdated, VectorXXX.Shuffle covers the usage and does the correct thing.

Could it happen that VectorXXX.IsHardwareAccelerated is true but VectorXXX.Shuffle<byte>(x, y) has poor performance on some platforms?

Not for the way we're using it here. Theoretically it could happen if you were on a machine from 2005 and didn't have SSSE3 but hit the Vector128.IsHardwareAccelerated code path. This isn't really a concern for anyone using the current versions of .NET and could be adjusted later if it was a concern.

Today, Vector256.IsHardwareAccelerated strictly implies AVX2 support, and Vector512.IsHardwareAccelerated strictly implies AVX-512F+CD+DQ+BW+VL. Arm64 support for larger vectors comes in the form of Scalable Vector Extensions (SVE), which does not allow acceleration of the fixed-width vector types.

The Vector128 code in BitArray is outdated and could also use the xplat intrinsics.

Actually, once the runtime has proper AVX512 masking support (#87097), the AVX512 path should probably use kmovq k, m64 and use that mask. This gives a reason to promote #87097. Will xplat Vector512 offer the same capability?

Shuffle attempts to normalize behavior across platforms, including zeroing out of range indices and allowing full vector (cross-lane) shuffle/permute. If you don't need that behavior, you can use the new Vector512.ShuffleNative, which will emit the expected vpshufb.

vpermb is actually AVX512_VBMI, not just AVX-512F+CD+DQ+BW+VL

This is not a problem. The baseline ISA set is required for any acceleration of Vector512, but JIT can opportunistically use any ISA supported by the hardware. One of the benefits of the xplat vector APIs is that JIT can optimize or polyfill using whatever instructions are available.
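The normalized behavior of `Shuffle` described above can be seen in a small sketch. The class and method names are hypothetical and .NET 8 or later is assumed. Index 63 reaches into the last 128-bit lane (a cross-lane selection), and index 200 is out of range, so that element is zeroed.

```csharp
using System;
using System.Runtime.Intrinsics;

static class ShuffleSemanticsSketch
{
    // Hypothetical demo: returns the first two bytes of a shuffled vector.
    public static (byte CrossLane, byte OutOfRange) Demo()
    {
        // Source bytes are 100, 101, ..., 163 so they are easy to tell apart from indices.
        byte[] source = new byte[Vector512<byte>.Count];
        for (int i = 0; i < source.Length; i++)
            source[i] = (byte)(100 + i);

        byte[] indices = new byte[Vector512<byte>.Count]; // all zero by default
        indices[0] = 63;   // selects the last byte, which lives in a different 128-bit lane
        indices[1] = 200;  // out of range for a 64-element vector

        Vector512<byte> result = Vector512.Shuffle(
            Vector512.Create<byte>(source),
            Vector512.Create<byte>(indices));

        return (result.GetElement(0), result.GetElement(1)); // (163, 0)
    }
}
```

Code that never passes out-of-range indices and only selects within a lane does not depend on either guarantee, which is the case the comment above has in mind for `ShuffleNative`.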

Since you're touching this anyways, would you want to update this to use the xplat APIs instead?

@tannergooding , should this PR be merged or closed in the interim?

Let's take it, as it is technically correct and will ensure valid behavior in case other runtimes use the same libraries and have their own distinct support. (There's no bug on RyuJIT today due to how we've defined things to work for ourselves.)

Logged #116079 to track the move to xplat APIs

tannergooding · 2025-05-28T22:33:19Z

/ba-g unrelated build failures, simply updated an Isa.IsSupported check to be more correct

Update BitArray.cs

d99d12a

ghost added the area-System.Collections label Apr 18, 2025

dotnet-policy-service bot added the community-contribution label Apr 18, 2025

tannergooding reviewed Apr 18, 2025

Merge branch 'dotnet:main' into patch-1

3269ed8

build-analysis bot mentioned this pull request Apr 25, 2025

System.Globalization.Tests.IdnMappingIdnaConformanceTests failing on Windows #115006

Closed

tfenise mentioned this pull request Apr 25, 2025

Improve System.Collections.BitArray #115069

Closed

tannergooding approved these changes May 28, 2025

tannergooding merged commit cb502bd into dotnet:main May 28, 2025
81 of 85 checks passed

tannergooding mentioned this pull request May 28, 2025

Update BitArray.CopyTo to use the xplat intrinsics #116079

Open

tfenise deleted the patch-1 branch May 29, 2025 15:31
