Implement bitwise SSE ops & _mm_cmp*_ss #103

nominolo · 2017-10-07T22:16:27Z

Implement SSE bitwise AND, OR, AND-NOT, and XOR.
Implement all variants of _mm_cmp*_ss.

alexcrichton · 2017-10-08T02:05:41Z

src/x86/sse.rs

+/// Bitwise AND of packed single-precision (32-bit) floating-point elements.
+#[inline(always)]
+#[target_feature = "+sse"]
+// i586 only seems to generate plain `and` instructions, so ignore it.


This seems a little worrisome, I wonder if we don't have the right codegen for that then?

Er well apparently this is normal!

alexcrichton · 2017-10-08T02:29:07Z

Thanks! Mind rebasing now as well?

LLVM for i586 doesn't seem to generate `andps`, and instead generates 4 `and`s. Similar for the other operations.

alexcrichton · 2017-10-12T14:15:15Z

👍

* Add _mm_{and,andnot,or,xor}_ps * Add _mm_cmpeq_ss * Add _mm_cmplt_ss * Add _mm_cmple_ss * Add _mm_cmpgt_ss * Add _mm_cmpge_ss * Add _mm_cmpneq_ss * Add _mm_cmpnlt_ss * Add _mm_cmpnle_ss * Add _mm_cmpngt_ss * Add _mm_cmpnge_ss * Add _mm_cmpord_ss * Add _mm_cmpunord_ss * Fix _mm_{and,andnot,or,xor}_ps tests for i586 LLVM for i586 doesn't seem to generate `andps`, and instead generates 4 `and`s. Similar for the other operations.

* avx: _mm256_loadu_pd * avx: _mm256_storeu_pd * avx: _mm256_loadu_ps * avx: _mm256_storeu_ps * avx: fix _mm256_storeu_pd and _mm256_storeu_ps * avx: _mm256_loadu_si256 * avx: _mm256_undefined_si256 * avx: _mm256_maskload_pd * avx: _mm256_maskstore_pd * Attempt to fix CI (#108) Need to bring codegen units back to only one for now * [x86] sse4.2 add docs for _SIDD_EQUAL_RANGES (#107) - Add docs for the _SIDD_EQUAL_RANGES mode * Add _MM_TRANSPOSE4_PS pseudo-macro. (#106) This adds a strange macro, which I've replaced with a function, because it seems there are not many better alternatives. Also adds a test, and `#[allow(non_snake_case)]` to `#[simd_test]`. * Fix i586 tests * Implement bitwise SSE ops & _mm_cmp*_ss (#103) * Add _mm_{and,andnot,or,xor}_ps * Add _mm_cmpeq_ss * Add _mm_cmplt_ss * Add _mm_cmple_ss * Add _mm_cmpgt_ss * Add _mm_cmpge_ss * Add _mm_cmpneq_ss * Add _mm_cmpnlt_ss * Add _mm_cmpnle_ss * Add _mm_cmpngt_ss * Add _mm_cmpnge_ss * Add _mm_cmpord_ss * Add _mm_cmpunord_ss * Fix _mm_{and,andnot,or,xor}_ps tests for i586 LLVM for i586 doesn't seem to generate `andps`, and instead generates 4 `and`s. Similar for the other operations. * avx: _mm_maskload_pd * avx: _mm_maskstore_pd * avx: _mm256_maskload_ps * avx: _mm256_maskstore_ps * avx: _mm_maskload_ps, _mm_maskstore_ps * avx: _mm256_movehdup_ps * avx: _mm256_moveldup_ps

alexcrichton reviewed Oct 8, 2017

View reviewed changes

nominolo added 14 commits October 8, 2017 12:11

Add _mm_{and,andnot,or,xor}_ps

324d68e

Add _mm_cmpeq_ss

a5c8878

Add _mm_cmplt_ss

fa6c3a2

Add _mm_cmple_ss

83b9b0a

Add _mm_cmpgt_ss

5d14873

Add _mm_cmpge_ss

7e78a50

Add _mm_cmpneq_ss

b81a686

Add _mm_cmpnlt_ss

016252f

Add _mm_cmpnle_ss

476ce57

Add _mm_cmpngt_ss

8c20361

Add _mm_cmpnge_ss

bc78ea3

Add _mm_cmpord_ss

6147f16

Add _mm_cmpunord_ss

76dcd3c

Fix _mm_{and,andnot,or,xor}_ps tests for i586

02ae6b0

LLVM for i586 doesn't seem to generate `andps`, and instead generates 4 `and`s. Similar for the other operations.

nominolo force-pushed the sse_ops branch from 3fc19b0 to 02ae6b0 Compare October 8, 2017 10:12

alexcrichton merged commit 1cc08d7 into rust-lang:master Oct 12, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implement bitwise SSE ops & _mm_cmp*_ss #103

Implement bitwise SSE ops & _mm_cmp*_ss #103

Uh oh!

nominolo commented Oct 7, 2017

Uh oh!

alexcrichton Oct 8, 2017

Uh oh!

alexcrichton Oct 8, 2017

Uh oh!

alexcrichton commented Oct 8, 2017

Uh oh!

alexcrichton commented Oct 12, 2017

Uh oh!

Uh oh!

Implement bitwise SSE ops & _mm_cmp*_ss #103

Implement bitwise SSE ops & _mm_cmp*_ss #103

Uh oh!

Conversation

nominolo commented Oct 7, 2017

Uh oh!

alexcrichton Oct 8, 2017

Choose a reason for hiding this comment

Uh oh!

alexcrichton Oct 8, 2017

Choose a reason for hiding this comment

Uh oh!

alexcrichton commented Oct 8, 2017

Uh oh!

alexcrichton commented Oct 12, 2017

Uh oh!

Uh oh!