[SIMD] auto-vectorization using instruction usdot

test: https://gcc.godbolt.org/z/f86hxd8cT
```c
#define N 480

unsigned int
f (unsigned int res, signed char *restrict a,
   unsigned char *restrict b)
{
  for (__INTPTR_TYPE__ i = 0; i < N; ++i)
    {
      int av = a[i];
      int bv = b[i];
      signed short mult = av * bv;
      res += mult;
    }
  return res;
}
```

According to the [GCC 12 blog post](https://community.arm.com/arm-community-blogs/b/tools-software-ides-blog/posts/gcc-12), **Armv8.6-A** introduced a new dot-product instruction, [usdot](https://developer.arm.com/documentation/ddi0596/2021-12/SIMD-FP-Instructions/USDOT--by-element---Dot-Product-with-unsigned-and-signed-integers--vector--by-element--), for the case where the two operands differ in signedness (one unsigned, one signed). The instruction is available behind the **+i8mm** compiler flag.

Starting with GCC 12, the auto-vectorizer recognizes the loop above and emits `usdot` for it, while LLVM currently does not.
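
For reference, here is a minimal scalar sketch of what one 128-bit vector `usdot` computes. Each of the four 32-bit accumulator lanes receives the sum of four unsigned-byte by signed-byte products. The function name `usdot_model` is hypothetical and only illustrates the arithmetic; it is not taken from GCC, LLVM, or the Arm headers.

```c
#include <stdint.h>

/* Hypothetical scalar model of the vector form of USDOT on 128-bit
   registers: lane i of acc accumulates the dot product of the four
   unsigned bytes u[4*i .. 4*i+3] with the four signed bytes
   s[4*i .. 4*i+3]. */
static void
usdot_model (int32_t acc[4], const uint8_t u[16], const int8_t s[16])
{
  for (int lane = 0; lane < 4; ++lane)
    for (int k = 0; k < 4; ++k)
      {
        /* Widen both operands to 32 bits before multiplying, so the
           unsigned value is zero-extended and the signed value is
           sign-extended. */
        int32_t uv = u[4 * lane + k];
        int32_t sv = s[4 * lane + k];
        acc[lane] += uv * sv;
      }
}
```

In the test loop, `b` (unsigned char) plays the role of the unsigned operand and `a` (signed char) the signed one. A vectorized version would run this step over 16 elements per iteration and add the four lanes to `res` after the loop.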

[SIMD] auto-vectorization using instruction usdot #63971

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[SIMD] auto-vectorization using instruction usdot #63971

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions