Implement the WavePrefixSum
HLSL Function
#99172
Labels
backend:DirectX
backend:SPIR-V
bot:HLSL
HLSL
HLSL Language Support
metabug
Issue to collect references to a group of similar or related issues.
WavePrefixSum
clang builtin,WavePrefixSum
clang builtin withhlsl_intrinsics.h
WavePrefixSum
toCheckHLSLBuiltinFunctionCall
inSemaChecking.cpp
WavePrefixSum
toEmitHLSLBuiltinExpr
inCGBuiltin.cpp
clang/test/CodeGenHLSL/builtins/WavePrefixSum.hlsl
clang/test/SemaHLSL/BuiltIns/WavePrefixSum-errors.hlsl
int_dx_WavePrefixSum
intrinsic inIntrinsicsDirectX.td
DXILOpMapping
ofint_dx_WavePrefixSum
to121
inDXIL.td
WavePrefixSum.ll
andWavePrefixSum_errors.ll
tests inllvm/test/CodeGen/DirectX/
int_spv_WavePrefixSum
intrinsic inIntrinsicsSPIRV.td
WavePrefixSum
lowering and map it toint_spv_WavePrefixSum
inSPIRVInstructionSelector::selectIntrinsic
.llvm/test/CodeGen/SPIRV/hlsl-intrinsics/WavePrefixSum.ll
DirectX
SPIR-V
OpGroupNonUniformFAdd:
Description:
A floating point add group operation of all Value
operands contributed by active invocations in the
group.
Result Type must be a scalar or vector of floating-point
type.
Execution is a Scope that identifies the group of
invocations affected by this command. It must be Subgroup.
The identity I for Operation is 0. If Operation is
ClusteredReduce, ClusterSize must be present.
The type of Value must be the same as Result Type. The method used
to perform the group operation on the contributed Value(s) from active
invocations is implementation defined.
ClusterSize is the size of cluster to use. ClusterSize must be a
scalar of integer type, whose Signedness operand is 0.
ClusterSize must come from a constant
instruction. Behavior is undefined unless
ClusterSize is at least 1 and a power of 2. If ClusterSize is
greater than the size of the group, executing this instruction
results in undefined behavior.
Capability:
GroupNonUniformArithmetic, GroupNonUniformClustered,
GroupNonUniformPartitionedNV
Missing before version 1.3.
6 + variable
350
<id>
Result Type
Result <id>
Scope <id>
Execution
Group Operation
Operation
<id>
Value
Optional
<id>
ClusterSize
Test Case(s)
Example 1
Example 2
Example 3
HLSL:
Returns the sum of all of the values in the active lanes with smaller indices than this one.
Syntax
Parameters
value
The value to sum up.
Return value
The sum of the values.
Remarks
The order of operations on this routine cannot be guaranteed. So, effectively, the [precise] flag is ignored within it.
A postfix sum can be computed by adding the prefix sum to the current lane's value.
Note that the active lane with the lowest index will always receive a 0 for its prefix sum.
This function is supported from shader model 6.0 in all shader stages.
Examples
On a machine with a wave size of 8, and all lanes active except lanes 0 and 4, the following values would be returned from WavePrefixSum.
See also
Overview of Shader Model 6
Shader Model 6
The text was updated successfully, but these errors were encountered: