================
@@ -1378,19 +1378,85 @@ The AMDGPU backend implements the following LLVM IR
intrinsics.
0: Target default
preference,
1: `Iterative strategy`, and
2: `DPP`.
- If target does not support
the DPP operations (e.g. gfx6/7),
+ If the target does not
support the DPP operations (e.g. gfx6/7),
reduction will be performed
using default iterative strategy.
- Intrinsic is currently only
implemented for i32.
+ Intrinsic is implemented
for i32 and i64 types.
+
+ llvm.amdgcn.wave.reduce.min Similar to
`llvm.amdgcn.wave.reduce.umin`, but performs a signed min
+ reduction on signed
integers.
+ Intrinsic is implemented
for i32 and i64 types.
+
+ llvm.amdgcn.wave.reduce.fmin Similar to
`llvm.amdgcn.wave.reduce.umin`, but performs a floating point min
+ reduction on floating point
values.
+ Intrinsic is implemented
for float and double types.
+ NAN values are not
canonnicalized.
----------------
arsenm wrote:
```suggestion
```
Misleading statement. Should specify it's like llvm.minnum
https://github.com/llvm/llvm-project/pull/175132
_______________________________________________
llvm-branch-commits mailing list
[email protected]
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-branch-commits