On 8/15/23 19:21, juzhe.zh...@rivai.ai wrote:
> For float/double, the in-order fold-left reduction produces the same result as the scalar code.
>
> But for _Float16 it does not. I think the issue is not the reduction itself, but a float16 precision issue.
But if it's a float16 precision issue then I would have expected both the computations for the lhs and rhs values to have suffered similarly.
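To make the distinction concrete, here is a minimal sketch in plain C. It is not the testcase under discussion: the function names and the sample data are hypothetical, and it uses float rather than _Float16 so that it builds on any target. It assumes float arithmetic is evaluated in IEEE binary32 with no excess precision and without -ffast-math. An in-order fold-left sum performs exactly the additions the scalar loop performs, so the two must agree bit for bit however coarse the type is. Only a reassociated sum is free to differ.

```c
#include <stdio.h>

/* Hypothetical sample data, chosen so that rounding is visible in float:
   1e8f is exactly representable, but 1e8f + 1.0f rounds back to 1e8f.  */
const float sample[4] = { 1e8f, 1.0f, -1e8f, 1.0f };

/* In-order (fold-left) sum: ((a[0] + a[1]) + a[2]) + ...
   This is the order a plain scalar loop uses.  */
float
sum_in_order (const float *a, int n)
{
  float acc = 0.0f;
  for (int i = 0; i < n; i++)
    acc += a[i];
  return acc;
}

/* Reassociated sum: two interleaved partial sums, combined at the end.
   This is the kind of reordering an in-order reduction must not do.  */
float
sum_reassociated (const float *a, int n)
{
  float even = 0.0f, odd = 0.0f;
  for (int i = 0; i < n; i++)
    {
      if (i % 2 == 0)
        even += a[i];
      else
        odd += a[i];
    }
  return even + odd;
}

/* Print both results for the sample data.  */
void
print_sums (void)
{
  printf ("in-order:     %g\n", sum_in_order (sample, 4));
  printf ("reassociated: %g\n", sum_reassociated (sample, 4));
}
```

Under the stated assumptions, sum_in_order (sample, 4) gives 1 and sum_reassociated (sample, 4) gives 2. Limited precision affects both, but only a change in evaluation order makes one side diverge from the other.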

But if you're confident it's OK, then I won't object.
jeff
