https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77776
--- Comment #3 from Matthias Kretz <kretz at kde dot org> ---
Did you consider the error introduced by scaling with __amax? I made sure that the division is without error by zeroing the mantissa bits. Here's a motivating example that shows an error of 1 ulp otherwise: https://godbolt.org/z/_U2K7e

About std::fma, how bad is the performance hit if there's no instruction for it?
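For readers following along, here is a minimal sketch of the mantissa-zeroing idea described above. It is not the code from the patch: the names pow2_floor and hypot3_sketch are hypothetical, it assumes IEEE 754 binary64 double and finite inputs, and it does not use std::fma. The point is only that a scale factor which is a power of two makes the division exact (barring underflow of the much smaller components), whereas dividing by the maximum itself generally rounds.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>
#include <cstring>
#include <limits>

// Hypothetical helper: clear the sign and the 52 mantissa bits of a double,
// keeping only the exponent field. For a normal, finite x this yields the
// largest power of two that is <= |x|. For subnormal x it yields 0.
inline double pow2_floor(double x) {
    std::uint64_t bits;
    std::memcpy(&bits, &x, sizeof bits);
    bits &= 0x7FF0000000000000ull;  // exponent bits only
    std::memcpy(&x, &bits, sizeof bits);
    return x;
}

// Sketch of a 3-argument hypot that scales by a power of two instead of
// by the maximum itself. Assumption: all inputs are finite.
inline double hypot3_sketch(double x, double y, double z) {
    assert(std::isfinite(x) && std::isfinite(y) && std::isfinite(z));
    x = std::fabs(x);
    y = std::fabs(y);
    z = std::fabs(z);
    const double amax = std::max({x, y, z});
    if (amax == 0.0)
        return 0.0;

    double scale = pow2_floor(amax);
    if (scale == 0.0)  // amax is subnormal: use the smallest normal power of two
        scale = std::numeric_limits<double>::min();

    // Dividing by a power of two only changes the exponent, so the largest
    // component is scaled without rounding error and lands in [1, 2).
    x /= scale;
    y /= scale;
    z /= scale;
    return scale * std::sqrt(x * x + y * y + z * z);
}
```

With this scaling, the sum of squares stays below 12, so overflow in the intermediate squares is avoided for large inputs such as 1e200, and the rescaling by a power of two does not add a rounding step of its own unless the final result overflows.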