Pocl automatically sets fp-contract=off when generating code. This has
significant implications; in particular, nans are then handled incorrectly.
Pocl should never do this on its own -- this needs to be a user option,
enabled e.g. by pragmas or by choosing fast-math.

A nearby comment states

# With fp-contract we get calls to fma with processors which do not
# have fma instructions. These ruin the performance. Better to have
# the mul+add separated in the IR.

Which architecture and which benchmark is this?

Could we instead use fast-math for benchmarks only?

-erik

-- 
Erik Schnetter <[email protected]>
http://www.perimeterinstitute.ca/personal/eschnetter/
------------------------------------------------------------------------------
This SF.net email is sponsored by Windows:

Build for Windows Store.

http://p.sf.net/sfu/windows-dev2dev
_______________________________________________
pocl-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/pocl-devel

Reply via email to