> Did you see John`s followup that suggested a branch-free fast comparison?

I tried it in Helgrind, and resulted in a slowdown compared to the
obvious "if x < y then -1 else if x > y then 1 else 0", from 2m19
to 2m29 on one test.  gcc generates poor code for it, which I suspect
is getting many partial-register-write stalls.  This is something we
can come back to when the (inevitable) next round of performance tuning
happens.

J

-------------------------------------------------------------------------
SF.Net email is sponsored by:
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://sourceforge.net/services/buy/index.php
_______________________________________________
Valgrind-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/valgrind-developers

Reply via email to