No new Itanium numbers yet ... but I did wonder how this works on multi-socket x86 ... so I tweaked "t.c. to increase threads to 64 to max out my 4-socket Xeon E5-4650 (8 cores/socket 2 threads/core) and also print out the individual scores from each thread.
$ ./t /tmp 64 389827 717666 1540293 130764 681839 33357 606966 33716 33183 33230 69685 76422 352851 34940 257132 34192 34200 34098 34053 34459 234399 33678 241571 545912 620857 65818 32853 739440 33697 683655 741366 36208 385775 446198 45974 33056 403944 717415 254782 166754 702745 43661 1042180 437367 43751 503342 154223 706917 878167 43802 51667 660875 33261 522425 33627 33637 33446 33604 52963 33688 406088 551690 446474 33289 Threads = 64 Total loops: 19109114 Individual thread performance varies from 32853 to 1540293. A factor of 46.9 Sometimes it is good to sacrifice fairness for throughput. But wow! Running for longer [ s/sleep(10)/sleep(300) ] gave things a chance to even out - but I still see a factor of 3.5 between the fastest and the slowest. -Tony -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majord...@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/