Hi! I'm running Julia release 0.3 on an HPC system with two 6-core 2.4 GHz Intel Westmere processors (12 cores in total) per node. My program is significantly slower on that machine than on my Mac (2.4 GHz Intel Core i5), where I run it on 1 core: the runtime is 90 secs on the HPC node vs 60 secs on the Mac.
I would expect the HPC cores to be slightly slower, but I was surprised the difference was that large. Could any of the options I set for the build be the culprit? On my Mac I run the release 0.3 binary, whereas on the HPC system I build from source with these options:

    OPENBLAS_TARGET_ARCH=NEHALEM
    OPENBLAS_DYNAMIC_ARCH=0
    OPENBLAS_USE_THREAD=0
    JULIA_CPU_TARGET=core2

I can build Julia without the final option, but not without the first three. However, I cannot run on multiple nodes without the final option, since I then get the error "Target architecture mismatch. Please delete or regenerate sys.{so,dll,dylib}."