[OMPI users] Oversubscribing in 1.8.3 vs 1.6.5

Eric Chamberland Tue, 9 Dec 2014 12:14:45 -0500 (EST)

Hi,

we routinely oversubscribe, purely for code validation, in the nightly automated parallel runs of our code.

I just compiled Open MPI 1.8.3 and launched the whole suite of sequential/parallel tests, and noticed a *major* slowdown in oversubscribed parallel tests with 1.8.3 compared to 1.6.5.

For example, on my computer (2 CPUs), a validation test of 64 processes launched with 1.8.3 took 1500 seconds (~29 minutes) to execute, while the very same test compiled with 1.6.5 took only 7.4 seconds!

To get this result with 1.6.5 we had to set the variable "OMPI_MCA_mpi_yield_when_idle=1", but it seems to have no effect in 1.8.3 when I launch more processes than the number of cores in my computer, even though it is still documented as working (see http://www.open-mpi.org/faq/?category=running#force-aggressive-degraded). However, when I launch fewer processes than the number of cores, it is faster without "OMPI_MCA_mpi_yield_when_idle=1", which is the same behavior as in 1.6.5.


I tried to launch with a host file like this:

localhost slots=2

but it changed nothing...
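
For reference, the two setups described above amount to roughly the following. This is only a sketch: "./validation_test" is a placeholder for one of our test binaries, the host file is written to a temporary path, and the run is skipped if mpirun or the binary is not present.

```shell
# Placeholder names (assumptions, not our real test names).
TEST_BIN=./validation_test
NP=64

# Ask Open MPI to yield the CPU when idle (the setting we use with 1.6.5).
export OMPI_MCA_mpi_yield_when_idle=1

# Host file declaring 2 slots on the local machine.
HOSTFILE=$(mktemp)
printf 'localhost slots=2\n' > "$HOSTFILE"

if command -v mpirun >/dev/null 2>&1 && [ -x "$TEST_BIN" ]; then
    # Run 1: environment variable only.
    mpirun -np "$NP" "$TEST_BIN"
    # Run 2: same, with the host file.
    mpirun --hostfile "$HOSTFILE" -np "$NP" "$TEST_BIN"
else
    echo "mpirun or $TEST_BIN not found; skipping run"
fi
```

The same parameter can also be given on the command line as "--mca mpi_yield_when_idle 1" instead of through the environment.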

What am I doing wrong?

Is it possible to recover the "performance" of 1.6.5 for oversubscription?

Is there a compilation option that I have to enable in 1.8.3?

Here are the config.log and "ompi_info --all" files for both versions of Open MPI:


http://www.giref.ulaval.ca/~ericc/ompi_bug/config.165.log.gz
http://www.giref.ulaval.ca/~ericc/ompi_bug/config.183.log.gz
http://www.giref.ulaval.ca/~ericc/ompi_bug/ompi_info.all.165.txt.gz
http://www.giref.ulaval.ca/~ericc/ompi_bug/ompi_info.all.183.txt.gz

Thanks,

Eric
