Hi,

Am 26.05.2017 um 17:16 schrieb Roberto Nunnari:

> Hello.
> 
> I need to upgrade our cluster HPC and would like to hear from the experts the 
> caveats and thinks to keep in mind.
> 
> For instance, I heard that at a certain point openmpi (after 1.5.4)

Unless you are the only user on a node, it's usually advisable to turn off the 
automatic core binding in Open MPI: "--bind-to-none" or "--bind-to none" 
(please check the man page for your version). The latter one applies for 2.1.1.

The rationale is, that otherwise each Open MPI job doesn't know anything about 
any other one, and they may bind to the same cores (both counting from core 
zero on). Some cores would be oversubscribed, others are idling.

The Tight Integration of Open MPI into SGE was not automatically detected with 
2.0.x and 2.1.0, but 2.1.1 is fine again. Workarounds exist to get the broken 
ones functional again.


> introduced some changes that cause incompatibilities with my present version 
> of OGE...
> 
> At present:
> queuing system: Oracle Grid Engine 6.2u7
> OS: CentOS 6.4
> openmpi: 1.5.4
> Parallel Envs: smp(only on one node) and orte(can span multiple nodes)
> 
> After upgrade:
> queuing system: Son of Grid Engine 8.1.9
> OS: CentOS 7.3
> openmpi: 1.10.3

Why 1.10.3? Nowadays I would jump to 2.1.1. It's even straight forward to 
compile it and put it in a place of your choice, like /usr/local or /opt/local 
to avoid to mix up with any library from the distribution.


> Parallel Envs: smp(only on one node) and orte(can span multiple nodes)
> 
> I never worked with Son of Grid Engine before.. I'll need to build it all.. 
> then import the configuration from OGE.. and make sure integration with 
> openmpi is working right..

There is a tool to save and load the configuration. But as you are new to SGE, 
I think the best could be to save the text files, but implement all the 
settings by hand. Then you know exactly what was changed where (i.e.: not to 
simply load the configuration, but check the differences compared to the 
default, and then make the necessary adjustments). But the differences between 
OGE and SGE are not huge.

$SGE_ROOT/util/upgrade_modules/{load_sge_config.sh,save_sge_config.sh}

-- Reuti
_______________________________________________
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users

Reply via email to