Le 15/08/2014 14:59, Andrej Prsa a écrit :
> Hi Brice,
>
>> Your kernel looks recent enough, can you try upgrading your BIOS ? You
>> have version 3.0b and there's a 3.5 version at
>> http://www.supermicro.com/aplus/motherboard/opteron6000/sr56x0/h8qg6-f.cfm
> Flashing bios is not the easiest option for us since I'd need to bring
> down the whole cluster that is in heavy use. Also, we're running only
> linux and the bios file on the Supermicro website seems to be for
> windows only. I might be able to get around that using biosdisk,
> though. Are there any adverse side-effects from this bios bug on
> hwloc/mpi operation? I can certainly try and schedule the bios flash
> if you think it's worth it.
>
> Thanks,
> Andrej

Most people don't care about cache when binding with MPI, so you may
just ignore the issue and hide the message by setting
HWLOC_HIDE_ERRORS=1 in the environment. It may work fine (assuming MPIs
don't have troubles with asymmetric topologies where there are some
missing L3).

Otherwise, hwloc can load the topology from XML. So we'll just need to
generate a fixed topology, export it to XML and set an environment
variable to have hwloc load from there. A single file may even be enough
for all similar nodes assuming your MPI and/or applications don't look
in deep details of hwloc topologies.

Brice

Reply via email to