Dear devels,

I have been trying out 1.8.2rcs recently and found a show-stopping
problem on our cluster. Running any job with any number of processors
larger than 32 will always employ only 32 cores per node (our nodes
have 48 cores). We are seeing identical behavior with 1.8.2rc4,
1.8.2rc2, and 1.8.1. Running identical programs shows no such issues
with version 1.6.5, where all 48 cores per node are working. While our
system is running torque/maui, the problem is evident by running mpirun
directly.

I am attaching hwloc topology in case that helps -- I am aware of a
buggy bios code that trips hwloc, but I don't know if that might be an
issue or not. I am happy to help debugging if you can provide me with
guidance.

Thanks,
Andrej

Attachment: cluster.output
Description: Binary data

Attachment: cluster.tar.bz2
Description: application/bzip

Reply via email to