Dear devels,

I have been trying out the 1.8.2 release candidates recently and found a show-stopping problem on our cluster. Any job started with more than 32 processes uses only 32 cores per node, even though our nodes have 48 cores each. We see identical behavior with 1.8.2rc4, 1.8.2rc2, and 1.8.1. The same programs show no such issue with version 1.6.5, where all 48 cores per node are used. Our system runs torque/maui, but the problem is also evident when running mpirun directly.
I am attaching the hwloc topology in case that helps. I am aware of buggy BIOS code that trips up hwloc, but I don't know whether that is a factor here. I am happy to help with debugging if you can provide some guidance.

Thanks,
Andrej
Attachments:
cluster.output (binary data)
cluster.tar.bz2 (application/bzip)