Hi all,
We just released Qlustar 10.1, the full-fledged Cluster OS for HPC
and storage. This is our first release that also supports CentOS with
OpenHPC integration.
It can be downloaded from https://qlustar.com/download
Enjoy,
Roland
---
https://www.q-leap.com / https://qlustar.com
---
--- HPC / Storage / Cloud Linux Cluster OS ---
SJ> I'm sorry it's taking so long -- I'm on it though.
SJ> On 03/24/2017 01:56 PM, Roland Fehrenbacher wrote:
>>>>>>> "SJ" == Sylvain Jeaugey writes:
>> Hi Sylvain,
t CUDA 8. Do you think you could ask your team
members at Nvidia how this new behaviour in libcudart can be suppressed?
BTW: Disabling NVML support for the internal hwloc means that OpenMPI
no longer links in libnvidia-ml.so.x, but it has no effect on the
messages.
Thanks,
to suppress these warnings with a "normal" build? I guess the answer
must be yes, since 1.8.x didn't have this problem. The real question
then would be how ...
Thanks,
Roland
SJ> On 03/21/2017 11:05 AM, Roland Fehrenbacher wrote:
>>>>>>> "SJ" == Sylvain Jeaugey writes:
Hi Sylvain,
I get the "NVIDIA : ..." run-time error messages just by compiling
with "--with-cuda=/usr":
./configure --prefix=${prefix} \
--mandir=${prefix}/share/man \
--infodir=${prefix}/share/info \
--sysconfdir=/etc/openmpi/${VERSION} --with-
nst.
Thanks,
Roland
SJ> On 03/16/2017 04:23 AM, Roland Fehrenbacher wrote:
>> Hi,
>>
>> OpenMPI 2.0.2 built with cuda support brings up lots of warnings
>> like
>>
>> NVIDIA: no NVIDIA devices found
>>
>> when
Hi,
OpenMPI 2.0.2 built with CUDA support emits lots of warnings like
NVIDIA: no NVIDIA devices found
when running on hardware without NVIDIA devices. Is there a way to suppress
these warnings? It would be quite a hassle to maintain separate OpenMPI
builds on clusters where only some machines have GPUs.
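
As a stopgap until the warning can be suppressed at its source, the message can be filtered out of the job output. The following is only a sketch of such a workaround, not a fix and not something proposed in this thread. It assumes the warning appears exactly as the line quoted above. The function name filter_nvidia_warnings is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: drop lines that consist solely of the warning
# text quoted in this thread and pass every other line through unchanged.
filter_nvidia_warnings() {
    # -F: match a fixed string, -x: match whole lines only, -v: invert.
    # grep exits with status 1 when it prints nothing, so "|| true"
    # keeps the helper from reporting a failure in that case.
    grep -v -x -F 'NVIDIA: no NVIDIA devices found' || true
}
```

A possible invocation would be "mpirun -np 4 ./app 2>&1 | filter_nvidia_warnings", where ./app stands for any MPI program. Note that this merges stderr into stdout and only hides the message; the CUDA libraries are still loaded on nodes without GPUs.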
> "Nathan" == Nathan Hjelm writes:
Hi Nathan,
Nathan> I want to close the loop on this issue. 1.8.5 will address
Nathan> it in several ways:
Nathan> - knem support in btl/sm has been fixed. A sanity check was
Nathan> disabling knem during component registration. I wrote t