Dmitry,
I'm not too familiar with the internals of OpenMPI, but I just tried 4.1.5
on NCSA Delta and received the same IPC errors (no mca flags switched).
The actual calls didn't fail this time to perform the actual operation, so
maybe that's an improvement from v4.0.x to v4.1.x?
Thanks,
Mi
There is a relevant explanation of the same issue reported for Julia:
https://github.com/JuliaGPU/CUDA.jl/issues/1053
пт, 30 мая 2025 г. в 19:05, Mike Adams :
> Hi Tommy,
>
> I'm setting btl_smcuda_use_cuda_ipc_same_gpu 0 and btl_smcuda_use_cuda_ipc 0.
>
> So, are you saying that with these param
Hi Tommy,
I'm setting btl_smcuda_use_cuda_ipc_same_gpu 0 and btl_smcuda_use_cuda_ipc 0.
So, are you saying that with these params, it is also not using GPUDirect
RDMA?
PSC Bridges 2 only has v4 OpenMPI, but they may be working on installing v5
now. Everything works on v5 on NCSA Delta - I'll
Hi,
I'm not sure if it's a known issue, in v4.0 possibly, not sure about v4.1
or v5.0 - can you try?
As far as CUDA IPC - how are you disabling it? I don't remember the mca
params in v4.0
If it's either through pml ucx, or smcuda then no, it won't use it.
-Tommy
On Saturday, May 24, 2025 at 8: