Hi Antoine,
That's an interesting result. I believe the problem with datatypes with
gaps is that MPI is not allowed to touch the gaps. My guess is that for
the RMA version of the benchmark the implementation either has to fall
back to an active message that packs the data at the target and sends it
back, or (which seems more likely in your case) transfer each object
separately and skip the gaps. Without more information on your setup
(are you using UCX?) and the benchmark itself (how many elements? what
does the target do?) it's hard to be more precise.
A possible fix would be to drop the MPI datatype for the RMA use and
transfer the vector as a whole, using MPI_BYTE. I think there is also a
way to modify the upper bound of the MPI type to remove the gap, using
MPI_TYPE_CREATE_RESIZED. I expect that this will allow MPI to treat the
gap as part of the element and transfer the vector as a whole. I'm not
sure about the details there, though; maybe someone else can shed some
light.
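Roughly what I have in mind, assuming a struct like the {double, int} in
your benchmark (untested sketch; the helper name and member names are
just for illustration):

```cpp
#include <mpi.h>
#include <cstddef>

struct Elem { double d; int i; };  // 12 bytes of data + 4 bytes of padding

// Build a datatype whose extent spans the whole struct, so that an array
// of Elem looks to MPI like a contiguous sequence of elements.
MPI_Datatype make_elem_type() {
    int          blocklens[2] = { 1, 1 };
    MPI_Aint     disps[2]     = { offsetof(Elem, d), offsetof(Elem, i) };
    MPI_Datatype types[2]     = { MPI_DOUBLE, MPI_INT };
    MPI_Datatype tmp, elem;

    MPI_Type_create_struct(2, blocklens, disps, types, &tmp);
    // The struct type's extent covers only the 12 bytes of data; stretch
    // it to sizeof(Elem) so consecutive elements use the right stride.
    MPI_Type_create_resized(tmp, 0, sizeof(Elem), &elem);
    MPI_Type_commit(&elem);
    MPI_Type_free(&tmp);
    return elem;
}
```

As said, I don't know whether that is enough for the implementation to
use a single contiguous transfer for the RMA case.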
HTH
Joseph
On 3/30/23 18:34, Antoine Motte via users wrote:
Hello everyone,
I recently had to code an MPI application where I send std::vector
contents in a distributed environment. In order to try different
approaches I coded both 1-sided and 2-sided point-to-point
communication schemes, the first one uses MPI_Window and MPI_Get, the
second one uses MPI_SendRecv.
I had a hard time figuring out why my implementation with MPI_Get was
between 10 and 100 times slower, and I finally found out that MPI_Get
is abnormally slow when one uses custom datatypes that include padding.
Attached is a short example where I send a struct {double, int}
(12 bytes of data + 4 bytes of padding) vs a struct {double, int, int}
(16 bytes of data, 0 bytes of padding) with both MPI_SendRecv and
MPI_Get. I got these results:
mpirun -np 4 ./compareGetWithSendRecv
{double, int} SendRecv : 0.0303547 s
{double, int} Get : 1.9196 s
{double, int, int} SendRecv : 0.0164659 s
{double, int, int} Get : 0.0147757 s
I ran it with both Open MPI 4.1.2 and Intel MPI 2021.6 and got the
same results.
Is this result normal? Is there any solution other than adding garbage
at the end of the struct or at the end of the MPI_Datatype to avoid
the padding?
Regards,
Antoine Motte