...@open-mpi.org
Subject: [OMPI users] OMPI CUDA IPC synchronisation/fail-silent problem
Hey all,
to test the performance of my application I duplicated the call to the function
that will issue the computation on two GPUs 5 times. During the 4th and 5th run
of the algorithm, however, the algorithm
Hey all,
to test the performance of my application I duplicated the call to the
function that will issue the computation on two GPUs 5 times. During the
4th and 5th run of the algorithm, however, the algorithm yields
different results (9 instead of 20):
# datatype: double
# datapoints: 2
# ma