On Sat, Aug 31, 2019 at 4:28 PM Smith, Barry F. wrote:
>
> Any explanation for why the scaling is much better for CPUs and than
> GPUs? Is it the "extra" time needed for communication from the GPUs?
>
The GPU work is well load balanced so it weak scales perfectly. When you
put that work in the
Any explanation for why the scaling is much better for CPUs and than GPUs? Is
it the "extra" time needed for communication from the GPUs?
Perhaps you could try the GPU version with Junchao's new MPI-aware CUDA
branch (in the gitlab merge requests) that can speed up the communication from