>
>
> Allocation/free of pinned memory is slow, usually on the order of several
> milliseconds. So these numbers look normal. Is there any opportunity to
> reuse the pinned memory in these functions?
>
Ah, good. So we understand this.
I was thinking of adding a member to the class and allocated i
On Dec 22, 2020, at 3:38 PM, Mark Adams
mailto:mfad...@lbl.gov>> wrote:
I am MPI serial LU solving a smallish matrix (2D, Q3, 8K equations) on a Summit
node (42 P9 cores, 6 V100 GPUs) using cuSparse and Kokkos kernels. The cuSparse
performance is terrible.
I solve the same TS problem in MPI
I am MPI serial LU solving a smallish matrix (2D, Q3, 8K equations) on a
Summit node (42 P9 cores, 6 V100 GPUs) using cuSparse and Kokkos kernels.
The cuSparse performance is terrible.
I solve the same TS problem in MPI serial on each global process. I run
with NP=1 or (all) 7 cores/MPI per GPU:
M
Hello All,
If anyone is on macOS and using brew to get gfortran, you may find it suddenly
doesn’t work anymore.
Fix is to add /Library/Developer/CommandLineTools/SDKs/MacOSX.sdk/usr/lib to
$LIBRARY_PATH in startup shell file, as per this SO post.
https://apple.stackexchange.com/questions/40899