Re: [petsc-dev] problem with MatSeqAIJCUSPARSEILUAnalysisAndCopyToGPU

2020-12-22 Thread Mark Adams
> Allocation/free of pinned memory is slow, usually on the order of several milliseconds. So these numbers look normal. Is there any opportunity to reuse the pinned memory in these functions?
Ah, good. So we understand this. I was thinking of adding a member to the class and allocated i

Re: [petsc-dev] problem with MatSeqAIJCUSPARSEILUAnalysisAndCopyToGPU

2020-12-22 Thread Zhang, Hong via petsc-dev
On Dec 22, 2020, at 3:38 PM, Mark Adams <mfad...@lbl.gov> wrote: I am MPI serial LU solving a smallish matrix (2D, Q3, 8K equations) on a Summit node (42 P9 cores, 6 V100 GPUs) using cuSparse and Kokkos kernels. The cuSparse performance is terrible. I solve the same TS problem in MPI

[petsc-dev] problem with MatSeqAIJCUSPARSEILUAnalysisAndCopyToGPU

2020-12-22 Thread Mark Adams
I am MPI serial LU solving a smallish matrix (2D, Q3, 8K equations) on a Summit node (42 P9 cores, 6 V100 GPUs) using cuSparse and Kokkos kernels. The cuSparse performance is terrible. I solve the same TS problem in MPI serial on each global process. I run with NP=1 or (all) 7 cores/MPI per GPU: M

[petsc-dev] Gfortran Fix for "library not found for -lSystem" macOS Big Sur

2020-12-22 Thread Jacob Faibussowitsch
Hello All, If anyone is on macOS and using brew to get gfortran, you may find it suddenly doesn’t work anymore. Fix is to add /Library/Developer/CommandLineTools/SDKs/MacOSX.sdk/usr/lib to $LIBRARY_PATH in startup shell file, as per this SO post. https://apple.stackexchange.com/questions/40899
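For reference, the fix described above could look roughly like the following line in a startup shell file. This is only a sketch: it assumes a POSIX-style shell such as zsh or bash, and the variable name SDK_LIB is made up for illustration. The path itself is the one quoted in the message.

```shell
# Directory named in the message above (Command Line Tools SDK libraries).
# SDK_LIB is a hypothetical helper variable, not something the post defines.
SDK_LIB="/Library/Developer/CommandLineTools/SDKs/MacOSX.sdk/usr/lib"

# Append it to LIBRARY_PATH. The ${VAR:+...} form avoids a stray leading
# colon when LIBRARY_PATH is empty or unset.
export LIBRARY_PATH="${LIBRARY_PATH:+${LIBRARY_PATH}:}${SDK_LIB}"
```

Which file this belongs in (for example ~/.zshrc or ~/.bash_profile) depends on the shell in use; the message does not say, so that choice is left to the reader.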