Hi PETSc-developers, Does PETSc currently provide (either native or third party support) for MatSolve that can be performed entirely on a GPU given a factored matrix? i.e. a direct solver that would store the factors L and U on the device and use the GPU to solve the linear system. It does not matter if the GPU is not used for the factorization as we intend to solve the same linear system for 100s of iterations and thus try to prevent GPU->CPU transfers for the MatSolve phase.
Currently, I've built PETSc@main (commit 9c433d, 10/03) with superlu-dist@develop, both of which are configured with CUDA. With this, I'm seeing that each call to PCApply/MatSolve involves one GPU->CPU transfer. Is it possible to avoid this? Thank You, Sajid Ali (he/him) | Research Associate Scientific Computing Division Fermi National Accelerator Laboratory s-sajid-ali.github.io