Hi,

I'm trying to accelerate a shift-invert eigensolve with GPU, but the
computation seems to be spending a lot of its time in the CPU. Looking at
the output with "-log_view -log_view_gpu_time" I see that MatLUFactorNum is
not using the GPU (GPU Mflops/s is 0), and is taking the majority of the
computation time. Is LU factorization on the GPU supported? I am currently
applying the command line options "-vec_type cuda -mat_type aijcusparse",
please let me know if there are other options I can apply to accelerate the
LU factorization as well. I tried digging through the documentation but
couldn't find a clear answer.

Thanks in advance!

Kind regards,
Greg KM

-- 
*Gregory D. Kahanamoku-Meyer*
PhD Candidate
quantum computing | cryptography | high-performance computing
Department of Physics
University of California at Berkeley
personal website <https://gregdmeyer.github.io/>

Reply via email to