Hey Karl,

Not too heavy. I've already converted much of this code to remove this package while supporting existing features, though I haven't pushed it into the fork. The real question is whether we want to go down this path or not.

Right now, I think CUSP does not support SpMV in streams. Thus, to get an effective multi-GPU SpMV, one has to rewrite all the SpMV kernels, for all the different storage formats, to use streams. That adds a lot of additional code to support. I would prefer to just call some CUSP API that takes a stream as an input argument, but I don't think that exists at the moment. I'm not sure what to do here. Once the other code is accepted, perhaps we can address this problem then?
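
To be concrete about what I mean, here is a rough sketch (illustration only, not code from my branch) using the cuSPARSE CSR SpMV, which does take a stream through its handle; the function and argument names are just placeholders:

#include <cuda_runtime.h>
#include <cusparse_v2.h>

/* Illustration only: run a CSR SpMV y = A*x in a caller-supplied stream.
 * cuSPARSE attaches the stream to its handle; CUSP has no comparable
 * entry point.  All names here are placeholders. */
void csr_spmv_in_stream(cusparseHandle_t handle, cudaStream_t stream,
                        int m, int n, int nnz,
                        cusparseMatDescr_t descr,
                        const double *val, const int *rowptr,
                        const int *colind,
                        const double *x, double *y)
{
  const double one = 1.0, zero = 0.0;

  /* Subsequent cuSPARSE calls on this handle execute in 'stream'. */
  cusparseSetStream(handle, stream);

  /* CSR SpMV: y = 1.0*A*x + 0.0*y */
  cusparseDcsrmv(handle, CUSPARSE_OPERATION_NON_TRANSPOSE,
                 m, n, nnz, &one, descr, val, rowptr, colind,
                 x, &zero, y);
}

An analogous hook in CUSP, for each of its storage formats, is what I would like to call instead of rewriting every kernel.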


> How 'heavy' is this dependency? Is there some 'blocker' which prevents a complete integration, or is it just not finished yet?


>> I would like to start a discussion on the changes I have made, which primarily affect aijcusparse, mpiaijcusparse, and veccusp. There are two commits that need to be reviewed:



> Alright, I'll comment there.


>> I think (1) should be reviewed first, as this adds most of the serial GPU capability to aijcusparse. The second commit (2) adds changes to veccusp and mpiaijcusparse to get an efficient multi-GPU SpMV.


It works across nodes, but you have to know what you're doing. This is a tough problem to solve universally because it's (almost) impossible to determine the number of MPI ranks per node in an MPI run. I've never seen an MPI function that returns this information.

Right now, a 1-1 pairing between CPU cores and GPUs will work on any system with any number of nodes. I've tested this on a system with 2 nodes and 4 GPUs per node (so "mpirun -n 8 -npernode 4" works).
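
To spell out the pairing, here is a minimal sketch (again, illustration only, not the code in my changes): the node-local rank is derived by comparing hostnames, which is just one common workaround, and it assumes you launch no more ranks per node than there are GPUs:

#include <stdlib.h>
#include <string.h>
#include <mpi.h>
#include <cuda_runtime.h>

/* Sketch: compute a node-local rank by comparing hostnames, then give
 * each MPI rank its own GPU.  Assumes ranks-per-node <= GPUs-per-node,
 * e.g. "mpirun -n 8 -npernode 4" on 2 nodes with 4 GPUs each. */
int bind_rank_to_gpu(MPI_Comm comm)
{
  int rank, size, len, i, local_rank = 0, ndev, dev;
  char name[MPI_MAX_PROCESSOR_NAME];
  char (*all)[MPI_MAX_PROCESSOR_NAME];

  MPI_Comm_rank(comm, &rank);
  MPI_Comm_size(comm, &size);
  memset(name, 0, sizeof(name));
  MPI_Get_processor_name(name, &len);

  all = malloc((size_t)size * sizeof(*all));
  MPI_Allgather(name, MPI_MAX_PROCESSOR_NAME, MPI_CHAR,
                all,  MPI_MAX_PROCESSOR_NAME, MPI_CHAR, comm);

  /* Node-local rank = number of lower global ranks on the same host. */
  for (i = 0; i < rank; i++)
    if (!strcmp(all[i], name)) local_rank++;
  free(all);

  cudaGetDeviceCount(&ndev);
  dev = local_rank % ndev;   /* 1-1 pairing when npernode <= ndev */
  cudaSetDevice(dev);
  return dev;
}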

Thanks,
-Paul
> I assume that this only works on a single node in order to enumerate and initialize the GPUs correctly?

> Best regards,
> Karli
