"Smith, Barry F." <bsm...@mcs.anl.gov> writes: > Sorry, I wasn't clear. Just meant something simpler. Compress the matrix to > copy it to the GPU for faster transfers (and uncompress it appropriately on > the GPU).
Oh, perhaps. Probably not relevant with NVLink (because it's nearly as fast as DRAM), but could be a win for PCI-e.