Could you use a sparse representation for the electron positions rather than a dense scalar grid?
Something like a list of positions, an octree, or some other adaptive mesh might let you fit smaller sub-problems into each work-group and then merge those results back into the global representation with more sequential access patterns.

Karl
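
A minimal sketch of the idea above, in plain Python rather than an actual OpenCL kernel. It assumes a 2D problem, non-negative positions that fall inside the grid, and a grid size that is a multiple of the tile size. The names bin_positions, accumulate_tile and merge_tiles are hypothetical, chosen only for illustration. Each tile stands in for the sub-problem one work-group would handle, and the small dense grid per tile stands in for local memory.

```python
from collections import defaultdict


def bin_positions(positions, tile_size):
    """Group (x, y) positions by containing tile; only occupied tiles are stored."""
    tiles = defaultdict(list)
    for x, y in positions:
        tiles[(int(x) // tile_size, int(y) // tile_size)].append((x, y))
    return tiles


def accumulate_tile(tile_positions, tile_origin, tile_size):
    """Count positions per cell in a small dense grid covering one tile.

    This plays the role of a work-group's local-memory buffer.
    """
    ox, oy = tile_origin
    local = [[0] * tile_size for _ in range(tile_size)]
    for x, y in tile_positions:
        local[int(y) - oy][int(x) - ox] += 1
    return local


def merge_tiles(tiles, grid_size, tile_size):
    """Write each occupied tile into the global grid, one contiguous row slice at a time."""
    grid = [[0] * grid_size for _ in range(grid_size)]
    # Visit tiles in row-major order so writes to the global grid are ordered.
    for tx, ty in sorted(tiles, key=lambda t: (t[1], t[0])):
        ox, oy = tx * tile_size, ty * tile_size
        local = accumulate_tile(tiles[(tx, ty)], (ox, oy), tile_size)
        for row in range(tile_size):
            # Tiles are disjoint, so plain assignment is enough here.
            grid[oy + row][ox:ox + tile_size] = local[row]
    return grid


# Hypothetical example data: four electrons on an 8x8 grid with 4x4 tiles.
positions = [(0.5, 0.2), (1.7, 0.9), (5.1, 6.3), (5.9, 6.8)]
tiles = bin_positions(positions, tile_size=4)
grid = merge_tiles(tiles, grid_size=8, tile_size=4)
```

Only the two occupied tiles are touched here; the empty ones are never visited. In a real kernel the per-tile accumulation would run in local memory, and the merge step would become a coalesced write of each tile's rows.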
