Hi Andrea, On Thu, Jul 19, 2012 at 4:26 PM, Andrea Cesari <[email protected]> wrote: > The problem is that the results match with cpu only for dim_x and dim_y > minor of 32. > For higher dimensions the cpu and gpu results are different.
When you change dim_x and dim_y values, do you also change grid and block size in call to findmin? Judging by the code, they are hardcoded to always be (32,32,1) and (1,1). _______________________________________________ PyCUDA mailing list [email protected] http://lists.tiker.net/listinfo/pycuda
