Thanks for your reply. I've read the first thread you mention, that ends without a solution http://pycuda.2962900.n2.nabble.com/PyCUDA-pycuda-test-failures-tp5320194p5320194.html
Maybe I'm doing a huge mistake but it does not seem to be a precision detail. The following code (a simplification of test_gpuarray), returns 30 from the CPU and 14 from the GTX480, either with integer, float32 or float64. I don't get it. Can anybody explain me what I'm doing wrong please? Thanks import pycuda.autoinit import numpy import pycuda.gpuarray as gpuarray from pycuda.curandom import rand as curand a = numpy.array([1,2,3,4])#.astype(numpy.float32) a_gpu = gpuarray.to_gpu(a) b = a b_gpu = gpuarray.to_gpu(b) dot_ab = numpy.dot(a, b) dot_ab_gpu = gpuarray.dot(a_gpu, b_gpu).get() print "CPU dot product:", dot_ab print "GPU dot product:", dot_ab_gpu -- View this message in context: http://pycuda.2962900.n2.nabble.com/failed-test-gpuarray-on-GTX480-tp5574551p5578236.html Sent from the PyCuda mailing list archive at Nabble.com. _______________________________________________ PyCUDA mailing list PyCUDA@tiker.net http://lists.tiker.net/listinfo/pycuda