Thanks for your reply.
I've read the first thread you mention, which ends without a solution:
http://pycuda.2962900.n2.nabble.com/PyCUDA-pycuda-test-failures-tp5320194p5320194.html

Maybe I'm making a huge mistake, but this does not seem to be a precision
issue.
The following code (a simplification of test_gpuarray) returns 30 on the
CPU and 14 on the GTX480, whether the arrays are integer, float32 or float64.
I don't get it. Can anybody please explain what I'm doing wrong?
Thanks

import pycuda.autoinit  # sets up a CUDA context on import
import numpy
import pycuda.gpuarray as gpuarray

# Append .astype(numpy.float32) or .astype(numpy.float64) to try other dtypes
a = numpy.array([1, 2, 3, 4])
a_gpu = gpuarray.to_gpu(a)
b = a
b_gpu = gpuarray.to_gpu(b)

# Reference result computed on the CPU
dot_ab = numpy.dot(a, b)

# Same dot product computed on the GPU, then copied back to the host
dot_ab_gpu = gpuarray.dot(a_gpu, b_gpu).get()

print("CPU dot product: %s" % dot_ab)
print("GPU dot product: %s" % dot_ab_gpu)
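
One observation, in case it helps: 14 is 1*1 + 2*2 + 3*3, that is, the expected 30 minus the last term 4*4. The plain-Python sketch below involves no GPU at all, and the names products, full and partial are mine, purely for illustration. It only checks that arithmetic. That the last element is the one being skipped is an assumption on my part, and the sketch says nothing about why the GPU result comes up short.

```python
# Plain-Python check of the arithmetic only (no PyCUDA, no GPU).
a = [1, 2, 3, 4]
b = a

# Element-wise products that a dot product would sum up
products = [x * y for x, y in zip(a, b)]

# Sum over all four products: what the CPU reports
full = sum(products)

# Sum with the last product left out (assumption: this is the skipped term)
partial = sum(products[:-1])

print("full sum:    %s" % full)
print("partial sum: %s" % partial)
print("difference:  %s" % (full - partial))
```

If the GPU result always matches the partial sum for other array lengths too, that would at least narrow down where to look.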


