On Tue, 28 Sep 2010 23:56:47 +0200, Tomasz Rybak <bogom...@post.pl> wrote:
> I have an idea for (maybe) checking whether the problem is with PyCUDA,
> the CUDA toolkit, or the driver.
> Can you force PyCUDA to generate 1.x code instead of sm_20?
> I found that the architecture is determined on line 190 of
> pycuda/compiler.py:
> arch = "sm_%d%d" % Context.get_device().compute_capability()
> Try changing it to
> arch = "sm_10"
> and so on, and check whether you still get the incorrect 14 in that
> case.
> 
> If there is a simpler way of changing the architecture for which
> PyCUDA generates code, feel free to use it and share this
> information.

There is a simpler way: arch can be overridden through the arch keyword
argument of SourceModule, so there is no need to edit compiler.py:
http://documen.tician.de/pycuda/driver.html#module-pycuda.compiler
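
A minimal sketch of that override, in case it helps. The kernel, the names
KERNEL_SOURCE, default_arch and compile_for_arch are made up for illustration
and are not part of PyCUDA; only SourceModule and its arch keyword come from
the library. It also assumes the installed toolkit still accepts the
architecture you ask for (sm_10 and sm_13 were fine with the 3.x toolkits).

```python
# Hypothetical trivial kernel, only here so there is something to compile.
KERNEL_SOURCE = """
__global__ void add_one(int *data)
{
    data[threadIdx.x] += 1;
}
"""


def default_arch(compute_capability):
    """Build the arch string the way the quoted compiler.py line does.

    compute_capability is a (major, minor) tuple, e.g. (2, 0) for a C2050.
    """
    return "sm_%d%d" % compute_capability


def compile_for_arch(source, arch=None):
    """Compile source with SourceModule, optionally forcing an architecture.

    arch=None leaves the choice to PyCUDA (it picks the current device's
    compute capability); arch="sm_10" or "sm_13" forces 1.x code generation.
    """
    # Imported lazily so the helper above stays usable without a GPU.
    import pycuda.autoinit  # noqa: F401  (creates a context on import)
    from pycuda.compiler import SourceModule

    return SourceModule(source, arch=arch)
```

On a machine with a GPU, compile_for_arch(KERNEL_SOURCE, arch="sm_10")
returns a module built for 1.0, and compile_for_arch(KERNEL_SOURCE) gives the
default build to compare against.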

I'm getting 30;30 on a C2050 with the 3.2rc drivers/toolkit.

Andreas

_______________________________________________
PyCUDA mailing list
PyCUDA@tiker.net
http://lists.tiker.net/listinfo/pycuda
