Hi guys, Just wondering if anyone had a chance to resolve this yet...
On Fri, Jan 27, 2012 at 2:51 PM, Bogdan Opanchuk <manti...@gmail.com> wrote: > Hi Andreas, > > On Sat, Jan 28, 2012 at 3:23 AM, Andreas Kloeckner > <li...@informa.tiker.net> wrote: > > Indeed, inserting __syncthreads() after the > > shared array declaration brings the error down to more reasonable values > > for me. Jesse, my recommendation would be to use that as a workaround > > while we figure out a more permanent fix. > > Can't we do this: > >> 1. Using "extern __shared__ out_type sdata[]" and setting the size of > >> shared memory when preparing the kernel. > We can pass dtype instead of ctype to > get_reduction_kernel_and_types(), and convert it to ctype + data size > inside. > > Best regards, > Bogdan > > _______________________________________________ > PyCUDA mailing list > PyCUDA@tiker.net > http://lists.tiker.net/listinfo/pycuda >
_______________________________________________ PyCUDA mailing list PyCUDA@tiker.net http://lists.tiker.net/listinfo/pycuda