Hi guys,

Just wondering if anyone had a chance to resolve this yet...


On Fri, Jan 27, 2012 at 2:51 PM, Bogdan Opanchuk <manti...@gmail.com> wrote:

> Hi Andreas,
>
> On Sat, Jan 28, 2012 at 3:23 AM, Andreas Kloeckner
> <li...@informa.tiker.net> wrote:
> > Indeed, inserting __syncthreads() after the
> > shared array declaration brings the error down to more reasonable values
> > for me. Jesse, my recommendation would be to use that as a workaround
> > while we figure out a more permanent fix.
>
> Can't we do this:
> >> 1. Using "extern __shared__ out_type sdata[]" and setting the size of
> >> shared memory when preparing the kernel.
> We can pass dtype instead of ctype to
> get_reduction_kernel_and_types(), and convert it to ctype + data size
> inside.
>
> Best regards,
> Bogdan
>
> _______________________________________________
> PyCUDA mailing list
> PyCUDA@tiker.net
> http://lists.tiker.net/listinfo/pycuda
>
_______________________________________________
PyCUDA mailing list
PyCUDA@tiker.net
http://lists.tiker.net/listinfo/pycuda

Reply via email to