Hi all,
I hope this isn't a stupid question for this list; I've only just
started CUDA programming.
What I want to do is implement the numpy operation J=where(x>x0) for a
GPU array x and a fixed constant x0. I want J to be computed on the GPU
(so that x never has to be copied from the GPU to the CPU) and then
copied back to the CPU. How would I go about doing this?
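To make that concrete, on the CPU side this is the equivalent of (toy data, just for illustration):

```python
import numpy as np

x = np.array([0.1, 2.5, 0.3, 3.0, 0.2], dtype=np.float32)
x0 = 1.0

# J holds the indices where x exceeds the threshold x0.
J, = np.where(x > x0)
print(J)  # -> [1 3]
```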
I was thinking of basically using the GPU's global memory, with a
single thread doing the thresholding operation. This isn't an efficient
way to use the GPU, but I don't see how to do it in parallel. The
thresholding operation is performed many, many times, with the array x
updated (by the GPU) in between, but each individual thresholding is
only expected to return a handful of values. For example, x might be an
array of 30,000 elements, while J might be, say, 5-20 elements.
So my question is basically: how can I allocate space in GPU global
memory using PyCuda, and then copy from that space back to the host? I
couldn't work out how to do this from the docs (or even whether it's
possible).
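To show what I mean, here is an untested sketch of the single-thread
approach I was imagining (the names, sizes, and the kernel itself are
just placeholders I made up; it needs a CUDA-capable GPU to run):

```python
import numpy as np
import pycuda.autoinit  # noqa: F401  (initializes a CUDA context)
import pycuda.driver as cuda
from pycuda.compiler import SourceModule

N = 30000
MAX_J = 64  # more slots than the handful of indices I expect

mod = SourceModule("""
__global__ void threshold(const float *x, int n, float x0,
                          int *j_out, int max_j, int *j_count)
{
    // A single thread walks the array and records indices where x > x0.
    int count = 0;
    for (int i = 0; i < n; i++)
        if (x[i] > x0 && count < max_j)
            j_out[count++] = i;
    *j_count = count;
}
""")
threshold = mod.get_function("threshold")

# Stand-in for the device array my other kernels keep updating.
x = np.random.randn(N).astype(np.float32)
x_gpu = cuda.mem_alloc(x.nbytes)
cuda.memcpy_htod(x_gpu, x)

# Allocate space in GPU global memory for the result.
j_gpu = cuda.mem_alloc(MAX_J * np.dtype(np.int32).itemsize)
count_gpu = cuda.mem_alloc(np.dtype(np.int32).itemsize)

# Run the thresholding with a single thread.
threshold(x_gpu, np.int32(N), np.float32(3.0),
          j_gpu, np.int32(MAX_J), count_gpu,
          block=(1, 1, 1), grid=(1, 1))

# Copy the count and the indices back to the host.
count = np.empty(1, np.int32)
cuda.memcpy_dtoh(count, count_gpu)
J = np.empty(MAX_J, np.int32)
cuda.memcpy_dtoh(J, j_gpu)
J = J[:count[0]]
```

Is this roughly the right way to use mem_alloc/memcpy_dtoh, or is there
a better idiom in PyCuda for getting a small result array back?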
Of course, if anyone has an idea for doing the thresholding in
parallel, that would also be great! :-)
Thanks in advance for any help,
Dan
_______________________________________________
PyCuda mailing list
[email protected]
http://tiker.net/mailman/listinfo/pycuda_tiker.net