Hi Frank,

Frank Ihle <frank.i...@yahoo.de> writes:
> I try to speed up my Python program with a not so trivial algorithm, so 
> I need to know. What is the correct way of transferring a list of lists 
> of floats to the (Py)CUDA Kernel?

Nested, variable-sized structures are generally tricky to map onto
array-shape hardware. You'll likely want to store your data in a
CSR-like data structure:

https://en.wikipedia.org/wiki/Sparse_matrix

Scans (such as the one in PyCUDA) can help significantly with the
resulting index computations.

Hope that helps,
Andreas

_______________________________________________
PyCUDA mailing list
PyCUDA@tiker.net
https://lists.tiker.net/listinfo/pycuda

Reply via email to