Hi Frank, Frank Ihle <frank.i...@yahoo.de> writes: > I try to speed up my Python program with a not so trivial algorithm, so > I need to know. What is the correct way of transferring a list of lists > of floats to the (Py)CUDA Kernel?
Nested, variable-sized structures are generally tricky to map onto array-shape hardware. You'll likely want to store your data in a CSR-like data structure: https://en.wikipedia.org/wiki/Sparse_matrix Scans (such as the one in PyCUDA) can help significantly with the resulting index computations. Hope that helps, Andreas _______________________________________________ PyCUDA mailing list PyCUDA@tiker.net https://lists.tiker.net/listinfo/pycuda