Dear deal.II developers and users, Recently, I spent some spare time assimilating CUDA C programming in the last few months, and I already know very well how to use CUDA stream events to let CPU and kernel (GPU) execution work asynchronously with efficiently overlapping data transfer between CPU and GPU, how to use shared memory to ensure global memory coalescing efficiently, how to map threads to matrix elements either using CARTESIAN x, y, z or a row/column mapping in GPU, as well as how to use shared memory to enhance data reuse. Most importantly, for actual practice, we have a GPU accelerator (i.e. NVIDIA Tesla K40) that was bought last year. I plan to use the CUDA C programming to deal with big data or image identification with artificial intelligence (deep learning) for atmospheric data.
Anyway, I am writing to ask if I could get involved with a deal.II project of people who are working on asynchronous adaptive mesh refinement for acceleration or other performance improvement in deal.II with CUDA C programming. Thanks! Sincerely, Chih-Che -- The deal.II project is located at http://www.dealii.org/ For mailing list/forum options, see https://groups.google.com/d/forum/dealii?hl=en --- You received this message because you are subscribed to the Google Groups "deal.II User Group" group. To unsubscribe from this group and stop receiving emails from it, send an email to dealii+unsubscr...@googlegroups.com. For more options, visit https://groups.google.com/d/optout.