Dear deal.II developers and users,

Recently, I spent some spare time assimilating CUDA C programming in the
last few months, and I already know very well how to use CUDA stream events
to let CPU and kernel (GPU) execution work asynchronously with efficiently
overlapping data transfer between CPU and GPU, how to use shared memory to
ensure global memory coalescing efficiently, how to map threads to matrix
elements either using CARTESIAN x, y, z or a row/column mapping in GPU, as
well as how to use shared memory to enhance data reuse. Most importantly,
for actual practice, we have a GPU accelerator (i.e. NVIDIA Tesla K40) that
was bought last year. I plan to use the CUDA C programming to deal with big
data or image identification with artificial intelligence (deep learning)
for atmospheric data.

Anyway, I am writing to ask if I could get involved with a deal.II project
of people who are working on asynchronous adaptive mesh refinement for
acceleration or other performance improvement in deal.II with CUDA C
programming.

Thanks!

Sincerely,

Chih-Che

-- 
The deal.II project is located at http://www.dealii.org/
For mailing list/forum options, see 
https://groups.google.com/d/forum/dealii?hl=en
--- 
You received this message because you are subscribed to the Google Groups 
"deal.II User Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to dealii+unsubscr...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply via email to