Wait at end of OpenACC asynchronous kernels regions

2022-01-13 Thread Thomas Schwinge
nchronous > kernels regions instead. (Or rather, use structured 'data' (as we're now doing), but with appropriate 'async' clauses.) > The attached patch just adds a "wait" operation before the end of > the enclosing data region. This works, but introduces undesirable > synchronisa

[PATCH 1/3] [og9] Wait at end of OpenACC asynchronous kernels regions

2019-08-13 Thread Julian Brown
This patch provides a workaround for unreliable operation of asynchronous kernels regions on AMD GCN. At present, kernels regions are decomposed into a series of parallel regions surrounded by a data region capturing the data-movement clauses needed by the region as a whole: #pragma acc kernels
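
For illustration only, the decomposition described above and the added wait can be approximated at source level as follows. This is a hand-written sketch, not the compiler's actual output: GCC performs the transformation internally. The function names, the async queue number 1, the `copy`/`present` clauses, and the two loops are all assumptions chosen for the example. Built without OpenACC support, the pragmas are ignored and both functions run serially with the same result.

```c
#include <stddef.h>

/* Source form: one asynchronous kernels region containing two loops.
   The function name and loop bodies are hypothetical.  */
void scale_then_shift_kernels(float *a, size_t n)
{
#pragma acc kernels async(1) copy(a[0:n])
  {
    for (size_t i = 0; i < n; i++)
      a[i] *= 2.0f;
    for (size_t i = 0; i < n; i++)
      a[i] += 1.0f;
  }
  /* The caller still has to wait before reading a[] on the host.  */
#pragma acc wait(1)
}

/* Hand-written approximation of the decomposed form: a data region
   carrying the data-movement clause, wrapped around one parallel
   region per loop.  The 'wait' just before the end of the data region
   corresponds to the workaround described in the patch; waiting on
   queue 1 specifically is an assumption of this sketch.  */
void scale_then_shift_decomposed(float *a, size_t n)
{
#pragma acc data copy(a[0:n])
  {
#pragma acc parallel loop async(1) present(a[0:n])
    for (size_t i = 0; i < n; i++)
      a[i] *= 2.0f;

#pragma acc parallel loop async(1) present(a[0:n])
    for (size_t i = 0; i < n; i++)
      a[i] += 1.0f;

    /* Without this, the data region could end (and copy a[] back)
       while the asynchronous parallel regions are still running.  */
#pragma acc wait(1)
  }
}
```

The wait keeps the copy-out at the end of the data region from racing with the queued parallel regions, at the cost of making the region as a whole synchronous with respect to the host.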