On 09.08.21 15:55, Tobias Burnus wrote:
Now that the GCN/OpenACC patches for this have been committed today, I think it makes sense to add it to the documentation. (I was told that some follow-up items are still pending, but as the feature does work ...)
I think the follow-up patches have now been committed. How about the attached patch? Tobias ----------------- Siemens Electronic Design Automation GmbH; Anschrift: Arnulfstraße 201, 80634 München; Gesellschaft mit beschränkter Haftung; Geschäftsführer: Thomas Heurung, Frank Thürauf; Sitz der Gesellschaft: München; Registergericht München, HRB 106955
gcc-12/changes.html (GCN): >1 workers per gang diff --git a/htdocs/gcc-12/changes.html b/htdocs/gcc-12/changes.html index 479bd6c5..738f4c73 100644 --- a/htdocs/gcc-12/changes.html +++ b/htdocs/gcc-12/changes.html @@ -360,6 +360,10 @@ a work-in-progress.</p> <li>Debug experience with ROCGDB has been improved.</li> <li>Support for the type <code>__int128_t</code>/<code>integer(kind=16)</code> was added.</li> + <li>For offloading, the limitation of using only one wavefront per compute + unit (CU) has been lifted; up to 40 workgroup per CU and 16 wavefronts + per workgroup are supported. Additionally, the number of used wavefronts + and workgroups was tuned for performance.</li> </ul> <!-- <h3 id="arc">ARC</h3> -->