https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104916
--- Comment #2 from Tom de Vries <vries at gcc dot gnu.org> --- Created attachment 52629 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52629&action=edit Attempt, runs into driver internal error FTR, this is an attempt at a fix. It does the "predicate ld/st to only execute in lane 0, and propagate ld result", expect for modes where we run into a problem with producing a shuffle (HFmode, V2SImode), where we use the "wrap each ld/st in two bar.warp.sync" approach instead. For libgomp/testsuite/libgomp.c++/target_teams_distribute__parallel_for__simd.C (copied from OvO) I end up with: ... Linking Link complete: 0.000000ms Link log warning : Stack size for entry function '_Z48test_target_teams_distribute__parallel_for__simdv$_omp_fn$0' cannot be statically determined libgomp: Link error log fatal : Internal error: reference to deleted section ... This may be due to a problem with the patch or the driver, I'm not sure.