[PATCH 3/8] nvptx -muniform-simt

2016-10-14 Thread Alexander Monakov
This patch implements -muniform-simt code generation option, which is used to emit code for OpenMP offloading. The goal is to emit code that can either execute "normally", or can execute in a way that keeps all lanes in a given warp active, their local state synchronized, and observable effects fr

Re: [PATCH 3/8] nvptx -muniform-simt

2016-06-24 Thread Sandra Loosemore
On 06/22/2016 12:39 PM, Alexander Monakov wrote: Ping. On Mon, 13 Jun 2016, Alexander Monakov wrote: Does the following look better? @item -muniform-simt @opindex muniform-simt Generate code that may keep all lanes in each warp active, even when observable effects from execution must appear as

Re: [PATCH 3/8] nvptx -muniform-simt

2016-06-22 Thread Alexander Monakov
Ping. On Mon, 13 Jun 2016, Alexander Monakov wrote: > On Sun, 12 Jun 2016, Sandra Loosemore wrote: > > On 06/09/2016 10:53 AM, Alexander Monakov wrote: > > > +@item -muniform-simt > > > +@opindex muniform-simt > > > +Generate code that allows to keep all lanes in each warp active, even > > > when

Re: [PATCH 3/8] nvptx -muniform-simt

2016-06-13 Thread Alexander Monakov
On Sun, 12 Jun 2016, Sandra Loosemore wrote: > On 06/09/2016 10:53 AM, Alexander Monakov wrote: > > +@item -muniform-simt > > +@opindex muniform-simt > > +Generate code that allows to keep all lanes in each warp active, even when > > Allows *what* to keep? E.g. what is doing the keeping here? If

Re: [PATCH 3/8] nvptx -muniform-simt

2016-06-12 Thread Sandra Loosemore
On 06/09/2016 10:53 AM, Alexander Monakov wrote: [snip] --- a/gcc/doc/invoke.texi +++ b/gcc/doc/invoke.texi @@ -19621,6 +19621,16 @@ for OpenMP offloading, but the option is exposed on its own for the purpose of testing the compiler; to generate code suitable for linking into programs usin

[PATCH 3/8] nvptx -muniform-simt

2016-06-09 Thread Alexander Monakov
This patch implements -muniform-simt code generation option, which is used to emit code for OpenMP offloading. The goal is to emit code that can either execute "normally", or can execute in a way that keeps all lanes in a given warp active, their local state synchronized, and observable effects fr