Re: [PATCH] coroutines: Fix compile error with symmetric transfers [PR94359]

2020-04-14 Thread Iain Sandoe
Rainer Orth  wrote:

>> Rainer Orth  wrote:
 On Sat, Apr 11, 2020 at 03:46:18PM +0100, Iain Sandoe wrote:
> Unfortunately, several targets have ABI constraints that prevent
> an indirect tail-call, which results in the PRs compile error.
>> 
>> So .. after scanning the current published testsuite results I have:
>> 
>> // See PR94359 - some targets are unable to make general indirect tailcalls
>> // for example, between different DSOs.
>> // { dg-xfail-run-if "" { hppa*-*-hpux11* } }
>> // { dg-xfail-run-if "" { ia64-*-linux-gnu } }
>> // { dg-xfail-run-if "" { { lp64 && { powerpc*-linux-gnu } } || { *-*-aix*
>> } } }
>> // { dg-xfail-run-if "" { sparc-*-solaris2* } }
> 
> this is wrong: for one, it needs to allow for 64-bit-default SPARC
> configurations (i.e. sparcv9-*-* and sparc64-*-*), and AFAICS there's
> nothing Solaris-specific here.  So this should be sparc*-*-* instead.
> 
>> I’d say it would be a reasonable idea to get this in sooner rather than
>> later, we
>> can always increase the list (or a target maintainer can trivially add
>> their target)
> 
> Fine with me.  If the list gets much longer or the conditions more
> complex, it might still be worthwhile to use an effective-target keyword
> instead.

This is what I’ve pushed after re-checking on
x86_64-linux/darwin
sparc-solaris11
powerpc64-linux-gnu

I won’t be entirely surprised if the list needs amendment or additions
but that’s realistically what I can test.

thanks for the reviews,

Iain

coroutines: Fix compile error with symmetric transfers [PR94359]

For symmetric transfers to work with C++20 coroutines, it is
currently necessary to tail call the callee coroutine from resume
method of the caller coroutine. The current codegen marks these
resume calls as "MUST_TAIL_CALL" to indicate that the tail call is
required for correctness.

Unfortunately, several targets have ABI constraints that prevent
an indirect tail-call, which results in the PRs compile error.

The change here tests the target sibcall hook for the resume
expression and only marks it as requiring a tail call if that's
supported.

This doesn't fix the underlying problem; that really a solution is
needed to allow the tail-calls (or equivalent) to take place - but
that will be deferred until next stage 1.

gcc/cp/ChangeLog:

2020-04-14  Iain Sandoe  

PR c++/94359
* coroutines.cc (build_actor_fn): Check that the target can
support the resume tailcall before mandating it.

gcc/testsuite/ChangeLog:

2020-04-14  Iain Sandoe  

PR c++/94359
* g++.dg/coroutines/torture/symmetric-transfer-00-basic.C:
Expect a run fail for targets without arbitrary indirect
tail-calls.


diff --git a/gcc/cp/coroutines.cc b/gcc/cp/coroutines.cc
index 57172853639..e4ba642d527 100644
--- a/gcc/cp/coroutines.cc
+++ b/gcc/cp/coroutines.cc
@@ -2376,14 +2376,22 @@ build_actor_fn (location_t loc, tree coro_frame_type, 
tree actor, tree fnbody,
   tree resume = build_call_expr_loc
 (loc, builtin_decl_explicit (BUILT_IN_CORO_RESUME), 1, addr);
 
+  /* In order to support an arbitrary number of coroutine continuations,
+ we must tail call them.  However, some targets might not support this
+ for indirect calls, or calls between DSOs.
+ FIXME: see if there's an alternate strategy for such targets.  */
   /* Now we have the actual call, and we can mark it as a tail.  */
   CALL_EXPR_TAILCALL (resume) = true;
-  /* ... and for optimisation levels 0..1, mark it as requiring a tail-call
- for correctness.  It seems that doing this for optimisation levels that
- normally perform tail-calling, confuses the ME (or it would be logical
- to put this on unilaterally).  */
-  if (optimize < 2)
-CALL_EXPR_MUST_TAIL_CALL (resume) = true;
+  /* Temporarily, switch cfun so that we can use the target hook.  */
+  push_struct_function (actor);
+  if (targetm.function_ok_for_sibcall (NULL_TREE, resume))
+{
+  /* ... and for optimisation levels 0..1, which do not normally tail-
+   -call, mark it as requiring a tail-call for correctness.  */
+  if (optimize < 2)
+   CALL_EXPR_MUST_TAIL_CALL (resume) = true;
+}
+  pop_cfun ();
   resume = coro_build_cvt_void_expr_stmt (resume, loc);
   add_stmt (resume);
 
@@ -3951,7 +3959,7 @@ morph_fn_to_coro (tree orig, tree *resumer, tree 
*destroyer)
 
   push_deferring_access_checks (dk_no_check);
 
-  /* Actor ...  */
+  /* Build the actor...  */
   build_actor_fn (fn_start, coro_frame_type, actor, fnbody, orig, param_uses,
  &local_var_uses, param_dtor_list, initial_await, final_await,
  body_aw_points.await_number, frame_size);

diff --git 
a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C 
b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
index 864846e365c..6f379c8e77a 100644
--- a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
+++ b/gcc/testsuite/g++.dg/coroutines/tort

Re: [PATCH] coroutines: Fix compile error with symmetric transfers [PR94359]

2020-04-14 Thread Rainer Orth
Hi Iain,

> Rainer Orth  wrote:
>>>
>>> On Sat, Apr 11, 2020 at 03:46:18PM +0100, Iain Sandoe wrote:
 Unfortunately, several targets have ABI constraints that prevent
 an indirect tail-call, which results in the PRs compile error.
>
> So .. after scanning the current published testsuite results I have:
>
> // See PR94359 - some targets are unable to make general indirect tailcalls
> // for example, between different DSOs.
> // { dg-xfail-run-if "" { hppa*-*-hpux11* } }
> // { dg-xfail-run-if "" { ia64-*-linux-gnu } }
> // { dg-xfail-run-if "" { { lp64 && { powerpc*-linux-gnu } } || { *-*-aix*
> } } }
> // { dg-xfail-run-if "" { sparc-*-solaris2* } }

this is wrong: for one, it needs to allow for 64-bit-default SPARC
configurations (i.e. sparcv9-*-* and sparc64-*-*), and AFAICS there's
nothing Solaris-specific here.  So this should be sparc*-*-* instead.

> I’d say it would be a reasonable idea to get this in sooner rather than
> later, we
> can always increase the list (or a target maintainer can trivially add
> their target)

Fine with me.  If the list gets much longer or the conditions more
complex, it might still be worthwhile to use an effective-target keyword
instead.

Rainer

-- 
-
Rainer Orth, Center for Biotechnology, Bielefeld University


Re: [PATCH] coroutines: Fix compile error with symmetric transfers [PR94359]

2020-04-14 Thread Iain Sandoe

Hi Rainer,

Rainer Orth  wrote:


On Sat, Apr 11, 2020 at 03:46:18PM +0100, Iain Sandoe wrote:

Unfortunately, several targets have ABI constraints that prevent
an indirect tail-call, which results in the PRs compile error.


So .. after scanning the current published testsuite results I have:

// See PR94359 - some targets are unable to make general indirect tailcalls
// for example, between different DSOs.
// { dg-xfail-run-if "" { hppa*-*-hpux11* } }
// { dg-xfail-run-if "" { ia64-*-linux-gnu } }
// { dg-xfail-run-if "" { { lp64 && { powerpc*-linux-gnu } } || { *-*-aix*  
} } }

// { dg-xfail-run-if "" { sparc-*-solaris2* } }

* I don’t see the armv8l- fail in the results.
* hppa-linux seems OK.
* I’m avoiding writing powerpc and rs6000 aix, since I expect the result to  
be

the same for both.

I’d say it would be a reasonable idea to get this in sooner rather than  
later, we
can always increase the list (or a target maintainer can trivially add  
their target)


happy to test on the platforms I have access to if this seems a reasonable  
start

thanks
Iain



Re: [PATCH] coroutines: Fix compile error with symmetric transfers [PR94359]

2020-04-14 Thread Iain Sandoe

Rainer Orth  wrote:


Hi Segher,


On Sat, Apr 11, 2020 at 03:46:18PM +0100, Iain Sandoe wrote:

Unfortunately, several targets have ABI constraints that prevent
an indirect tail-call, which results in the PRs compile error.



diff --git
a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
index 864846e365c..8211e8250ff 100644
---  
a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
+++  
b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C

@@ -1,4 +1,5 @@
-//  { dg-do run }
+// { dg-do run }
+// { dg-xfail-run-if "no indirect tailcall" { { lp64 && {
powerpc64*-linux-gnu } } || { *-*-solaris2* *-*-aix* } } }


lp64 && powerpc*-*-linux (we have biarch compilers :-) )

The problem is not that there is no indirect tailcall; the problem is
that no tailcall can be done to a routine that (potentially) has a
different TOC.

From rs6000_function_ok_for_sibcall:
 Under the AIX or ELFv2 ABIs we can't allow calls to non-local
 functions, because the callee may have a different TOC pointer to
 the caller and there's no way to ensure we restore the TOC when
 we return.


so shouldn't the above be

 lp64 && powerpc*-*-*

instead to cover AIX, too?  Or are there other PowerPC ABIs that don't
have this issue?


Just for once, Darwin (both PPC and X86) has no problem (doesn’t use the TOC
and has a per-function “got”), so that’s one.

Iain



Re: [PATCH] coroutines: Fix compile error with symmetric transfers [PR94359]

2020-04-14 Thread Rainer Orth
Hi Segher,

> On Sat, Apr 11, 2020 at 03:46:18PM +0100, Iain Sandoe wrote:
>> Unfortunately, several targets have ABI constraints that prevent
>> an indirect tail-call, which results in the PRs compile error.
>
>> diff --git
>> a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
>> b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
>> index 864846e365c..8211e8250ff 100644
>> --- a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
>> +++ b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
>> @@ -1,4 +1,5 @@
>> -//  { dg-do run }
>> +// { dg-do run }
>> +// { dg-xfail-run-if "no indirect tailcall" { { lp64 && {
>> powerpc64*-linux-gnu } } || { *-*-solaris2* *-*-aix* } } }
>
> lp64 && powerpc*-*-linux (we have biarch compilers :-) )
>
> The problem is not that there is no indirect tailcall; the problem is
> that no tailcall can be done to a routine that (potentially) has a
> different TOC.
>
> From rs6000_function_ok_for_sibcall:
>   Under the AIX or ELFv2 ABIs we can't allow calls to non-local
>   functions, because the callee may have a different TOC pointer to
>   the caller and there's no way to ensure we restore the TOC when
>   we return.

so shouldn't the above be

  lp64 && powerpc*-*-*

instead to cover AIX, too?  Or are there other PowerPC ABIs that don't
have this issue?

Rainer

-- 
-
Rainer Orth, Center for Biotechnology, Bielefeld University


Re: [PATCH] coroutines: Fix compile error with symmetric transfers [PR94359]

2020-04-14 Thread Segher Boessenkool
Hi!

On Sat, Apr 11, 2020 at 03:46:18PM +0100, Iain Sandoe wrote:
> Unfortunately, several targets have ABI constraints that prevent
> an indirect tail-call, which results in the PRs compile error.

> diff --git 
> a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C 
> b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
> index 864846e365c..8211e8250ff 100644
> --- a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
> +++ b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
> @@ -1,4 +1,5 @@
> -//  { dg-do run }
> +// { dg-do run }
> +// { dg-xfail-run-if "no indirect tailcall" { { lp64 && { 
> powerpc64*-linux-gnu } } || { *-*-solaris2* *-*-aix* } } }

lp64 && powerpc*-*-linux (we have biarch compilers :-) )

The problem is not that there is no indirect tailcall; the problem is
that no tailcall can be done to a routine that (potentially) has a
different TOC.

>From rs6000_function_ok_for_sibcall:
  Under the AIX or ELFv2 ABIs we can't allow calls to non-local
  functions, because the callee may have a different TOC pointer to
  the caller and there's no way to ensure we restore the TOC when
  we return.


Segher


Re: [PATCH] coroutines: Fix compile error with symmetric transfers [PR94359]

2020-04-13 Thread Rainer Orth
Hi Iain,

> Rainer Orth  wrote:
>
>>> diff --git
>>> a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
>>> b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
>>> index 864846e365c..8211e8250ff 100644
>>> ---  
>>> a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
>>> +++
>>> b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
>>> @@ -1,4 +1,5 @@
>>> -//  { dg-do run }
>>> +// { dg-do run }
>>> +// { dg-xfail-run-if "no indirect tailcall" { { lp64 && {
>>> powerpc64*-linux-gnu } } || { *-*-solaris2* *-*-aix* } } }
>>>
>>> #if __has_include()
>>
>> unfortunately, the dg-xfail-run-if is wrong.  E.g. it causes XPASSes on
>> i386-pc-solaris2.11.
>
> .. so that behaves in a similar way to PPC?
> fail on m64 pass on m32?
> (I don’t have access to any x86 solaris testing)

no, it PASSes for 32 and 64-bit alike, just as e.g. Linux/x86_64.

> according to my testing, sparc solaris fails for m32 and m64 (so the
> condition doesn’t need any multilib discriminator there)

Right, same here.

>> You should base this on the cpu part of the triplet in general, not on
>> the OS.  Besides, according to gcc-testresults postings, the test FAILs
>> on other targets as well: armv8l-unknown-linux-gnueabihf, hppa*, and ia64.
>>
>> Besides, unless you want to introduce an effective-target keyword (with
>> documentation in sourcebuild.texi), probably overkill for a single use,
>> you can have more than one dg-xfail-run-if line to improve readibility.
>
> I’ll take a look at those  and make multiple xfail-run-if’s
> (one per target might be the neatest, and easier for a target maintainer to
> add if one is missed).

I thought about separate ones for cases that need additional conditions
and an alphabetical list of target triplets (just -*-*)
for the rest.

Rainer

-- 
-
Rainer Orth, Center for Biotechnology, Bielefeld University


Re: [PATCH] coroutines: Fix compile error with symmetric transfers [PR94359]

2020-04-13 Thread Iain Sandoe

Hi Rainer,

Rainer Orth  wrote:

diff --git  
a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C  
b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C

index 864846e365c..8211e8250ff 100644
---  
a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
+++  
b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C

@@ -1,4 +1,5 @@
-//  { dg-do run }
+// { dg-do run }
+// { dg-xfail-run-if "no indirect tailcall" { { lp64 && {  
powerpc64*-linux-gnu } } || { *-*-solaris2* *-*-aix* } } }


#if __has_include()


unfortunately, the dg-xfail-run-if is wrong.  E.g. it causes XPASSes on
i386-pc-solaris2.11.


.. so that behaves in a similar way to PPC?
fail on m64 pass on m32?
(I don’t have access to any x86 solaris testing)

according to my testing, sparc solaris fails for m32 and m64 (so the  
condition doesn’t need any multilib discriminator there)



You should base this on the cpu part of the triplet in general, not on
the OS.  Besides, according to gcc-testresults postings, the test FAILs
on other targets as well: armv8l-unknown-linux-gnueabihf, hppa*, and ia64.

Besides, unless you want to introduce an effective-target keyword (with
documentation in sourcebuild.texi), probably overkill for a single use,
you can have more than one dg-xfail-run-if line to improve readibility.


I’ll take a look at those  and make multiple xfail-run-if’s
(one per target might be the neatest, and easier for a target maintainer to  
add if one is missed).


thanks
Iain



Re: [PATCH] coroutines: Fix compile error with symmetric transfers [PR94359]

2020-04-13 Thread Rainer Orth
Hi Iain,

> diff --git 
> a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C 
> b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
> index 864846e365c..8211e8250ff 100644
> --- a/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
> +++ b/gcc/testsuite/g++.dg/coroutines/torture/symmetric-transfer-00-basic.C
> @@ -1,4 +1,5 @@
> -//  { dg-do run }
> +// { dg-do run }
> +// { dg-xfail-run-if "no indirect tailcall" { { lp64 && { 
> powerpc64*-linux-gnu } } || { *-*-solaris2* *-*-aix* } } }
>  
>  #if __has_include()

unfortunately, the dg-xfail-run-if is wrong.  E.g. it causes XPASSes on
i386-pc-solaris2.11.

You should base this on the cpu part of the triplet in general, not on
the OS.  Besides, according to gcc-testresults postings, the test FAILs
on other targets as well: armv8l-unknown-linux-gnueabihf, hppa*, and ia64.

Besides, unless you want to introduce an effective-target keyword (with
documentation in sourcebuild.texi), probably overkill for a single use,
you can have more than one dg-xfail-run-if line to improve readibility.

Rainer

-- 
-
Rainer Orth, Center for Biotechnology, Bielefeld University


Re: [PATCH] coroutines: Fix compile error with symmetric transfers [PR94359]

2020-04-13 Thread Nathan Sidwell

On 4/11/20 10:46 AM, Iain Sandoe wrote:

Hi Folks,
sorry for the long CC list - please feel free to ignore if you don’t care :)

I propose that this PR should be re-categorized as a “C++” one.

The reason is that this is not an oversight in the GCC implementation,
but a problem present in the general case.  Library implementors feel
strongly that absence of symmetric transfer on important targets is a
Bad Thing.

===   possibilities to resolve this …..

We have, in the case of coroutine tail-calls, a useful factor in that all
such calls pass a single pointer, to the coroutine state frame.  That
frame is initially set up in the DSO that spawns a given coroutine actor
function.  Thus, at the point the frame is built, we have access to the
TOC, GOT, PLT for the relevant DSO.

It would seem excessive to resort to some kind of trampoline when we
already have somewhere to put the data we need to restore.

So .. what I’d like to do is to prototype a solution (probably on PPC) and
then take the necessary (coroutine) ABI amendments to the “coroutines
ABI group” (I am the editor of the current doc - which doesn’t have any
ps-component).

=== band-aid to fix the PR for stage 4.

tested on x86_64-linux, darwin (no loss of tail-call)
powerpc64-linux-gnu (tail call is OK on m32, and bypassed on m64)
solaris2.11 (tail call is bypassed on m32 and 64).

OK for master?
thanks
Iain

=== this is the commit message.

For symmetric transfers to work with C++20 coroutines, it is
currently necessary to tail call the callee coroutine from resume
method of the caller coroutine.  The current codegen marks these
resume calls as "MUST_TAIL_CALL" to indicate that the tail call is
required for correctness.

Unfortunately, several targets have ABI constraints that prevent
an indirect tail-call, which results in the PRs compile error.

The change here tests the target sibcall hook for the resume
expression and only marks it as requiring a tail call if that's
supported.

This doesn't fix the underlying problem; that really a solution is
needed to allow the tail-calls (or equivalent) to take place - but
that will be deferred until next stage 1.


This is fine from my PoV for gcc 10.

nathan

--
Nathan Sidwell