On Fri, 13 Nov 2020, Ard Biesheuvel <[email protected]> wrote:
On Fri, 13 Nov 2020 at 12:05, Adrian Ratiu <[email protected]> wrote:

Hi Ard, On Fri, 13 Nov 2020, Ard Biesheuvel <[email protected]> wrote: > On Thu, 12 Nov 2020 at 22:23, Adrian Ratiu > <[email protected]> wrote: >> >> From: Nathan Chancellor <[email protected]> >> >> Drop warning because kernel now requires GCC >= v4.9 after >> commit 6ec4476ac825 ("Raise gcc version requirement to >> 4.9"). >> >> Reported-by: Nick Desaulniers <[email protected]> >> Signed-off-by: Nathan Chancellor <[email protected]> >> Signed-off-by: Adrian Ratiu <[email protected]> > > Again, this does not do what it says on the tin. > > If you want to disable the pragma for Clang, call that out in > the commit log, and don't hide it under a GCC version change. I am not doing anything for Clang in this series. The option to auto-vectorize in Clang is enabled by default but doesn't work for some reason (likely to do with how it computes the cost model, so maybe not even a bug at all) and if we enable it explicitely (eg via a Clang specific pragma) we get some warnings we currently do not understand, so I am not changing the Clang behaviour at the recommendation of Nick. So this is only for GCC as the "tin" says :) We can fix clang separately as the Clang bug has always been present and is unrelated.

But you are adding the IS_GCC check here, no? Is that equivalent? IOW, does Clang today identify as GCC <= 4.6?

I see what you mean now. Thanks.

Clang identifies as GCC <= 4.6 yes, so the code is not strictly speaking equivalent. The warning to upgrade GCC doesn't make sense for Clang but I should mention removing it in the commit message as well.

>
> Without the pragma, the generated code is the same as the
> generic code, so it makes no sense to build xor-neon.ko at all,
> right?
>

Yes that is correct and that is the reason why in v1 I opted to
not build xor-neon.ko for Clang anymore, but that got NACKed, so
here I'm fixing the low hanging fruit: the very obvious & clear
GCC problems.



Fair enough.

>> ---
>>  arch/arm/lib/xor-neon.c | 9 +--------
>>  1 file changed, 1 insertion(+), 8 deletions(-)
>>
>> diff --git a/arch/arm/lib/xor-neon.c b/arch/arm/lib/xor-neon.c
>> index b99dd8e1c93f..e1e76186ec23 100644
>> --- a/arch/arm/lib/xor-neon.c
>> +++ b/arch/arm/lib/xor-neon.c
>> @@ -19,15 +19,8 @@ MODULE_LICENSE("GPL");
>>   * -ftree-vectorize) to attempt to exploit implicit parallelism and emit
>>   * NEON instructions.
>>   */
>> -#if __GNUC__ > 4 || (__GNUC__ == 4 && __GNUC_MINOR__ >= 6)
>> +#ifdef CONFIG_CC_IS_GCC
>>  #pragma GCC optimize "tree-vectorize"
>> -#else
>> -/*
>> - * While older versions of GCC do not generate incorrect code, they fail to
>> - * recognize the parallel nature of these functions, and emit plain ARM 
code,
>> - * which is known to be slower than the optimized ARM code in asm-arm/xor.h.
>> - */
>> -#warning This code requires at least version 4.6 of GCC
>>  #endif
>>
>>  #pragma GCC diagnostic ignored "-Wunused-variable"
>> --
>> 2.29.2
>>

Reply via email to