[Bug tree-optimization/89653] Missing vectorization of loop containing std::min/std::max and temporary

2022-11-01 Thread moritz.kreutzer at siemens dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89653 --- Comment #11 from Moritz Kreutzer --- I am currently out of the office, with limited to no email access. I will be returning on November 28. For urgent questions regarding ARM64 support please contact Julian Hornich, for GPGPU-related issues

[Bug c++/91819] New: ICE when iterating over enum values

2019-09-19 Thread moritz.kreutzer at siemens dot com
++ Assignee: unassigned at gcc dot gnu.org Reporter: moritz.kreutzer at siemens dot com Target Milestone: --- Created attachment 46899 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=46899&action=edit Preprocessed source and backtrace Hi, we are getting an ICE with the latest trunk of

[Bug tree-optimization/91198] GCC not generating AVX-512 compress/expand instructions

2019-07-19 Thread moritz.kreutzer at siemens dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91198 --- Comment #4 from Moritz Kreutzer --- > How would a vectorized version with the intrinsic look like? Something along the lines of (assuming insize is a multiple of 16): __mmask16 mask;

[Bug tree-optimization/91198] GCC not generating AVX-512 compress/expand instructions

2019-07-18 Thread moritz.kreutzer at siemens dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91198 --- Comment #2 from Moritz Kreutzer --- Sure, I should have said that I'm talking about auto vectorization. I'm aware that we could use intrinsics, but of course that'll always be our last resort for obvious reasons.

[Bug tree-optimization/91198] New: GCC not generating AVX-512 compress/expand instructions

2019-07-18 Thread moritz.kreutzer at siemens dot com
Priority: P3 Component: tree-optimization Assignee: unassigned at gcc dot gnu.org Reporter: moritz.kreutzer at siemens dot com Target Milestone: --- We have a simple loop to select values based on a condition from one array and store the selected values contiguously
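
For readers skimming this index, the kind of loop described in this report can be sketched roughly as follows. This is only an illustration and not the reporter's code: the function name `compress_positive`, the `float` element type, and the `> 0` condition are assumptions, since the snippet above only says that values are selected "based on a condition". The write index `j` depends on how many earlier elements passed the test, which is what makes such loops hard to auto-vectorize without a compress-store operation such as the ones AVX-512 provides.

```cpp
#include <cstddef>

// Hypothetical scalar "compress" loop (illustrative only): copy the elements
// of `in` that satisfy a condition to the front of `out`, preserving order.
// The condition (in[i] > 0) is an assumption made for this sketch.
// `out` must have room for at least `n` elements.
// Returns the number of elements written.
std::size_t compress_positive(const float* in, float* out, std::size_t n)
{
    std::size_t j = 0;  // next free slot in `out`; advances only on a match
    for (std::size_t i = 0; i < n; ++i) {
        if (in[i] > 0.0f) {
            out[j++] = in[i];
        }
    }
    return j;
}
```

Calling it on the six values 1, -2, 3, 0, -5, 6 writes 1, 3, 6 to the start of `out` and returns 3.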

[Bug tree-optimization/89653] Missing vectorization of loop containing std::min/std::max and temporary

2019-03-25 Thread moritz.kreutzer at siemens dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89653 --- Comment #7 from Moritz Kreutzer --- Thanks for taking this up Richard! I just want to check back: Do you need any assistance with testing or more information from my side?

[Bug c++/89653] New: Missing vectorization of loop containing std::min/std::max and temporary

2019-03-11 Thread moritz.kreutzer at siemens dot com
Priority: P3 Component: c++ Assignee: unassigned at gcc dot gnu.org Reporter: moritz.kreutzer at siemens dot com Target Milestone: --- Godbolt worksheet: https://godbolt.org/z/F6m5hl GCC (trunk and all earlier versions) fails to vectorize (SSE/AVX2/AVX-512

[Bug tree-optimization/89618] Inner loop won't vectorize unless dummy statement is included

2019-03-07 Thread moritz.kreutzer at siemens dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89618 --- Comment #3 from Moritz Kreutzer --- Great, thanks for the quick action Richard!

[Bug c/89618] New: Inner loop won't vectorize unless dummy statement is included

2019-03-07 Thread moritz.kreutzer at siemens dot com
Priority: P3 Component: c Assignee: unassigned at gcc dot gnu.org Reporter: moritz.kreutzer at siemens dot com Target Milestone: --- We have a loop in which we are scattering data to an array of length "n" where we can assure no write conflicts only within confi

[Bug tree-optimization/88464] AVX-512 vectorization of masked scatter failing with "not suitable for scatter store"

2018-12-17 Thread moritz.kreutzer at siemens dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88464 --- Comment #16 from Moritz Kreutzer --- I can confirm the fix from my side. Thanks again!

[Bug tree-optimization/88464] AVX-512 vectorization of masked scatter failing with "not suitable for scatter store"

2018-12-14 Thread moritz.kreutzer at siemens dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88464 --- Comment #11 from Moritz Kreutzer --- Jakub, I can confirm it's working for masked gathers (we have a similar pattern elsewhere in our code) with the latest trunk. Thanks for looking at the scatters as well!

[Bug tree-optimization/88464] AVX-512 vectorization of masked scatter failing with "not suitable for scatter store"

2018-12-13 Thread moritz.kreutzer at siemens dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=88464 --- Comment #8 from Moritz Kreutzer --- Thanks for the input and for confirming that "for conditional ones (both MASK_LOAD and MASK_STORE) the support for the cases when using a mask register rather than a vector register with mask either hasn't