https://gcc.gnu.org/bugzilla/show_bug.cgi?id=82498
--- Comment #10 from Marc Glisse <glisse at gcc dot gnu.org> --- f1...f6 already have a LROTATE_EXPR in the .original dump. The others don't get one until forwprop1, which is after einline, so there is a small chance of inlining causing other optimizations that mess with rotate detection (or the large-ish code before rotate is recognized may prevent early inlining, missing optimizations). I guess without going through the large job of moving the rotate code from forwprop to match.pd it would be possible to add one basic transform to recognize precisely the case in those intrinsics, if we pick one in f7...f11.