https://gcc.gnu.org/bugzilla/show_bug.cgi?id=51980
christophe.lyon at st dot com changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |christophe.lyon at st dot com

--- Comment #14 from christophe.lyon at st dot com ---
As of current trunk the softfp case looks like this:

sqrlen4D_16u8:
        vmov    d16, r0, r1  @ v16qi
        vmov    d17, r2, r3
        vld1.64 {d18-d19}, [sp:64]
        vabd.u8 q9, q8, q9
        vmull.u8        q8, d18, d18
        vmull.u8        q9, d19, d19
        vuzp.32 q8, q9
        vpaddl.u16      q8, q8
        vpadal.u16      q8, q9
        vmov    r0, r1, d16  @ v4si
        vmov    r2, r3, d17
        bx      lr

which looks quite good.
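
For readers following the listing, a scalar C model of what the instruction sequence appears to compute may help. This is only a reading of the assembly, not the test case attached to the report: the function name `sqrlen4d_16u8_ref` and its array-based signature are assumptions made for illustration. It treats the two 16-byte inputs as four 4-component u8 vectors and returns the squared length of each difference as a 32-bit value.

```c
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical scalar reference, inferred from the assembly listing.
 * a and b each hold four 4D vectors of unsigned bytes; out receives
 * the squared Euclidean length of each per-vector difference. */
void sqrlen4d_16u8_ref(const uint8_t a[16], const uint8_t b[16],
                       uint32_t out[4])
{
    for (int v = 0; v < 4; ++v) {
        uint32_t sum = 0;
        for (int c = 0; c < 4; ++c) {
            /* vabd.u8: absolute difference of each byte lane */
            int d = abs((int)a[4 * v + c] - (int)b[4 * v + c]);
            /* vmull.u8 squares the difference into a 16-bit lane;
             * vuzp.32 + vpaddl.u16 + vpadal.u16 then sum the four
             * squares of each vector into one 32-bit lane. */
            sum += (uint32_t)(d * d);
        }
        out[v] = sum;
    }
}
```

Under this reading, the `vuzp.32` separates the first and second pair of squares of each vector so that one pairwise add-long and one pairwise add-accumulate are enough to finish the reduction.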