On Sun, Apr 26, 2020 at 5:25 PM Dave Love <lovesh...@fedoraproject.org>
wrote:

> Kevin Kofler <kevin.kof...@chello.at> writes:
>
> > Has anyone (upstream or elsewhere) ever looked into doing an SSE2
> version of
> > the vector code? It should be faster than scalar (especially considering
> > that the "scalar" floating-point code (under the default -mfpmath=sse)
> > actually loads everything into SSE2 registers as well, but does not
> actually
> > make use of the vectorization) and it would match the baseline of many
> > distributions and upstreams out there.
>
> What's preventing vectorization with sse2 (or other architecture' base
> SIMD) anyhow, if anything?  Use something
> like
>
> gcc -Ofast -fopt-info-vec-missed
>

I can't comment on the exact command line used, but I did experiment with a
recent pull request adding SSE 4.1

Full details here:

https://github.com/drowe67/LPCNet/pull/25

Thanks,
Richard
_______________________________________________
devel mailing list -- devel@lists.fedoraproject.org
To unsubscribe send an email to devel-le...@lists.fedoraproject.org
Fedora Code of Conduct: 
https://docs.fedoraproject.org/en-US/project/code-of-conduct/
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: 
https://lists.fedoraproject.org/archives/list/devel@lists.fedoraproject.org

Reply via email to