Let me just mention in this thread the volk package. Vector Optimized Library of Kernels implements a high level set of DSP operations, with fast implementations chosen at runtime depending on available CPU SIMD instructions.
There is always a generic fallback implementation that runs on any CPU. So a tool looking for special instructions will certainly find them in volk. -Maitland