On 02/09/2015 07:40, Amit Shah wrote:
>> The buffer_find_nonzero_offset() will be called to check the zero page
>> during live migration, it's a hot function. buffer_find_nonzero_offset()
>> has already been optimized with SSE2 instructions, for platform that
>> supports AVX2, we can optimize this function with AVX2 instructions and
>> achieve about 25% performance gain.
> This should be a good improvement.  I recall Dave and I had a chat
> about this in the past too.
> 
> I've not yet reviewed the patchset, but I doubt anyone will have
> objections.  I'll review this shortly.

I think we need a better way to enable it than a configure option,
however.  AVX2 machines are rare, so distro packages would be built
with the option off, and no one would end up using it except perhaps
Gentoo or other source-based distros.

Perhaps something like the GCC ifunc attribute, so that the AVX2
version is picked at run time on machines that support it?
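
To make the ifunc idea concrete, here is a rough sketch of what I have in
mind. It is only an illustration: all the names are made up and are not the
ones in the patch, both variants are plain byte loops standing in for the real
SSE2 and AVX2 code, and it assumes GCC on x86 GNU/Linux with glibc (ifunc
needs ELF and support from the dynamic loader).

```c
#include <stddef.h>

/* Hypothetical names; this is not the code from the patch series. */
typedef size_t find_nonzero_fn(const unsigned char *buf, size_t len);

/* Baseline variant: placeholder for the existing SSE2 path. */
static size_t find_nonzero_offset_baseline(const unsigned char *buf,
                                           size_t len)
{
    size_t i;

    for (i = 0; i < len; i++) {
        if (buf[i]) {
            break;
        }
    }
    return i;   /* equals len when the whole buffer is zero */
}

/* AVX2 variant: placeholder body, but built with AVX2 code generation
 * enabled for this one function only, so no global -mavx2 is needed. */
__attribute__((target("avx2")))
static size_t find_nonzero_offset_avx2(const unsigned char *buf, size_t len)
{
    size_t i;

    for (i = 0; i < len; i++) {
        if (buf[i]) {
            break;
        }
    }
    return i;
}

/* Resolver: run once by the dynamic loader when the symbol is bound.
 * It may run before constructors, hence the explicit __builtin_cpu_init(). */
static find_nonzero_fn *resolve_find_nonzero_offset(void)
{
    __builtin_cpu_init();
    if (__builtin_cpu_supports("avx2")) {
        return find_nonzero_offset_avx2;
    }
    return find_nonzero_offset_baseline;
}

/* Callers use this symbol; the loader binds it to the resolver's choice. */
size_t find_nonzero_offset(const unsigned char *buf, size_t len)
    __attribute__((ifunc("resolve_find_nonzero_offset")));
```

Callers just call find_nonzero_offset() as usual. The same binary then runs
on machines without AVX2 and uses the faster variant where it is available,
with no configure option involved.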

Paolo
