Re: [Pixman] [PATCH 2/2] sse2, mmx: Remove initial unaligned loops in fetchers

2013-09-04 Thread Søren Sandmann
Siarhei Siamashka siarhei.siamas...@gmail.com writes: On Thu, 29 Aug 2013 05:59:26 +0200 sandm...@cs.au.dk (Søren Sandmann) wrote: Siarhei Siamashka siarhei.siamas...@gmail.com writes: With this new alignment assumption, such an optimization becomes even more impossible,

Re: [Pixman] [PATCH 2/2] sse2, mmx: Remove initial unaligned loops in fetchers

2013-09-04 Thread Bill Spitzak
Søren Sandmann wrote: Here is another proposal, but I'm not sure it's really better: - The combiners are made to return a buffer. The returned buffer is expected to contain the combined result and may be any of the passed src/mask/dest buffers. Almost all combiners will continue to combine

Re: [Pixman] [PATCH] sse2: faster bilinear scaling (pack 4 pixels to write with MOVDQA)

2013-09-04 Thread Søren Sandmann
Siarhei Siamashka siarhei.siamas...@gmail.com writes: The loops are already unrolled, so it was just a matter of packing 4 pixels into a single XMM register and doing aligned 128-bit writes to memory via MOVDQA instructions for the SRC compositing operator fast path. For the other fast paths,

[Pixman] [PATCH] Drop support for 8-bit precision in bilinear filtering

2013-09-04 Thread Søren Sandmann
From: Søren Sandmann Pedersen s...@redhat.com The default has been 7-bit for a while now, and the quality improvement with 8-bit precision is not enough to justify keeping the code around as a compile-time option. --- pixman/pixman-mmx.c | 35 +++