Jonathan Morton <[email protected]> writes:

> Actually, I took the trouble to optimise some of the PDF operators
> while converting them, with the intended side-effect of also making
> them more readable.  While doing so, I realised that component-alpha
> implementation of even the HSL filters was quite reasonable - just
> calculate the resultant colour independently of the alpha, and mask
> off the result in the same way as the other PDF operators.  This
> neatly fills in a generality hole.

Right, leaving those combinations out was probably a mistake.

> > In any case, we'll almost certainly want to accelerate this pipeline
> > not only with NEON, but also SSE and AVX, so regardless of how it
> > eventually gets integrated, that's worth keeping in mind. To do this
> > properly, we'll need to solve the problem of how to install CPU
> > specific fetchers.
> 
> This may be a question of whether, on a specific CPU, using a
> vector-int-to-float conversion is faster than the three or four table
> lookups as implemented here.  A scalar int-to-float conversion is
> almost certainly slower.

I certainly hope that the solution in the patch we've seen so far is
not the final one, because it looks to me like it allocates half a
megabyte *per process*, just for conversion tables.

Even half a megabyte statically allocated in the binary is probably
too much.


Soren
_______________________________________________
Pixman mailing list
[email protected]
http://lists.freedesktop.org/mailman/listinfo/pixman

Reply via email to