Hi Vince,
I'm happy to include the patch,
I just have a few unclarities, hope somebody can clear them..
(1) memcpy is speed tested with (I think) aligned accesses (based on
D_MALLOC adresses) but I think we'll see a lot of unaligned memcpy's
too, but that side of the implementation looks kinda weak.. Anyone care
to give some figures for unaligned copy? Have a look at
direct_find_best_memcpy() in lib/direct/memcpy.c, and fidget a bit with
buf1 and buf2.
(2) what happens on a big-endian ARM if I just include the patch? Having
trouble finding this dependancy in the patch.. Will need to fix this, or
put a show stopper somewhere for big-endian, so the patch doesn't break
something.
Greets
Niels
vince wrote:
Hello,
Ive been working on trying to improve the performance of directfb 1.3.0
on the arm platform. The attached patch will replace the default libc
memcpy with a faster implementation. Ive tested this patch using an
AT91RM9200, but should work on other ARM targets.
Hope this will be useful to others.
Regards,
Vince
------------------------------------------------------------------------
_______________________________________________
directfb-dev mailing list
directfb-dev@directfb.org
http://mail.directfb.org/cgi-bin/mailman/listinfo/directfb-dev
--
.------------------------------------------.
| DirectFB - Hardware accelerated graphics |
| http://www.directfb.org/ |
"------------------------------------------"
_______________________________________________
directfb-dev mailing list
directfb-dev@directfb.org
http://mail.directfb.org/cgi-bin/mailman/listinfo/directfb-dev