Re: [fpc-pascal] Implementing AggPas with PtcGraph

Nikolay Nikolov Wed, 21 Jun 2017 16:47:26 -0700


On 06/22/2017 02:42 AM, Nikolay Nikolov wrote:

On 06/22/2017 01:21 AM, James Richters wrote:
putimage can be accelerated, although it would still have to do a memory copy.
Like this?
https://github.com/Zaaphod/ptcpas/compare/Zaaphod_Custom?expand=1#diff-fb31461e009ff29fda5c35c5115978b4
This is amazingly faster. I ran a test of just ptcgraph.putimage() in a loop, putting the same image over and over 1000 times and timing it. The original ptcgraph.putimage() took 18.017 seconds. After I applied this, the same loop took 1.056 seconds. Quite an improvement! It's still nowhere near as fast as just drawing stuff with ptcgraph directly, but for doing a memory copy of the entire screen, it's very fast.
Yes, that's a good start. That was exactly what I meant :)
I have an idea on how I could speed it up even further....
If I set up a second array with 1 bit per pixel, then (somehow) aggpas could set bits in this array to 1 whenever it changed a corresponding pixel. Now by analyzing the 'pixel changed' array one word at a time (or maybe a longword or qword at a time), I could just skip over all the words that = 0, and when I come across a word that <> 0 I could do a binary search of that word to only change the pixels that need to be changed. If very little on the screen has changed, this would be quite a bit faster because the pixel changed array would be 1/16 the size of the full buffer.
The only way this would be of any benefit though is if aggpas set the bits in the 'pixel changed' array while it was changing the pixels of the buffer, because at that time it already has the array position and the fact that something changed available. If I had to analyze the buffer separately and create the 'pixels changed' array, it would take too long.
That sounds like a little bit of a special case - it'll work where you're using putimage for a large area that has very few pixels set. Perhaps just reimplementing the general algorithm in inline asm, by using SSE (or MMX) vector instructions, would be the fastest, but maybe it's not worth the pain and the pascal implementation is fast enough for you. Just experiment and see what works best :)
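
For reference, the 'pixel changed' idea quoted above could look roughly like the sketch below. Everything in it is an assumption made for illustration: the buffer sizes, 16-bit pixels, and the names SetPixel and FlushChanged are hypothetical and are not part of aggpas or ptcgraph. It also uses a plain bit-by-bit scan of each non-zero word instead of the binary search mentioned in the quote.

```pascal
program DirtyMaskSketch;
{$mode objfpc}

const
  Width = 64;
  Height = 4;
  PixelCount = Width * Height;
  MaskWords = PixelCount div 32;  { one bit per pixel, packed into 32-bit words }

var
  Source, Dest: array[0..PixelCount - 1] of Word;   { assumed 16-bit pixels }
  Changed: array[0..MaskWords - 1] of LongWord;     { the 'pixel changed' array }

{ Hypothetical hook: the renderer would call this whenever it writes a pixel,
  so the mask is updated at the moment the buffer position is already known. }
procedure SetPixel(Index: Integer; Color: Word);
begin
  Source[Index] := Color;
  Changed[Index shr 5] := Changed[Index shr 5] or (LongWord(1) shl (Index and 31));
end;

{ Copies only the flagged pixels from Source to Dest, clears the mask,
  and returns how many pixels were copied. }
function FlushChanged: Integer;
var
  w, b: Integer;
  bits: LongWord;
begin
  Result := 0;
  for w := 0 to MaskWords - 1 do
  begin
    bits := Changed[w];
    if bits = 0 then
      Continue;                   { 32 unchanged pixels skipped with one compare }
    for b := 0 to 31 do
      if (bits and (LongWord(1) shl b)) <> 0 then
      begin
        Dest[w * 32 + b] := Source[w * 32 + b];
        Inc(Result);
      end;
    Changed[w] := 0;
  end;
end;

begin
  FillChar(Changed, SizeOf(Changed), 0);
  SetPixel(3, $FFFF);
  SetPixel(200, $1234);
  WriteLn('pixels copied: ', FlushChanged);
  WriteLn('second pass: ', FlushChanged);
end.
```

The gain comes entirely from the "bits = 0" test, so it only pays off when most mask words stay zero between two flushes, which is the special case described above.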

Btw, I looked at your code again and saw a quick and cheap optimization - just move the case statement (case BitBlt of) outside the inner loop (for i:=X to X1 do), so the value of BitBlt is not checked once every pixel, but once per row.
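
A minimal sketch of that transformation, not the actual putimage code: the TBlitMode enum is a hypothetical stand-in for the BitBlt values, and the row type and procedure names are made up for the example. The first procedure checks the mode for every pixel, the second checks it once per row and gives each mode its own tight loop.

```pascal
program HoistCaseSketch;
{$mode objfpc}

type
  TBlitMode = (bmCopy, bmXor, bmOr, bmAnd);  { hypothetical stand-in for BitBlt }
  TRow = array[0..7] of Word;

{ Before: the case statement is evaluated once for every pixel. }
procedure BlitRowPerPixel(const Src: TRow; var Dst: TRow; X, X1: Integer;
  Mode: TBlitMode);
var
  i: Integer;
begin
  for i := X to X1 do
    case Mode of
      bmCopy: Dst[i] := Src[i];
      bmXor:  Dst[i] := Dst[i] xor Src[i];
      bmOr:   Dst[i] := Dst[i] or Src[i];
      bmAnd:  Dst[i] := Dst[i] and Src[i];
    end;
end;

{ After: the case statement is evaluated once per row. }
procedure BlitRowHoisted(const Src: TRow; var Dst: TRow; X, X1: Integer;
  Mode: TBlitMode);
var
  i: Integer;
begin
  case Mode of
    bmCopy: for i := X to X1 do Dst[i] := Src[i];
    bmXor:  for i := X to X1 do Dst[i] := Dst[i] xor Src[i];
    bmOr:   for i := X to X1 do Dst[i] := Dst[i] or Src[i];
    bmAnd:  for i := X to X1 do Dst[i] := Dst[i] and Src[i];
  end;
end;

var
  Src, A, B: TRow;
  m: TBlitMode;
  i: Integer;
  Same: Boolean;
begin
  { Run both versions on identical input for every mode and compare. }
  Same := True;
  for m := Low(TBlitMode) to High(TBlitMode) do
  begin
    for i := 0 to 7 do
    begin
      Src[i] := Word(i * 4369);
      A[i] := $0F0F;
      B[i] := $0F0F;
    end;
    BlitRowPerPixel(Src, A, 1, 6, m);
    BlitRowHoisted(Src, B, 1, 6, m);
    for i := 0 to 7 do
      if A[i] <> B[i] then
        Same := False;
  end;
  WriteLn('results match: ', Same);
end.
```

Once the copy case has its own loop, it could also be replaced by a single Move(Src[X], Dst[X], (X1 - X + 1) * SizeOf(Word)) call, which is likely to be faster still for plain copies.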


Nikolay

_______________________________________________
fpc-pascal maillist  -  fpc-pascal@lists.freepascal.org
http://lists.freepascal.org/cgi-bin/mailman/listinfo/fpc-pascal
