On Feb 12, 2008, at 21:10, Puppala, Kumar (LNG-CON) wrote:
Hi Kumar,
>> Just to be sure: which revision of the trunk are you trying out?
> I obtained the latest from the trunk on Jan 22nd. The information
> pertaining to the FOP trunk, as seen in the status.xml file, is:
> "status.xml 614201 2008-01-22 14:02:27Z jeremias"
OK, I think you can safely update this to the latest. Not that it
will matter much.
<snip />
> Yes, I am instantiating FopFactory just once. Initially I was not
> doing that, but I changed my code to instantiate it only once. The
> results provided are with that change applied.
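Good. For reference, the single-factory pattern looks roughly like this (a sketch against the FOP trunk embedding API of that era; it needs the FOP jars on the classpath, and the PDF output format and file handling are just examples):

```java
import java.io.BufferedOutputStream;
import java.io.File;
import java.io.FileOutputStream;
import java.io.OutputStream;

import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.sax.SAXResult;
import javax.xml.transform.stream.StreamSource;

import org.apache.fop.apps.FOUserAgent;
import org.apache.fop.apps.Fop;
import org.apache.fop.apps.FopFactory;
import org.apache.fop.apps.MimeConstants;

public class FopServer {

    // One FopFactory for the whole JVM: it caches fonts, hyphenation
    // patterns, etc., which are expensive to rebuild per document.
    private static final FopFactory FOP_FACTORY = FopFactory.newInstance();

    public static void renderToPdf(File foFile, File pdfFile) throws Exception {
        // Per-run objects: FOUserAgent and Fop are NOT meant to be reused.
        FOUserAgent userAgent = FOP_FACTORY.newFOUserAgent();
        OutputStream out = new BufferedOutputStream(new FileOutputStream(pdfFile));
        try {
            Fop fop = FOP_FACTORY.newFop(MimeConstants.MIME_PDF, userAgent, out);
            // Identity transform feeds the FO file straight into FOP's handler.
            Transformer transformer = TransformerFactory.newInstance().newTransformer();
            transformer.transform(new StreamSource(foFile),
                                  new SAXResult(fop.getDefaultHandler()));
        } finally {
            out.close();
        }
    }
}
```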
>> Apart from that, focusing purely on FOP Trunk: if you know how to
>> narrow it down to specific methods/calls that cause the increase in
>> processing time, that would help us a lot.
> I do have the complete heap report. Some of the classes with the
> largest instance counts are shown below:
Well, it's not so much the number of objects I'm thinking of, but
rather how much time is spent executing specific methods, and which
ones take longer in later iterations. If I judge correctly, the
actual cause of the slowdown may well be located in a class of which
relatively few instances are alive. Or did you already check whether
the bulk of the increase in processing time is really spent on
garbage collection alone?
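One way to separate GC time from wall-clock time, using only the standard java.lang.management API (a sketch; substitute one of your FOP iterations for the dummy allocation loop):

```java
import java.lang.management.GarbageCollectorMXBean;
import java.lang.management.ManagementFactory;
import java.util.List;

public class GcShare {

    /** Total milliseconds spent in GC since JVM start, summed over all collectors. */
    static long totalGcMillis() {
        long total = 0;
        List<GarbageCollectorMXBean> gcs = ManagementFactory.getGarbageCollectorMXBeans();
        for (GarbageCollectorMXBean gc : gcs) {
            long t = gc.getCollectionTime(); // -1 if this collector doesn't report it
            if (t > 0) {
                total += t;
            }
        }
        return total;
    }

    public static void main(String[] args) {
        long gcBefore = totalGcMillis();
        long wallBefore = System.currentTimeMillis();

        // Dummy allocation-heavy workload; replace with one FOP iteration.
        for (int i = 0; i < 200; i++) {
            byte[] chunk = new byte[1 << 20];
            chunk[0] = 1;
        }

        long wallMillis = System.currentTimeMillis() - wallBefore;
        long gcMillis = totalGcMillis() - gcBefore;
        System.out.println("wall: " + wallMillis + " ms, of which GC: " + gcMillis + " ms");
    }
}
```

If the GC share stays roughly constant while wall-clock time climbs, the slowdown is not a garbage-collection story.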
> 463132 instances of class org.apache.fop.traits.MinOptMax
> 441537 instances of class org.apache.fop.layoutmgr.NonLeafPosition
<snip />
> I am not sure whether this is something to be expected.
Is this an overall total, or a snapshot taken at a given point? These
are figures I'd expect for a rather large page-sequence...
One small question: did you, by any chance, also try different JVM
versions? A different platform?
> No. I can try on jre1.6.0_04. Since we run the current FOP on the
> Solaris platform, we are performing our tests on Solaris.
>> Do you know which XML parser / XSLT processor gets used at runtime?
> We do not use an XSLT processor. We generate the FO file with an
> in-house application and feed it to the FOP server. Since I am
> using the default handler, I think it's using the SAX parser behind
> the scenes.
Right, now I remember you already mentioned this earlier.
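For completeness: if you ever want to confirm which SAX implementation JAXP actually picks up at runtime, a few lines will tell you (standard JAXP lookup; the class names printed will vary with your JVM and classpath):

```java
import javax.xml.parsers.SAXParserFactory;

public class WhichParser {
    public static void main(String[] args) throws Exception {
        // JAXP resolves the concrete implementation at runtime; printing the
        // class names shows which parser really sits behind the scenes.
        SAXParserFactory factory = SAXParserFactory.newInstance();
        System.out.println("Factory: " + factory.getClass().getName());
        System.out.println("Reader:  "
                + factory.newSAXParser().getXMLReader().getClass().getName());
    }
}
```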
<snip />
In local tests I ran here, with two concurrent threads and a shared
FopFactory instance, the processing time remains quite stable for me
(test run on an Apple JVM 1.5, using a document that generates two
page-sequences (2 + 69 pages; the larger page-sequence contains a
forced break for each page)).
> My tests are much more diverse. Each iteration contains about 120
> testcases, each targeting a specific feature that we use. Hence each
> iteration covers most of the features we rely on: tables, cells,
> images, big documents, row-spanning, column-spanning, dual-column
> layout, etc. In total, I would say I am generating about 3000 pages
> per iteration. I compare the results after each such iteration, for
> about 15 iterations, and I am seeing a gradual increase in
> processing times.
Interesting. Can you somehow dump the testcases as a set of physical
FO files, and make that available somewhere? This would make it
possible for us to run the same tests locally, and investigate further.
If this is impossible for you, then I'd advise starting with a
drastically trimmed-down version of your test suite, and gradually
changing and/or increasing the number of tests. See if you can
isolate the problem to a specific set of files (tables? markers?
custom fonts? etc.). That would at least give us a clue as to where
to start looking. What may also prove valuable is trying the tests
with a different renderer.
One more remark:
> 2) I do see a lot of garbage collection happening in the new
> FOP. The collection times are also very high.
As I already hinted, this is not bad per se. It could simply
indicate that FOP Trunk offers the GC more opportunities to clean up,
so as to reduce the average footprint (when looking at it as a
series of snapshots). Memory consumption vs. processing speed is
virtually always a trade-off: the less information is cached, the
more computations need to be performed multiple times; but a
calculator that caches /all/ results and /never/ makes the same
computation twice requires an insane amount of memory...
That said, it still remains strange that the processing time
increases with the number of runs... Can you try leaving the
iterations running into the hundreds or thousands? Does the time keep
increasing? By the same amount?
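A minimal harness for that experiment could look like this (runIteration() is a hypothetical stand-in for one pass over your 120 testcases; the dummy body just burns some allocations):

```java
public class IterationTimer {

    // Hypothetical stand-in for one full pass over the testcase set;
    // replace the body with the real FOP run.
    static void runIteration() {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < 50_000; i++) {
            sb.append(i);
        }
    }

    public static void main(String[] args) {
        int iterations = 20; // push this into the hundreds or thousands
        long[] millis = new long[iterations];
        for (int i = 0; i < iterations; i++) {
            long start = System.nanoTime();
            runIteration();
            millis[i] = (System.nanoTime() - start) / 1_000_000L;
            System.out.println("iteration " + i + ": " + millis[i] + " ms");
        }
        // A steady, unbounded climb suggests state accumulating somewhere;
        // an early rise that flattens out points to JIT/GC warm-up instead.
    }
}
```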
<snip />
Come to think of it: are your images stored on a local disk, or is
there any network traffic involved that might explain the increasing
lag...?
> The images are stored on a local disk. However, I see better
> results for the testcases containing images, so I do not believe
> any network-traffic issues are involved.
Sorry, I did not mean 'images' but more generally 'documents'. Are
the input/output files all located on the same machine, or does some
of it come from/end up on different machines? If so, are these
machines dedicated to serving the I/O requests for your FOP process,
or are they used for other processes as well?
Cheers
Andreas