Re: [GSoC] Wiki page for progress informations

Luca Furini Wed, 31 May 2006 01:58:25 -0700

Jeremias Maerki wrote:

did you already investigate how footnotes are implemented? Can you say
anything about how similar the problem of footnotes is to before-floats?
Just so you don't have to start from scratch while there may be
something to build upon. After all, the footnotes also contain some
logic to move certain parts to a different page than where anchor is
located.


A few quick comments about the footnote implementation:

1) the FootnoteLM returns only the sequence of elements representing theinline part (not the footnote-body part); it just adds to the last(inline) box a reference to the FootnoteBodyLM.

2) the LineLM, after computing the breaks, adds to each (block) boxrepresenting a line the references to the FootnoteBodyLM whose citationsare in that line

3) during the remaining of the element collection phase, these referencesare not used (but in the creation of "combined" element lists, when theyshould be copied inside the new elements)

4) the PageSequenceLM.PageBreaker.getNextKnuthElements() method, afterreceiving all the (block) elements, scans them looking for footnoteinformation, gets the elements from the referenced FootnoteBodyLM and putsthem in a different list (at the moment a list of lists, but this issub-optimal), and from the footnote-separator (in a separate list)

5) these lists are looked at in PageBreakingAlgorithm.computeDifference(),where we try to add some footnote content to the "normal" page contentusing getFootnoteSplit(), and in computeDemerits(), where some extrademerits are added if we break a footnote or some footnotes are deferred.

This last point at the moment is performed using manyPageBreakingAlgorithm private variables, which is maybe not the best wayto do it, as we must be very careful about their initialization and theiruse, especially when the algorithm restarts. I think that a "state" objectstoring these variables could be used to store these values, andexplicitly passed along the methods instead of relying on the classmembers, but concerning this I'd like to hear the opinions of the othercommitters ...

Insertion of before-floats could be implemented in a similar way, givingthe precedence to the footnote insertion (as it is affected by more strictconstraints).

An important difference between a footnote and a before-float is that thelatter does not have an "inline part", so (if we want to follow the samepattern) we need to either store the reference inside a previously-createdbox or to add some new elements containing the reference (but we must besure that these elements cannot be parted from the previous ones, see theconstraints in section 6.10.2 in the spec).

A crucial point is the demerit function as, if I remember correctly, itgreatly affect the computational complexity of the breaking algorithm(thre should be a M. Plass paper concerning this).

HTH

Another thing that we may need to keep in mind: There was lots of desire
from the user community that FOP supports large documents (long-term
goal, not necessary yours). I wrote that a first-fit algorithm could
help free memory earlier. Obviously, for complex before-float situations
a total-fit approach is probably more interesting as it can come up with
more "creative" solutions. I'm just mentioning it so we keep the bigger
picture in mind and since there could be conflicting goals.

A "first degree" of first-fit algorithm could be achieved quite quickly byhaving a BreakingAlgorithm interface which is implemented by a TotalFitBA(the existing implementation) and a FirstFitBA which would have a muchsimpler considerLegalBreak() method that, instead of the complex set ofnodes, just keeps in mind a single node.

This would surely decrease the memory footprint, but is not (I think) whatwe really want, as this simplified algorithm would be performed on thewhole sequence of elements.

In order to start processing the sequence as soon as we receive a fewelements we need to do some deeper changes.

An idea (I just had it now, so I did not fully consider all itsimplications).At the moment, the block-level LM collect elements from their children andreturn just a single sequence (if there are no break conditions); we couldhave a parameter requesting them to return after they receive each childsub-sequence, and have a canStartComputingBreak() method that returns trueif the sequence contains enough elements and we are using a first-fitalgorithm, or false otherwise ...

Sorry for the long post ... and for the long absence too, but it seemsthat just after thinking "great, now I've really got some time to spend onFOP" I receive tons of other things to do ... :-(


Regards
    Luca

Re: [GSoC] Wiki page for progress informations

Reply via email to