Re: svn commit: r328381 - in /xmlgraphics/fop/trunk/src/java/org/apache/fop: area/inline/ layoutmgr/inline/ render/ render/pdf/ render/xml/

Luca Furini Wed, 26 Oct 2005 02:16:57 -0700

Manuel Mall wrote:

I have a question on this. You break in TextArea the text into words based on CharUtilities.isAnySpace. Is this guaranteed to be consistent with the breaking and adjustment calculations in TextLayoutManager? I am concerned we may be using different rules for word breaking in different places.

As far as consistency is concerned, I agree with you: the handling of the different kinds of spaces (breaking, non-breaking, fixed width, ...) is still quite incomplete and "dispersed" over different classes. Just to add another example, the CharacterLM implicitly "expects" its character to be a non-space character and has its own lines of code concerning the creation of the elements, while it could share the methods already called by the TextLM.

Having a single, centralized class taking care of the breaking (be it a Java utility class or a Fop one) and a single, shared method implementing the creation of the elements would surely increase consistency and clarity.
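
To make the idea a bit more concrete, here is a minimal sketch of what such a centralized class could look like. The class name SpaceClassifier, the Kind enum and the exact grouping of the characters are assumptions made up for illustration; none of this is existing FOP code, and the real rules would have to be agreed on.

```java
/** Hypothetical central classifier; the names are illustrative, not FOP API. */
final class SpaceClassifier {

    /** Coarse categories of characters, as seen by layout and rendering. */
    enum Kind { BREAKING_SPACE, NON_BREAKING_SPACE, FIXED_WIDTH_SPACE, NON_SPACE }

    private SpaceClassifier() { }

    /** Classifies one character; the grouping below is an assumption. */
    static Kind classify(char c) {
        switch (c) {
            case ' ':
            case '\t':
            case '\n':
            case '\r':
                return Kind.BREAKING_SPACE;
            case '\u00A0': // NO-BREAK SPACE
            case '\u202F': // NARROW NO-BREAK SPACE
                return Kind.NON_BREAKING_SPACE;
            default:
                // U+2000..U+200A: the fixed-width spaces (en quad .. hair space)
                if (c >= '\u2000' && c <= '\u200A') {
                    return Kind.FIXED_WIDTH_SPACE;
                }
                return Kind.NON_SPACE;
        }
    }

    /** Single shared answer to the question "is this any kind of space?". */
    static boolean isAnySpace(char c) {
        return classify(c) != Kind.NON_SPACE;
    }
}
```

If both the layout managers and the areas asked this one class, they could not disagree on what counts as a space.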

Somehow it doesn't feel right to me that TextLayoutManager does all the breaking and calculations and then we give the whole chunk to TextArea and it breaks it again using a possibly different algorithm but still using the adjustment value calculated by TextLayoutManager.

When I was trying to fix bug 36238 I initially started modifying TextLM#createTextArea(), using the AreaInfo objects to create WordAreas and SpaceAreas, but I then decided to move the "string splitting" inside TextArea because:

1) if WordAreas and SpaceAreas are not directly created by the LMs, there is no need to change a single line of code inside the classes creating TextAreas; this is not a real "reason" supporting the choice, just a handy consequence of it;

2) if TextArea still provides a getText() method, the renderers are not forced to render the text word by word and space by space if their word spacing treatment is not affected by multi-byte characters; but once again, this is not a real reason as we could provide this method anyway;

3) although both SpaceArea and WordArea have an "offset" attribute, it is not used at the moment, so these areas do not carry any formatting information; their only purpose is to "highlight" spaces, thus allowing some specific renderer to handle them correctly regardless of their encoding; in other words, we are not losing breaking and calculations, we simply do not need them anymore as we already know exactly which text will be placed in each line, and how wide it will be once it's correctly adjusted;

4) the text that will be placed in a line cannot be directly taken from "textArray" (in the TextLM), and the string "str" should be used instead anyway, as it may be different from the concatenation of the single pieces of text; at the moment the only difference concerns the hyphenation character "-" added at the end of the line, but I suspect that in different languages there could be other differences; so, we cannot simply create a WordArea for each AreaInfo object.
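
Point 4 can be shown with a very small sketch. The class LineText and its build() method are hypothetical and only mimic the idea: the string for a line is the concatenation of the pieces plus, possibly, something that belongs to none of them (here the hyphenation character). This is not the actual TextLM code.

```java
import java.util.List;

/** Hypothetical helper; it only mimics how a line string may be assembled. */
final class LineText {

    private LineText() { }

    /**
     * Concatenates the pieces of text placed in a line and, if the line
     * ends at a hyphenation point, appends the hyphenation character.
     */
    static String build(List<String> pieces, boolean endsAtHyphenationPoint) {
        StringBuilder sb = new StringBuilder();
        for (String piece : pieces) {
            sb.append(piece);
        }
        if (endsAtHyphenationPoint) {
            sb.append('-'); // this character is in none of the pieces
        }
        return sb.toString();
    }
}
```

With the pieces "a " and "hy" and a hyphenated line end, the result is "a hy-": three runs of characters at most map onto the two pieces, and the final "-" has no piece of its own.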

So, if you find it strange to break the text, put it together and split it again, me too! :-) But this initial feeling disappeared when I realized that the final splitting does not involve "breaking" in its proper sense, but just "classification" of characters.
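
As an illustration of "classification" as opposed to "breaking", here is a rough sketch of such a splitting. RunSplitter, Run and isSpace() are names invented for this example; in particular isSpace() is only a stand-in for CharUtilities.isAnySpace, whose exact rules I am not reproducing here. No widths and no break opportunities are computed: the already-built line string is just cut where the class of the characters changes.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical splitter: cuts a finished line into word and space runs. */
final class RunSplitter {

    /** A maximal run of characters that are all spaces or all non-spaces. */
    static final class Run {
        final boolean space;
        final String text;

        Run(boolean space, String text) {
            this.space = space;
            this.text = text;
        }
    }

    private RunSplitter() { }

    /** Stand-in for a shared space test (assumption, not the FOP method). */
    static boolean isSpace(char c) {
        return Character.isWhitespace(c) || c == '\u00A0';
    }

    /** Splits the line wherever the space/non-space class changes. */
    static List<Run> split(String line) {
        List<Run> runs = new ArrayList<>();
        int start = 0;
        for (int i = 1; i <= line.length(); i++) {
            boolean atEnd = (i == line.length());
            if (atEnd || isSpace(line.charAt(i)) != isSpace(line.charAt(start))) {
                runs.add(new Run(isSpace(line.charAt(start)), line.substring(start, i)));
                start = i;
            }
        }
        return runs;
    }
}
```

Concatenating the runs in order always gives back the original line, so nothing decided by the layout manager is changed; the runs only tell a renderer where the spaces are.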

This is why I did what I did; if I did not manage to convince you ... you can try and convince me! :-)


Regards
    Luca
