Re: Unicode compliant Line Breaking

2005-11-07 Thread Jeremias Maerki
1. +1 2. +1 3.b) +1 for the separatable parts although c) is also ok for now. +1 to try to find synergies with the code in Batik. If I were you I'd create a branch and put your stuff in there. It's easier for everyone to follow and to help (wishful thinking). On 31.10.2005 08:25:12 Manuel Mall w

Re: Unicode compliant Line Breaking

2005-11-02 Thread J.Pietschmann
Simon Pepping wrote: I mean, will our current method of finding possible line breaking points using the hyphenation tables be part of a TR14 compliant system to find line break opportunities? In some sense yes, but I'm not sure what you really mean. Currently, spaces and slashes ("/") as well

Re: Unicode compliant Line Breaking

2005-11-02 Thread Simon Pepping
On Tue, Nov 01, 2005 at 11:17:08PM +0100, J.Pietschmann wrote: > Simon Pepping wrote: > >Is our current hyphenation method a subset of Unicode's method? > > Umm. What's the relation between hyphenation and TR14 (except for > handling soft hyphens)? I guess you confuse finding line breaks > in gene

Re: Unicode compliant Line Breaking

2005-11-01 Thread J.Pietschmann
Simon Pepping wrote: Is our current hyphenation method a subset of Unicode's method? Umm. What's the relation between hyphenation and TR14 (except for handling soft hyphens)? I guess you confuse finding line breaks in general and line breaking due to hyphenation. I seem to recall that the hyp

Re: Unicode compliant Line Breaking

2005-11-01 Thread Simon Pepping
On Mon, Oct 31, 2005 at 03:25:12PM +0800, Manuel Mall wrote: > In a previous post Joerg pointed to the Unicode Standard Annex #14 on > Line Breaking (http://www.unicode.org/reports/tr14/) and his initial > implementation: http://people.apache.org/~pietsch/linebreak.tar.gz. > > I had since a clos

Re: Unicode compliant Line Breaking

2005-11-01 Thread Manuel Mall
On Tue, 1 Nov 2005 07:27 pm, [EMAIL PROTECTED] wrote: > Hi Manuel, > > Manuel Mall <[EMAIL PROTECTED]> wrote on 11/01/2005 04:24:05 AM: > > On Tue, 1 Nov 2005 01:33 am, [EMAIL PROTECTED] wrote: >> Had a look at the Batik code: Same algorithm as Joerg wrote (not > > surprising as UAX#14 actually c

Re: Unicode compliant Line Breaking

2005-11-01 Thread thomas . deweese
Hi Manuel, Manuel Mall <[EMAIL PROTECTED]> wrote on 11/01/2005 04:24:05 AM: > On Tue, 1 Nov 2005 01:33 am, [EMAIL PROTECTED] wrote: > > Just an FYI, Batik also currently has an implementation of > > the Unicode TR14 word breaking alg. > > (org.apache.batik.gvt.flow.TextLineBreak). > Thom

Re: Unicode compliant Line Breaking

2005-11-01 Thread Manuel Mall
On Tue, 1 Nov 2005 01:33 am, [EMAIL PROTECTED] wrote: > Hi all, > > Just an FYI, Batik also currently has an implementation of > the Unicode TR14 word breaking alg. > (org.apache.batik.gvt.flow.TextLineBreak). > > As far as performance is concerned it should be fairly fast > as it i

Re: Unicode compliant Line Breaking

2005-10-31 Thread thomas . deweese
Hi all, Just an FYI, Batik also currently has an implementation of the Unicode TR14 word breaking alg. (org.apache.batik.gvt.flow.TextLineBreak). As far as performance is concerned it should be fairly fast as it is mostly just table based. The Web Maestro <[EMAIL PROTECTED]> wro

Re: Unicode compliant Line Breaking

2005-10-31 Thread The Web Maestro
IMO, Unicode conformant line-breaking is an important goal for FOP to achieve. But before I vote, I have a question: On Oct 30, 2005, at 11:25 PM, Manuel Mall wrote: 2. I would also like to propose that the Unicode conformant line breaking be implemented using our own pair-table based implem

Re: Unicode compliant Line Breaking

2005-10-31 Thread Jingjing Lee
my votes: 1.) +1 2.) +1 3.c) +1 BTW, more than 2.a, even the most up-to-date jdk (1.5.0_05) is not full UAX#14 compliant. It treat QU as (A) instead of (XB/XA). So we REALLY need a independent impl to follow the Unicode standard. __ Yahoo! FareC

Unicode compliant Line Breaking

2005-10-30 Thread Manuel Mall
In a previous post Joerg pointed to the Unicode Standard Annex #14 on Line Breaking (http://www.unicode.org/reports/tr14/) and his initial implementation: http://people.apache.org/~pietsch/linebreak.tar.gz. I had since a closer look at both UAX#14 and Joerg's code. Because I liked what I saw I