Re: Line Breaking

2007-10-12 Thread Boris Zbarsky
David E. Ross wrote: > Why can't this be resolved by users adhering to Appendix C of RFC 3986 Because in practice they don't. -Boris ___ dev-tech-layout mailing list dev-tech-layout@lists.mozilla.org https://lists.mozilla.org/listinfo/dev-tech-layout

Re: Line Breaking

2007-10-12 Thread David E. Ross
On 10/10/2007 6:57 AM, Masayuki Nakano wrote: > Jonas Sicking wrote: >> Would it be at all possible to do different line wrapping for urls in >> japanese (and similar) text, and in western languages? > > fm... It's interesting. But it seems that the hidden prefs are much > better. And should it

Re: Line Breaking

2007-10-10 Thread [EMAIL PROTECTED]
On Oct 11, 2:57 am, Masayuki Nakano <[EMAIL PROTECTED]> wrote: > Jonas Sicking wrote: > > Would it be at all possible to do different line wrapping for urls in > > japanese (and similar) text, and in western languages? > > fm... It's interesting. But it seems that the hidden prefs are much > better

Re: Line Breaking

2007-10-10 Thread Jonas Sicking
Masayuki Nakano wrote: > Jonas Sicking wrote: >> Would it be at all possible to do different line wrapping for urls in >> japanese (and similar) text, and in western languages? > > fm... It's interesting. But it seems that the hidden prefs are much > better. And should it be localizable pref? (

Re: Line Breaking

2007-10-10 Thread Masayuki Nakano
I filed bug 399321 for switchable URL breaker. https://bugzilla.mozilla.org/show_bug.cgi?id=399321 -- Masayuki Nakano <[EMAIL PROTECTED]> Manager, Internationalization, Mozilla Japan. Personal Web Site (Written in Japanese): http://www.d-toybox.com/studio/ ___

Re: Line Breaking

2007-10-10 Thread Masayuki Nakano
Jonas Sicking wrote: > Would it be at all possible to do different line wrapping for urls in > japanese (and similar) text, and in western languages? fm... It's interesting. But it seems that the hidden prefs are much better. And should it be localizable pref? (We may be able to use lang attri

Re: Line Breaking

2007-10-09 Thread Jonas Sicking
Boris Zbarsky wrote: > Peter Weilbacher wrote: >> But it drives me nuts when I paste URLs into the Bugzilla comment >> field that they are wrapped. I want to turn that off. (When looking at >> the comment after submission they are not wrapped any more, that makes >> it even more confusing.) Am I

Re: Line Breaking

2007-10-09 Thread Peter Weilbacher
On 10/09/07 02:54, Masayuki Nakano wrote: > hmm... I'm not sure it is problem. I think there are two patterns when > you paste/write the URIs. > > 1. URI is a independent. > > I.e., URI is an only member of a paragraph. In western context, we use > empty line for paragraph separator. So, the w

Re: Line Breaking

2007-10-08 Thread Boris Zbarsky
Peter Weilbacher wrote: > But it drives me nuts when I paste URLs into the Bugzilla > comment field that they are wrapped. I want to turn that off. (When > looking at the comment after submission they are not wrapped any more, > that makes it even more confusing.) Am I the only one annoyed by th

Re: Line Breaking

2007-10-08 Thread Masayuki Nakano
Peter Weilbacher wrote: > On Mon, 8 Oct 2007 18:59:38 UTC, Masayuki Nakano wrote: > >> Peter Weilbacher wrote: >>> Is there a way that I as a user can switch off file path/URI breaking? I >>> find that very annoying. Is it planned to implement that or should I >>> file an RFE? >> If you are autho

Re: Line Breaking

2007-10-08 Thread Masayuki Nakano
Peter Weilbacher wrote: > On Mon, 8 Oct 2007 12:17:16 UTC, Masayuki Nakano wrote: > >> I updated http://wiki.mozilla.org/Gecko:Line_Breaking >> >> The document explains the rules of current trunk simply. > > Is there a way that I as a user can switch off file path/URI breaking? I > find that very

Re: Line Breaking

2007-10-08 Thread Masayuki Nakano
I updated http://wiki.mozilla.org/Gecko:Line_Breaking The document explains the rules of current trunk simply. # Should be moved to MDC? Thanks. -- Masayuki Nakano <[EMAIL PROTECTED]> Manager, Internationalization, Mozilla Japan. Personal Web Site (Written in Japanese): http://www.d-toybox.com/

Re: Line Breaking

2007-09-19 Thread [EMAIL PROTECTED]
Masayuki-san landed an update to the line breaking code last night. The main impact is that we don't allow a break opportunity "too close" (currently,. less than 6 characters away) from another break opportunity unless it's at a whitespace boundary. This should make thing

Re: Line Breaking

2007-09-03 Thread [EMAIL PROTECTED]
On Sep 3, 8:50 am, Boris Zbarsky <[EMAIL PROTECTED]> wrote: > Masayuki Nakano wrote: > > We can remove date format checking. If it is less than 5, "/2007" and > > "2007-" is breakable at before/after them. > > But I'd expect both "right-wing" and "left-wing" to be breakable. Yes, but if we don't a

Re: Line Breaking

2007-09-03 Thread Boris Zbarsky
Masayuki Nakano wrote: > er, I meant that "a statement about typical table cell widths on typical > web pages". I guess we're only talking about intra-word breaking, right? So whitespace can still be broken on anytime? In that case, you might be right. Let's give it a shot. -Boris _

Re: Line Breaking

2007-09-03 Thread Masayuki Nakano
Masayuki Nakano wrote: > Boris Zbarsky wrote: >> Masayuki Nakano wrote: >>> And I think that your examples doesn't grow up the table cell width >>> in actual web pages (and also doesn't overflow from fixed width box). >> >> I'm not sure I follow. Is that a statement about typical table cell >> w

Re: Line Breaking

2007-09-03 Thread Masayuki Nakano
Boris Zbarsky wrote: > Masayuki Nakano wrote: >> And I think that your examples doesn't grow up the table cell width in >> actual web pages (and also doesn't overflow from fixed width box). > > I'm not sure I follow. Is that a statement about typical table cell > widths on typical web pages, or

Re: Line Breaking

2007-09-02 Thread Boris Zbarsky
Masayuki Nakano wrote: > And I think that your examples doesn't grow up the table cell width in > actual web pages (and also doesn't overflow from fixed width box). I'm not sure I follow. Is that a statement about typical table cell widths on typical web pages, or a statement about its fundamen

Re: Line Breaking

2007-09-02 Thread Masayuki Nakano
r should be backed out. But the > URL/File path breaker is important for Japanese > Marketing/Users/Designers. So, I cannot accept it. > > We need simple way for 1.9. And we should implement prioritized > line-breaking in Mozilla2 (or later). Until then, we should take simple

Re: Line Breaking

2007-09-02 Thread Masayuki Nakano
apanese Marketing/Users/Designers. So, I cannot accept it. We need simple way for 1.9. And we should implement prioritized line-breaking in Mozilla2 (or later). Until then, we should take simple and low risk way. And I think that your examples doesn't grow up the table cell width in ac

Re: Line Breaking

2007-09-02 Thread Boris Zbarsky
Masayuki Nakano wrote: > We can remove date format checking. If it is less than 5, "/2007" and > "2007-" is breakable at before/after them. But I'd expect both "right-wing" and "left-wing" to be breakable. Perhaps we should look not only at the number of chars but also at what the chars are or

Re: Line Breaking

2007-09-02 Thread Masayuki Nakano
Boris Zbarsky wrote: > Masayuki Nakano wrote: >> 'near' is defined as 6 characters now. Therefore, in Western word, >> never breaking at first 6 characters and last 6 characters. i.e., if a >> word is broken, it has 11 characters at least. > > Why 6? I actually think 4 would be a lot more reaso

Re: Line Breaking

2007-09-02 Thread Boris Zbarsky
Masayuki Nakano wrote: > 'near' is defined as 6 characters now. Therefore, in Western word, never > breaking at first 6 characters and last 6 characters. i.e., if a word is > broken, it has 11 characters at least. Why 6? I actually think 4 would be a lot more reasonable... -Boris

Re: Line Breaking

2007-09-02 Thread Masayuki Nakano
I posted the new approach patch to bug 389056. The general breaking rule is not changed from previous patch. But the patch doesn't break at 'near' from start of word, end of word and previous breaking point in western language context. 'near' is defined as 6 characters now. Therefore, in Wester

Re: Line Breaking

2007-08-25 Thread lisika
On 25 elo, 21:41, [EMAIL PROTECTED] wrote: > It is interesting that, in addition to the ASCII hyphen-minus (U > +002D), Unicode specifies even a "regular" hyphen (U+2010). It is Now, here is an interesting bug. I wrote my previous message with a text editor and copy-pasted it into the Google Grou

Re: Line Breaking

2007-08-25 Thread lisika
> GL: Non-breaking ("Glue") (XB/XA) (Non-tailorable) > Non-breaking characters prohibit breaks on either side, but that prohibition > can be overridden by SP or ZW. In particular, when NBSP follows SPACE, > there is a break opportunity after the SPACE and NBSP will go as visible > space onto

Re: Line Breaking

2007-08-16 Thread Masayuki Nakano
fantasai wrote: > Masayuki Nakano wrote: >> >> I don't take this idea in latest patch. But the latest patch fixes >> many problems, e.g., the all testcases clears in: >> https://bugzilla.mozilla.org/attachment.cgi?id=275972 >> # The chemical context will be broken. >> >> I hope that the latest pat

Re: Line Breaking

2007-08-13 Thread fantasai
Masayuki Nakano wrote: > > I don't take this idea in latest patch. But the latest patch fixes many > problems, e.g., the all testcases clears in: > https://bugzilla.mozilla.org/attachment.cgi?id=275972 > # The chemical context will be broken. > > I hope that the latest patch is landed to trunk fo

Re: Line Breaking

2007-08-09 Thread Masayuki Nakano
[EMAIL PROTECTED] wrote: > On 9 elo, 18:41, Masayuki Nakano <[EMAIL PROTECTED]> wrote: > >> https://bugzilla.mozilla.org/attachment.cgi?id=275972 > > I think there is an error in the "Should break inside" list where the > second item says that the combination SPACE+NBSP is breakable. > According

Re: Line Breaking

2007-08-09 Thread lisika
On 9 elo, 18:41, Masayuki Nakano <[EMAIL PROTECTED]> wrote: > https://bugzilla.mozilla.org/attachment.cgi?id=275972 I think there is an error in the "Should break inside" list where the second item says that the combination SPACE+NBSP is breakable. According to UAX 14, a non-breaking character pr

Re: Line Breaking

2007-08-09 Thread Masayuki Nakano
Boris Zbarsky wrote: > [EMAIL PROTECTED] wrote: >> Well, I'm not an expert on chemistry, so I'll have to refer to Alan >> Wood who originally raised the issue with chemical names: >> https://bugzilla.mozilla.org/show_bug.cgi?id=95067#c78 > > He raises an excellent suggestion: disallow breaks at '-

Re: Line Breaking

2007-08-08 Thread Boris Zbarsky
[EMAIL PROTECTED] wrote: > Well, I'm not an expert on chemistry, so I'll have to refer to Alan > Wood who originally raised the issue with chemical names: > https://bugzilla.mozilla.org/show_bug.cgi?id=95067#c78 He raises an excellent suggestion: disallow breaks at '-' if this would result in a 4

Re: Line Breaking

2007-08-08 Thread lisika
On 6 elo, 21:50, Masayuki Nakano <[EMAIL PROTECTED]> wrote: > [EMAIL PROTECTED] wrote: > > Not breaking dates at all may be a good idea in itself, but I would > > consider it more important to allow at least some kind of breaks in > > long chemical names, such as '2-bromo-4,4-dichlorophenol'. It ma

Re: Line Breaking

2007-08-06 Thread Justin Wood (Callek)
Masayuki Nakano wrote: > David E. Ross wrote: >> On 8/5/2007 12:43 PM, Masayuki Nakano wrote [in part]: 4. DEGREE SIGN is not breakable if after character is character class. But it's still breakable if after character is numeric, for compatibility. >> >> Thus, >> xxx xxx xxx 12

Re: Line Breaking

2007-08-06 Thread Masayuki Nakano
her the typographical benefits really overweight the danger of > confusing some users. > > We should also keep in mind that the consistent line breaking > principles in Latin scripts sometimes allowed characters to be used > even in unconventional ways. For example, occasionally the

Re: Line Breaking

2007-08-06 Thread lisika
ader it's kind of natural to assume a space there. The same goes for allowing breaks at curly backets, semicolons etc. Thus, for each exception, we should consider carefully whether the typographical benefits really overweight the danger of confusing some users. We should also keep in mind that th

Re: Line Breaking

2007-08-06 Thread Masayuki Nakano
David E. Ross wrote: > On 8/5/2007 12:43 PM, Masayuki Nakano wrote [in part]: >>> 4. DEGREE SIGN is not breakable if after character is character class. But >>> it's >>> still breakable if after character is numeric, for compatibility. > > Thus, > xxx xxx xxx 126° 15' 32" > near the end of

Re: Line Breaking

2007-08-05 Thread David E. Ross
On 8/5/2007 12:43 PM, Masayuki Nakano wrote [in part]: >> >> 4. DEGREE SIGN is not breakable if after character is character class. But >> it's >> still breakable if after character is numeric, for compatibility. Thus, xxx xxx xxx 126° 15' 32" near the end of a line could break as

Re: Line Breaking

2007-08-05 Thread Masayuki Nakano
Hi, all. I posted new patch to bug 389056. https://bugzilla.mozilla.org/show_bug.cgi?id=389056 https://bugzilla.mozilla.org/attachment.cgi?id=275308&action=diff This patch has many fixes from feedbacks of this thread and other bugs. > Masayuki Nakano (Mozilla Japan) 2007-08-05 02:31:20 PDT >

Re: Line Breaking

2007-08-05 Thread Boris Zbarsky
[EMAIL PROTECTED] wrote: [long mail snipped] I pretty much agree with this e-mail. I wonder how hard some of this is to implement... but it would be great if we can do it. -Boris ___ dev-tech-layout mailing list dev-tech-layout@lists.mozilla.org htt

Re: Line Breaking

2007-08-04 Thread David E. Ross
On 8/4/2007 11:08 AM, [EMAIL PROTECTED] wrote [in part]: > > In English, '$' is usually followed by a number ($20), while in some > other languages, the currency symbol comes after the number ('20$' or > '20 $' -- the latter example preferably with a no-break space). Also, > in some languages, ca

Re: Line Breaking

2007-08-04 Thread lisika
On 2 elo, 20:19, fantasai <[EMAIL PROTECTED]> wrote: > Masayuki Nakano wrote: > > And we should also break after '&' and ';' or '=' > > I don't see a problem with breaking after ';'. I can't recall how they're > particularly relevant to URLs, but I also can't think of any cases where > that would b

Re: Line Breaking

2007-08-04 Thread lisika
/linebr.html Basically, the generic (language-independent) line breaking rules should be as simple as possible while at the same time trying to respect the conventions of natural languages. Thus, each character should default to the kind of line breaking that was most likely expected of it in its

Re: Line Breaking

2007-08-04 Thread lisika
/linebr.html Basically, the generic (language-independent) line breaking rules should be as simple as possible while at the same time trying to respect the conventions of natural languages. Thus, each character should default to the kind of line breaking that was most likely expected of it in its

Re: Line Breaking

2007-08-03 Thread Boris Zbarsky
long organic chemical names. I'm not sure there's a perfect solution there. ;( > Note that if *authors* hope that the text should not broken around > hyphen, they should use non-breakable hyphen (U+2011). There's plenty of plain ASCII text we end up rendering (e-mail comes

Re: Line Breaking

2007-08-03 Thread Masayuki Nakano
[EMAIL PROTECTED] wrote: > On Aug 4, 12:51 pm, Masayuki Nakano <[EMAIL PROTECTED]> wrote: >> Japanese language can break everywhere except a few exceptions. I.e., we >> are breaking words always. So, for us, the old rule (only breaks around >> SPACE) is strange. Because some points (around punctuat

Re: Line Breaking

2007-08-03 Thread Masayuki Nakano
Boris Zbarsky wrote: > David E. Ross wrote: >> dates (e.g., 8/2/07, which I would usually write "2Aug07" to avoid >> confusion with 8Feb07) > > Arguably, the '-' in 2007-08-02 should not break either. Nor should the > '-' in "Examples 1-5". Nor in 2007-Aug-02. '-', '/' and '=' are not brea

Re: Line Breaking

2007-08-03 Thread [EMAIL PROTECTED]
On Aug 4, 12:51 pm, Masayuki Nakano <[EMAIL PROTECTED]> wrote: > Japanese language can break everywhere except a few exceptions. I.e., we > are breaking words always. So, for us, the old rule (only breaks around > SPACE) is strange. Because some points (around punctuations and > parentheses), we ca

Re: Line Breaking

2007-08-03 Thread Masayuki Nakano
[EMAIL PROTECTED] wrote: > On Aug 2, 10:12 pm, Masayuki Nakano <[EMAIL PROTECTED]> wrote: >> [EMAIL PROTECTED] wrote: >>> I really want to find out what the jp-critical need for linebreaking >>> of Latin-1 text is. >> we need to break URLs in most cases, therefore, I think, we should break >> after

Re: Line Breaking

2007-08-03 Thread [EMAIL PROTECTED]
On Aug 2, 10:12 pm, Masayuki Nakano <[EMAIL PROTECTED]> wrote: > [EMAIL PROTECTED] wrote: > > I really want to find out what the jp-critical need for linebreaking > > of Latin-1 text is. > > we need to break URLs in most cases, therefore, I think, we should break > after '/' for path part of URLs.

Re: Line Breaking

2007-08-02 Thread Boris Zbarsky
David E. Ross wrote: > dates (e.g., 8/2/07, which I would usually write "2Aug07" to avoid > confusion with 8Feb07) Arguably, the '-' in 2007-08-02 should not break either. Nor should the '-' in "Examples 1-5". Nor in 2007-Aug-02. -Boris ___ de

Re: Line Breaking

2007-08-02 Thread David E. Ross
On 8/2/2007 10:19 AM, fantasai wrote [in part]: > Masayuki Nakano wrote: >> [EMAIL PROTECTED] wrote: >>> I really want to find out what the jp-critical need for linebreaking >>> of Latin-1 text is. >> we need to break URLs in most cases, therefore, I think, we should break >> after '/' for path pa

Re: Line Breaking

2007-08-02 Thread Boris Zbarsky
fantasai wrote: > I don't see a problem with breaking after ';'. I can't recall how they're > particularly relevant to URLs The same way that '?' is, basically. I agree that this is a reasonable place to break, not only in URIs but in general. -Boris __

Re: Line Breaking

2007-08-02 Thread fantasai
Masayuki Nakano wrote: > [EMAIL PROTECTED] wrote: >> I really want to find out what the jp-critical need for linebreaking >> of Latin-1 text is. > > we need to break URLs in most cases, therefore, I think, we should break > after '/' for path part of URLs. And also we should break after '\' for

Re: Line Breaking

2007-08-02 Thread Masayuki Nakano
[EMAIL PROTECTED] wrote: > I really want to find out what the jp-critical need for linebreaking > of Latin-1 text is. we need to break URLs in most cases, therefore, I think, we should break after '/' for path part of URLs. And also we should break after '\' for windows file path too. And we sho

Re: Line Breaking

2007-07-30 Thread [EMAIL PROTECTED]
On Jul 27, 7:53 am, fantasai <[EMAIL PROTECTED]> wrote: > There's a meta bug at >https://bugzilla.mozilla.org/show_bug.cgi?id=206152 > but I think we need a little more centralized planning. Filing individual > bugs isn't going to give us a coherent picture of what we're doing and what > we wan

Line Breaking

2007-07-26 Thread fantasai
Masayuki recently landed bug 255990 (Characters below U+0100 not subject to line-breaking rules), which has improved our line breaking in some cases (we break after slashes!) and made it worse in others (we break 's/he'!) https://bugzilla.mozilla.org/show_bug.cgi?id=255990 There&#x

Re: white-space:pre and line breaking

2006-08-04 Thread fantasai
Robert O'Callahan wrote: Jean-Marc Desperrier wrote: Well I'm not expert about that but I just had a look in the CSS2 norm, and you can find this about using 'white-space: pre' : http://www.w3.org/TR/1998/REC-CSS2-19980512/text.html#white-space-prop [...] Lines are only broken at newlines in

Re: white-space:pre and line breaking

2006-07-27 Thread Jean-Marc Desperrier
Jean-Marc Desperrier wrote: Shifting the topic a bit, currently using , and therefore also text/plain, breaks a lot of i18n features. It's not nice when combining character stop to combine [...] Or when you can not use ZWJ (U+200D) in gmail composition window : http://familleandries.iquebec.co

Re: CJK line breaking

2006-07-05 Thread Robert O'Callahan
L. David Baron wrote: We do? nsTextTransformer only seems to invoke it in the *Unicode* functions, and https://bugzilla.mozilla.org/show_bug.cgi?id=255990 says we don't. Actually, I should have thought about that bug earlier. There, people are advocating *always* using the JISX4051 rules. Tha

Re: CJK line breaking

2006-07-05 Thread Robert O'Callahan
Robert O'Callahan wrote: L. David Baron wrote: On Thursday 2006-07-06 11:26 +1200, Robert O'Callahan wrote: While working on some inline layout stuff, I've run into nsJISx4051LineBreaker (which we use for all line breaking, actually). We do? nsTextTransformer only seems to i

Re: CJK line breaking

2006-07-05 Thread Robert O'Callahan
L. David Baron wrote: On Thursday 2006-07-06 11:26 +1200, Robert O'Callahan wrote: While working on some inline layout stuff, I've run into nsJISx4051LineBreaker (which we use for all line breaking, actually). We do? nsTextTransformer only seems to invoke it in the *Unicode* func

Re: CJK line breaking

2006-07-05 Thread L. David Baron
On Thursday 2006-07-06 11:26 +1200, Robert O'Callahan wrote: > While working on some inline layout stuff, I've run into > nsJISx4051LineBreaker (which we use for all line breaking, actually). We do? nsTextTransformer only seems to invoke it in the *Unicode* func

CJK line breaking

2006-07-05 Thread Robert O'Callahan
While working on some inline layout stuff, I've run into nsJISx4051LineBreaker (which we use for all line breaking, actually). Apparently it's intended to work like this: if a word (delimited by whitespace) contains at least one CJK character, then we apply JISX4051 rules to break w

Re: white-space:pre and line breaking

2006-07-05 Thread L. David Baron
On Thursday 2006-07-06 10:18 +1200, Robert O'Callahan wrote: > Jean-Marc Desperrier wrote: > > Well I'm not expert about that but I just had a look in the CSS2 norm, > > and you can find this about using 'white-space: pre' : > > http://www.w3.org/TR/1998/REC-CSS2-19980512/text.html#white-space-prop

Re: white-space:pre and line breaking

2006-07-05 Thread Robert O'Callahan
Jean-Marc Desperrier wrote: > Well I'm not expert about that but I just had a look in the CSS2 norm, > and you can find this about using 'white-space: pre' : > http://www.w3.org/TR/1998/REC-CSS2-19980512/text.html#white-space-prop > [...] Lines are only broken at newlines in the source, or at >

Re: white-space:pre and line breaking

2006-07-05 Thread Jean-Marc Desperrier
Justin Wood (Callek) wrote: Robert O'Callahan wrote: We break runs of CJK ideographs between certain ideograph pairs using some standard algorithm. We suppress this breaking in runs that are white-space:pre. On one hand, that means authors using white-space to prevent breaking in regular text al

Re: white-space:pre and line breaking

2006-07-04 Thread Justin Wood (Callek)
Robert O'Callahan wrote: We break runs of CJK ideographs between certain ideograph pairs using some standard algorithm. We suppress this breaking in runs that are white-space:pre. On one hand, that means authors using white-space to prevent breaking in regular text also have it working by analogy

white-space:pre and line breaking

2006-07-04 Thread Robert O'Callahan
We break runs of CJK ideographs between certain ideograph pairs using some standard algorithm. We suppress this breaking in runs that are white-space:pre. On one hand, that means authors using white-space to prevent breaking in regular text also have it working by analogy in CJK text. On the other