Is the Subject field of an e-mail an obvious example of plain text where no higher level protocol application is possible?

2012-07-20 Thread Karl Pentzlin
Looking for an example of plain text which is obvious to anybody, it seems to me that the Subject field of e-mails is a good example. Common e-mail software lets you enter any text but gives you never access to any higher-level protocol. Possibly you can select the font in which the subject line

Re: Is the Subject field of an e-mail an obvious example of plain text where no higher level protocol application is possible?

2012-07-20 Thread Philippe Verdy
The Subject filed is subject to special encoding like Quoted-Printable or Base64 using specific prefixes. This is necessary because the MIME headers spreciying the ail encoding only applies to the mail body but not to the headers themselves. For this reason it is not stricly plain text.

Re: Is the Subject field of an e-mail an obvious example of plain text where no higher level protocol application is possible?

2012-07-20 Thread Asmus Freytag
On 7/20/2012 8:41 AM, Karl Pentzlin wrote: Looking for an example of plain text which is obvious to anybody, it seems to me that the Subject field of e-mails is a good example. By common convention, certain notational features have been relegated to styled text. Super and subscript in

Re: Is the Subject field of an e-mail an obvious example of plain text where no higher level protocol application is possible?

2012-07-20 Thread Jukka K. Korpela
2012-07-20 19:52, Philippe Verdy wrote: The Subject fi[el]d is subject to special encoding like Quoted-Printable or Base64 using specific prefixes. This is a matter of character encoding. All plain text inevitably has some encoding, and the encoding may vary without changing the plain text

RE: Is the Subject field of an e-mail an obvious example of plain text where no higher level protocol application is possible?

2012-07-20 Thread Shawn Steele
A) it can use quoted-printable B) See RFC 6532/6530 - Now it can be UTF-8 :) -Shawn

Unicode String Models

2012-07-20 Thread Mark Davis ☕
I put together some notes on different ways for programming languages to handle Unicode at a low level. Comments welcome. http://macchiati.blogspot.com/2012/07/unicode-string-models-many-programming.html Macchiato »http://macchiati.blogspot.com/2012/07/unicode-string-models-many-programming.html

Re: Is the Subject field of an e-mail an obvious example of plain text where no higher level protocol application is possible?

2012-07-20 Thread Jukka K. Korpela
2012-07-20 20:19, Asmus Freytag wrote: On 7/20/2012 8:41 AM, Karl Pentzlin wrote: Looking for an example of plain text which is obvious to anybody, it seems to me that the Subject field of e-mails is a good example. By common convention, certain notational features have been relegated to

Re: Unicode String Models

2012-07-20 Thread David Starner
On Fri, Jul 20, 2012 at 1:31 PM, Mark Davis ☕ m...@macchiato.com wrote: I put together some notes on different ways for programming languages to handle Unicode at a low level. Comments welcome. Macchiato » Many programming languages (and most modern software) have moved to Unicode model of

Re: Is the Subject field of an e-mail an obvious example of plain text where no higher level protocol application is possible?

2012-07-20 Thread Asmus Freytag
On 7/20/2012 1:34 PM, Jukka K. Korpela wrote: 2012-07-20 20:19, Asmus Freytag wrote: On 7/20/2012 8:41 AM, Karl Pentzlin wrote: Looking for an example of plain text which is obvious to anybody, it seems to me that the Subject field of e-mails is a good example. By common convention, certain

Re: Unicode String Models

2012-07-20 Thread martin
That means that it is best to optimize for BMP characters (and as a subset, ASCII and Latin-1), and fall into a ‘slow path’ when a supplementary character is encountered. I'm concerned about the statement/implication that one can optimize for ASCII and Latin-1. It's too easy for a lot of

RE: Unicode String Models

2012-07-20 Thread Murray Sargent
Mark wrote: “I put together some notes on different ways for programming languages to handle Unicode at a low level. Comments welcome.” Nice article as far as it goes and additions are forthcoming. In addition to multiple code units per character in UTF-8 and UTF-16, there are variation

Re: Unicode String Models

2012-07-20 Thread Mark Davis ☕
Thanks, nice article. We got into some of those hair caret positioning issues back at Apple; we even had a design that would associate a series of lines (which could be slanted and positioned) with a ligature, but ultimately 1/m gets you 99% of the value, with very little cost. (My article was

Re: Unicode String Models

2012-07-20 Thread Martin J. Dürst
On 2012/07/21 7:01, David Starner wrote: I'm concerned about the statement/implication that one can optimize for ASCII and Latin-1. It's too easy for a lot of developers to test speed with the English/European documents they have around and test correctness only with Chinese. I see the argument