On Wed, Dec 12, 2001 at 12:32:17PM +0000, Markus Kuhn wrote:
> Jarkko Hietaniemi wrote on 2001-12-11 21:44 UTC:
> > My spec is at home but I think it's illegal in subsequent text.
> > (Blindly concatenating text for several files could of course
> > lead into such a situation.)
> 
> The BOM is illegal nowhere. The BOM is a perfectly normal Unicode

Rats.  That's what I get from making things up without looking things up.

> character, namely the ZERO WIDTH NO-BREAK SPACE. Browsers must display
> it exactly as such (that is: not display a strange character), wherever
> it appears. When you test this in your browser, it is also a good

To confuse the issue I think in Unicode 3.2 the BOM will become just
BOM, and a new character will take the ZWNBS role.  As if applications
had already been aware of the old rules...

> opportunity to test that the Plane 15 tagging characters are not
> displayed as well.
> 
> Some recommendations for treating the BOM under Unix and in encoding
> converters are in
> 
>   http://www.cl.cam.ac.uk/~mgk25/unicode.html
> 
> Markus
> 
> -- 
> Markus G. Kuhn, Computer Laboratory, University of Cambridge, UK
> Email: mkuhn at acm.org,  WWW: <http://www.cl.cam.ac.uk/~mgk25/>

-- 
$jhi++; # http://www.iki.fi/jhi/
        # There is this special biologist word we use for 'stable'.
        # It is 'dead'. -- Jack Cohen

Reply via email to