Helge Hafting <[EMAIL PROTECTED]> writes:
| Angus Leeming wrote:
| > UTF-8 is a multi-byte encoding. It's useful for output to file
| > because the data are stored as characters (bytes). So, much of a
| > UTF-8 encoded file will be human readable; only the multi-byte
| > sequences will not.
| >
| Actually, the multibyte sequences are human readable
| too, if the human is reading them on an xterm, a linux console,
You mean viewable. xterm (etc.) read the utf-8 for you and shows the
correct glyph (or grapheme... whatever)
--
Lgb