Henry Spencer writes:

> It's worth noting that some implementations which generally hew pretty
> close to this make one exception:  they represent ASCII NUL (U+0000) as
> 11000000 10000000, so that 00000000 can be used as a terminator within
> programs without worrying that it will collide with a user character. 

Yes, the Java virtual machine uses this encoding for strings in .class
files. It is internal to Java and not visible at user level.
The JDK documentation of "java.io.DataOutput.writeUTF" says:
"Writes a Unicode string by encoding it using modified UTF-8 format."
Therefore I don't think people will be misled to use this function for
text output.

Bruno
-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/

Reply via email to