On Thu, Apr 08, 2004 at 04:17:41AM -0400, Michael B Allen wrote:
> >   - knows that the input is zero terminated
> 
> I have great difficulty in envisioning the opposite. Binary file formats
> and network protocols have a lot of zero terminated strings in all sorts
> of encodings.
> 
> >   - does not know whether this is an 8-bit, 16-bit or 32-bit
> >     wide and aligned zero
> 
> Again for me it's rare that an application would not need to know what
> data it's dealing with. Applications do not exist in a vacuum. You have to
> do I/O in which case the the encoding of text is usually predefined or
> negotiated. You do not always have the luxury of defining how text is
> represeted throughout the system.

However, the case where 1: data is zero-terminated *and* 2: you don't at
least know whether you're dealing with an 8-, 16- or 32-bit encoding is,
in my experience, non-existant.  After all, "zero-terminated" is
meaningless unless you know what "zero" means--an 8-bit, 16-bit or
32-bit zero?

-- 
Glenn Maynard

--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to