> For ext2 and almos all "traditional" unix fs they don't care about
    > encodings, it is just bytes. so if you type in utf-8 they are stored
    > in utf-8.
    > 

    This behaviour is mandated by POSIX.  Any nonzero sequence of bytes
    except '/' and '\0' are legal in filenames, in any order.  The only
    ones which have special meaning are "." and "..".

I am not so sure of that. POSIX-2001 says:

Filename

 A name consisting of 1 to {NAME_MAX} bytes used to name a file.
The characters composing the name may be selected from the set
of all character values excluding the slash character and the null byte.
The filenames dot and dot-dot have special meaning.

Byte

 An individually addressable unit of data storage that is exactly an octet,
used to store a character or a portion of a character

Character

 A sequence of one or more bytes representing a single graphic symbol
or control code. 

Andries
--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to