Marco Cimarosti schreef:

> > Is this in accordance with the Unicode standard, or do I have
> > to remove the second BOM?
>
> IMHO, Unicode should not specify such a behavior. Deciding what a
shell

IMHO, it should. The guideline that says a text file can have a
U+FEFF at the beginning, but it really shouldn't have U+FEFFs
elsewhere, implies that a second BOM should be removed if possible.
Of course this should be done only if the operating system knows the
files are text files, either implicitly by checking the file types,
or by the user manually forcing the OS to treat them as such.
In that case, removing the BOM that would end up somewhere in the
middle is the natural thing to do, just as removing the EOF marker
at the end of the first file is.
I'm not going into the implementation part; just pointing out that
this issue is not something an operating system can ignore.

Pim Blokland


Reply via email to