H. Peter Anvin writes: > The point is that I don't think iconv should emit BOMs unless you > explicitly ask for them. The only existing standard for UTF-16 is RFC 2781, and it recommends this behaviour: Any labelling application that uses UTF-16 character encoding, and puts an explicit charset label on the text, and does not know the serialization order of the characters in text, MUST label the text as "UTF-16", and SHOULD make sure the text starts with 0xFEFF. You could argue that putting a BOM is the application's duty, not iconv's business, but that would be painful for all applications which try to use iconv. And unlabelled data (e.g. files on a filesystem) shouldn't use UTF-16 or its variants in the first place, that what UTF-8 is for. > Do they emit BOMs if you ask for UTF-16BE? No, because RFC 2781 says "Systems labelling UTF-16BE text MUST NOT prepend a BOM to the text." Bruno - Linux-UTF8: i18n of Linux on all levels Archive: http://mail.nl.linux.org/lists/
- Re: iconv output utf-8 -> utf-16, which one is wro... Edmund GRIMLEY EVANS
- Re: iconv output utf-8 -> utf-16, which one i... Mark Leisher
- RE: iconv output utf-8 -> utf-16, which one is wro... Karlsson Kent - keka
- Re: iconv output utf-8 -> utf-16, which one is wro... Bruno Haible
- Re: iconv output utf-8 -> utf-16, which one i... Ulrich Drepper
- Re: iconv output utf-8 -> utf-16, which one is wro... H. Peter Anvin
- Re: iconv output utf-8 -> utf-16, which one is wro... Bruno Haible
- Re: iconv output utf-8 -> utf-16, which one is wro... Mark Leisher
- Re: iconv output utf-8 -> utf-16, which one i... Jean-Marc Desperrier
- Re: iconv output utf-8 -> utf-16, which one is wro... H. Peter Anvin
- Re: iconv output utf-8 -> utf-16, which one is wro... Bruno Haible
- Re: iconv output utf-8 -> utf-16, which one is wro... H. Peter Anvin
- Re: iconv output utf-8 -> utf-16, which one is wro... Edmund GRIMLEY EVANS
- Re: iconv output utf-8 -> utf-16, which one i... Bruno Haible
- Re: iconv output utf-8 -> utf-16, which one i... H. Peter Anvin
