On Sun, Jul 1, 2012 at 12:30 PM, Anthony J. Bentley
<anthonyjbent...@gmail.com> wrote:
>> So again, the complaint was that there was mojibake gibberish in
>> Ingo's presentation, because the character encoding isn't specified
>> but defaults to UTF-8 in modern browsers, while the page is actually
>> iso-8859-1 encoded.
>
> Actually, "modern" browsers do not default to a particular encoding (in
> fact, this violates the HTML standard). Instead, they attempt to autodetect
> the charset. Sometimes this works, and sometimes it doesn't -- I've seen
> UTF-8 pages incorrectly detected as ISO-8859-1, and in particularly bad
> cases, vice versa.

i would consider firefox a modern browser, and it does not default to
autodetect. it defaults to iso-8859-1

however, the gui does not allow per html doctype default charset, so a
management configured browser would apply default charset to html1, 4,
... n

there should be no case where this is a problem. all pages should be
html 4 to avoid these silly exchanges. it would be nice if some sort
of style guide clearly stated "pages in www/ are html4, charset
explicitly set to iso-8859-1". in the absence of that, we have these
discussions. having a www/STYLE doc does not require committing to a
particular templating language so hopefully it's a realistic
short-term goal

Reply via email to