[Bug 1258546] Re: Apache2 defaults to the wrong character set, it should be UTF-8

2014-02-07 Thread Launchpad Bug Tracker
[Expired for apache2 (Ubuntu) because there has been no activity for 60 days.] ** Changed in: apache2 (Ubuntu) Status: Incomplete => Expired -- You received this bug notification because you are a member of Ubuntu Server Team, which is subscribed to apache2 in Ubuntu.

[Bug 1258546] Re: Apache2 defaults to the wrong character set, it should be UTF-8

2013-12-09 Thread Robie Basak
Thank you for taking the time to report this bug and helping to make Ubuntu better. I have checked both Precise and Trusty, and can find no windows-1252 default that you refer to. I used wget -S to see the headers returned by the Apache server, and it did not specify a character set. Could you

[Bug 1258546] Re: Apache2 defaults to the wrong character set, it should be UTF-8

2013-12-09 Thread Lars Noodén
I can do the Server system, too, but right now the steps I have followed to get the problem are:
1. install Ubuntu 12.04 desktop, or Lubuntu 14.04devel desktop (it occurs on both)
2. install Apache2, leaving default configuration settings
3. load an html page from the server in a browser (in

[Bug 1258546] Re: Apache2 defaults to the wrong character set, it should be UTF-8

2013-12-09 Thread Lars Noodén
The one browser is
Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:25.0) Gecko/20100101 Firefox/25.0
HTTP_ACCEPT Headers:
text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
gzip, deflate
en,en-us;q=0.7,sv;q=0.3
The other is: Mozilla/5.0 (X11; Ubuntu; Linux i686; rv:25.0)

[Bug 1258546] Re: Apache2 defaults to the wrong character set, it should be UTF-8

2013-12-09 Thread Lars Noodén
I've done a fresh installation from the ubuntu-12.04.3-server-i386.iso image and installed Apache2. The Firefox web browser still shows that the pages being served are encoded in windows-1252 instead of UTF-8, which is what the locale is set to, or ISO-8859 which would be the old standard.

[Bug 1258546] Re: Apache2 defaults to the wrong character set, it should be UTF-8

2013-12-09 Thread Robie Basak
I believe browsers typically try to guess. If Apache serves a page that doesn't have any non-ASCII characters in it, then browsers can guess, and windows-1252 would still be correct, since the document was a strict subset of this charset. What happens if you serve a UTF-8 encoded file? What does
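
To illustrate the point about ASCII-only pages, here is a minimal Python sketch. The sample page bytes are made up for the example, and `cp1252` is Python's codec name for windows-1252. It only shows that pure-ASCII bytes decode identically under each encoding, so a browser's guess cannot be observed to be wrong on such a page.

```python
# Hypothetical ASCII-only page body, invented for this illustration.
ascii_page = b"<html><body>Hello, world</body></html>"

# Decode the same bytes under three encodings and compare the results.
decoded = {enc: ascii_page.decode(enc) for enc in ("ascii", "cp1252", "utf-8")}
all_same = len(set(decoded.values())) == 1

print(all_same)
```

The difference only becomes visible once the file contains bytes outside the ASCII range.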

[Bug 1258546] Re: Apache2 defaults to the wrong character set, it should be UTF-8

2013-12-09 Thread Lars Noodén
If I serve a UTF-8 encoded file *AND* set the default myself in Apache, then everything is fine. If the default encoding is left alone, Apache serves it up as windows-1252 and then UTF-8 encoded letters come out as garbage like this: åäöÅÄÖéÉ As seen from the browser HTTP_ACCEPT Headers,
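
The kind of garbage described here can be reproduced outside a browser. The following is a rough Python sketch, not what Firefox does internally: it encodes the letters from the message as UTF-8 and then decodes those bytes as windows-1252 (`cp1252` in Python), which is the mismatch being reported.

```python
# Letters taken from the message above.
text = "åäöÅÄÖéÉ"

# What the file on disk contains when saved as UTF-8.
utf8_bytes = text.encode("utf-8")

# What a reader sees if those bytes are interpreted as windows-1252.
garbled = utf8_bytes.decode("cp1252")
print(garbled)

# The damage is reversible as long as every byte maps to a cp1252 character.
restored = garbled.encode("cp1252").decode("utf-8")
print(restored == text)
```

Each non-ASCII letter occupies two bytes in UTF-8, so it shows up as two unrelated characters when read as a single-byte encoding.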

[Bug 1258546] Re: Apache2 defaults to the wrong character set, it should be UTF-8

2013-12-09 Thread Robie Basak
> If the default encoding is left alone, Apache serves it up as windows-1252 and then UTF-8 encoded letters come out as garbage like this: åäöÅÄÖéÉ

I do not see this behaviour:

root@trusty:/var/www# xxd test.txt
000: 5363 6872 c3b6 6469 6e67 6572 2773 2043 Schr..dinger's C
010:
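
As a cross-check of the hex dump, the first row can be decoded by hand. This small Python sketch uses only the 16 bytes visible in the row above; the rest of test.txt is not shown in the message and is not assumed here.

```python
# The 16 bytes from the first row of the xxd output quoted above.
hex_row = "5363 6872 c3b6 6469 6e67 6572 2773 2043"
raw = bytes.fromhex(hex_row)

# c3 b6 is the two-byte UTF-8 sequence for "ö", which xxd prints as "..".
print(raw.decode("utf-8"))  # Schrödinger's C
```

This suggests the test file itself is stored as UTF-8, independent of what any header says.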

[Bug 1258546] Re: Apache2 defaults to the wrong character set, it should be UTF-8

2013-12-09 Thread Lars Noodén
If wget is not seeing the wrong encoding then it may be a problem with Firefox instead. However, the steps to reproduce are
1. install Ubuntu 12.04 desktop, or Lubuntu 14.04devel desktop (it occurs on both)
2. install Apache2, leaving default configuration settings
3. load an html page from the

[Bug 1258546] Re: Apache2 defaults to the wrong character set, it should be UTF-8

2013-12-09 Thread Robie Basak
Sorry, your test case involving Firefox isn't sufficient to determine validity of a bug in Apache. What is Apache actually sending to Firefox in your case?

[Bug 1258546] Re: Apache2 defaults to the wrong character set, it should be UTF-8

2013-12-09 Thread Lars Noodén
It looks like the problem is Firefox then. If no default is set, then it sends wget 'Content-Type: text/html'. If the default is set to utf-8, then it sends wget 'Content-Type: text/html; charset=utf-8'
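
The two header values reported here can be told apart programmatically. The sketch below uses Python's standard `email.message.Message` to pull the charset parameter out of a Content-Type value; the helper name `charset_of` is made up for this example, and the two inputs are the header values quoted in the message.

```python
from email.message import Message


def charset_of(content_type: str):
    """Return the charset parameter of a Content-Type value, or None if absent."""
    msg = Message()
    msg["Content-Type"] = content_type
    return msg.get_content_charset()


# Header values as reported in the message above.
print(charset_of("text/html"))                 # None
print(charset_of("text/html; charset=utf-8"))  # utf-8
```

When the result is None the server has declared no encoding at all, and the client is left to guess.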