On Thu, Jan 24, 2013 at 10:09 PM, Clinton Ebadi <[email protected]>wrote:
> Zrajm C Akfohg <[email protected]> writes: > > On Wed, Jan 23, 2013 at 7:46 AM, Clinton Ebadi <[email protected]> > wrote: > > Zrajm C Akfohg <[email protected]> writes: > > > > Using UTF-8 does not seem to work properly when > > I set a variable in the main file (using <!--#set var="X_TITLE" > > value="Innehåll." -->) and then use that variable inside a file > > included with SSI (with <!--#echo var="X_TITLE" -->). > > > > An example is here http://zrajm.org/mat/ (look at the headline -- it's > > supposed to say "Innehåll" = "Contents" in Swedish -- but instead > > there are funny characters instead of the a with a ring above). It is > > as if Apache has decided that the variable has latin-1 content, even > > though addDefaultCharset "utf-8"; is in use). > > The document in question has the following meta tag (from .head.shtml): > > <meta http-equiv="content-type" content="text/html; charset=ISO-8859-1"> > > Which overrides the HTTP Content-Type. > > I think removing that should fix it; if not let me know, hopefully it's > not a deficiency in domtool or our apache rig (but if it is, I want to > fix it naturally). After removing the <meta> tag I still get the exact same behavior (still on page http://zrajm.org/mat/). All the relevant files are written in UTF-8, as evidenced by: $ file -i .foot.shtml .head.shtml index.shtml .foot.shtml: text/html; charset=utf-8 .head.shtml: text/html; charset=utf-8 index.shtml: text/html; charset=utf-8 And using wget to see the server headers give me the following $ wget -O- --save-headers 'http://zrajm.org/mat/' --2013-01-25 00:58:49-- http://zrajm.org/mat/ Resolving zrajm.org (zrajm.org)... 69.90.123.70 Connecting to zrajm.org (zrajm.org)|69.90.123.70|:80... connected. HTTP request sent, awaiting response... 200 OK Length: unspecified [text/html] Saving to: `STDOUT' HTTP/1.1 200 OK Date: Thu, 24 Jan 2013 23:58:50 GMT Server: Apache/2.2.16 (Debian) Accept-Ranges: bytes Vary: Accept-Encoding Keep-Alive: timeout=15, max=100 Connection: Keep-Alive Transfer-Encoding: chunked Content-Type: text/html; charset=utf-8 X-Pad: avoid browser bug ...... So it does look like Apache think the file to be in utf8 format. I would think that apache have some different opinion on the encoding of environment variables, however. May LANG, or the LC_* locale variables need to be set in Apache's environment? This thread on Stack Overflow [ http://stackoverflow.com/questions/539661/server-side-includes-and-character-encoding] suggest using the Apache setting "AddCharset UTF-8 .shtml" -- Though it is not clear whether this will actually fix the problem or not. I seem to recall (from an older discussion where I had utf-8 problems with mod_autoindex) that "AddCharset" cannot be set from doomtool, a hasty googling seems to confirm this as I can only find mentions of "addDefaultCharset" for doomtool (and not "addCharset"). Interestingly HTML entities seem to work *partially* as well. (å and å works and produces the expected "å"; but – does not work and is inserted literally on the page, – [en-dash's numeric entry] is removed completely). /zrajm
_______________________________________________ HCoop-Help mailing list [email protected] https://lists.hcoop.net/listinfo/hcoop-help
