2009/10/29 Nick Kew <n...@webthing.com>

>
> 2009/10/28 Martin Gerdes <marting...@googlemail.com>
>
>> Background:
>> I am running Apache as provided for Debian Lenny (2.2.9-10+lenny4) as
>> a reverse proxy. To that end, I installed libapache2-mod-proxy-html
>> (3.0.0-1) for URL rewriting.
>> As the original content is in charset ISO-8859-1, I set the option
>> "ProxyHTMLCharsetOut *" to get the webpage output as ISO-8859 instead
>> of UTF-8.
>>
>> Everything worked just fine, except that a certain packet crashed the
>> proxy-html module (Paket went into the proxy, no paket came out;
>> according to the log, a process crashed at precisely that time...)
>>
>> I emailed the developer, who advised me to try the most recent version
>> (3.1.2) for which unfortunately no debian package exists. So I spent
>> half a day figuring out how to update the source debian archive with
>> the most recent source code. I managed it, installed it, and it
>> worked, and the module did not crash anymore. :-)
>>
>
> Hmmm, I thought I advised getting me a test case that could reproduce
> the crash, and your email on Oct 26th said the problem had gone.
>

Alright, so I simplified a bit :-)
The problem IS gone with the new version. I just failed to check whether
the conversion was still happening - which it turns out it wasn't. That
should have no bearing on the crash though, as the conversion back to
ISO-8859-1 happens after the links are replaced.

>> However, after continuing to play around with mod-proxy-html I noticed
>> that the output stays as utf-8 (having been converted from ISO-8859-1)
>> instead of being converted back to ISO-8859-1.
>> Note that 3.0.0 worked with precisely the same configuration (except
>> that it did not need xml2enc).
>>
>
> Precisely the same configuration would be incorrect. You need to switch
> from using apache's generic filter configuration to the new
> ProxyHTMLEnable directive. Hence the bold
> *Important Note*: Configuration has changed:
> on its front page.
>
> But maybe you've found a docs bug, and I need to add the note
> in more places, for existing users who don't check what you
> already know about! Where did you check?
>
>

I just went to http://apache.webthing.com/mod_proxy_html/ to get the
source code. However, I did not reread the page; I just fetched the
archive file. I did look within the archive for any kind of info, like
build instructions, a readme or a changelog, but there wasn't any. I also
distinctly remember looking on the website for a changelog, so an entry
there would have reached me.
Other than that: mea culpa.

However, I still do not have it working.

Relevant configuration in /etc/apache/apache2.conf:

ProxyHTMLEnable On
ProxyHTMLCharsetOut *
ProxyHTMLExtended On
ProxyHTMLMeta On
ProxyHTMLLogVerbose On
LogLevel info

(the line "SetOutputFilter proxy-html" has been commented out)

>grep -i charset /var/log/apache2/error.log
[Thu Oct 29 13:52:34 2009] [info] [client 192.168.0.9] Got charset ISO-8859-1 from HTTP headers

That line appears multiple times and no other charset is recognized, so
that part is fine.

I can load the login page of the webapp, though it is returned in UTF-8.
Once I send off the login data, it segfaults. It did NOT do that when I
used SetOutputFilter instead of ProxyHTMLEnable, so I assume that xml2enc
crashes, though that is only conjecture.

Line in the apache error log:
"[Thu Oct 29 14:18:22 2009] [notice] child pid 29443 exit signal Segmentation fault (11)"

It makes me cry, it really does :-(