On Wed, 08 Nov 2006 00:48:39 -0500
mickg <[EMAIL PROTECTED]> wrote:

> Just to put my money where my mouth is, I have implemented a (stupid)
> prototype that does: If no known charset is native to libxml2
> detected , a recompiled version of mod_proxy_html now uses iconv
> (eventually via the xmlFindCharEncodingHandler function) to convert
> from the source encoding to UTF-8.

Interesting.  You've gone one up on my aliasing proposal, for
what looks like rather less work than I thought that would take.
I might snarf the basic idea for Version 3.

> If no encoding info is specified, it assumes windows-1251 (yes,
> stupid, but still).

But not stupid if we make it a configurable default!



> It does work on my _own_ website, where it quite happily converts
> win-1251 to utf-8. Once I fix the memory leak (any help appreciated),
> I'll be happy.

See http://www.apachetutor.org/dev/pools for an easy way to
deal with the memory.

> And a great many thanks to Nick Kew for getting me off my lazy ... to
> start coding  (which, honestly, I am better at than administering
> systems).

:-)

> BTW, I still have no clue why I cannot do this with mod_charset_lite.

Neither am I.  But a closer look at mod_charset_lite has been on
my TODO list for so long it's probably on a permanent back-burner.
Did you also look at the full mod_charset?   AIUI it was written by
Russian developers, so cyrillic was presumably important to them.

-- 
Nick Kew

Application Development with Apache - the Apache Modules Book
http://www.apachetutor.org/

---------------------------------------------------------------------
The official User-To-User support forum of the Apache HTTP Server Project.
See <URL:http://httpd.apache.org/userslist.html> for more info.
To unsubscribe, e-mail: [EMAIL PROTECTED]
   "   from the digest: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to