In <[EMAIL PROTECTED]> on 06 May 2004, Neven Luetic <[EMAIL PROTECTED]> wrote: > Is there a way to tell HTML::Entities the input encoding to use?
> The only remaining possibility I see to deal with this problem, is > to translate the input to iso-8859-1 before parsing it or to turn > encoding off completely. This seems pretty awkward to me. I've dealt with this same issue and, as far as I know, there is no way to control the input and output encodings under perl 5.6. Thus, HTML::TreeBuilder, out of the box, only works well for ISO-8859-1 content. (With perl 5.8, there is support for specifying encodings when opening filehandles.) My 5.6 solution is precisely the one you suggested: manually converting the content from/to the desired encodings. -ccwf -- Charles C. Fu ,-- Founder ___ __ __. . ,-/-- Web i18n, LLC (_,(_,|/|/ / www.web-i18n.net ----'
