In <[EMAIL PROTECTED]> on 06 May 2004,
   Neven Luetic <[EMAIL PROTECTED]> wrote:
> Is there a way to tell HTML::Entities the input encoding to use?

> The only remaining possibility I see for dealing with this problem is
> to translate the input to iso-8859-1 before parsing it or to turn
> encoding off completely. This seems pretty awkward to me.

I've dealt with this same issue and, as far as I know, there is no
way to control the input and output encodings under perl 5.6.  Thus,
HTML::TreeBuilder, out of the box, only works well for ISO-8859-1
content.  (With perl 5.8, there is support for specifying encodings
when opening filehandles.)
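
For reference, a minimal sketch of what the 5.8 filehandle support looks
like. The scratch file and its sample content are made up for
illustration; only the :encoding layer on open() is the point here.

```perl
use strict;
use warnings;
use File::Temp qw(tempfile);

# Write some raw UTF-8 bytes to a scratch file (hypothetical sample data).
my ($fh, $path) = tempfile(UNLINK => 1);
binmode($fh);
print {$fh} "caf\xC3\xA9\n";    # "cafe" with e-acute, encoded as UTF-8
close($fh) or die "close: $!";

# perl 5.8+: the :encoding layer decodes bytes into characters on read.
open(my $in, '<:encoding(UTF-8)', $path) or die "open $path: $!";
my $line = <$in>;
close($in);
chomp $line;

# The string now holds four characters rather than five bytes.
printf "%d chars, last is U+%04X\n", length($line), ord(substr($line, -1));
```

The same layer syntax works for output ('>:encoding(...)'), so the
decoding and encoding happen at the I/O boundary instead of in the
parsing code.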

My 5.6 solution is precisely the one you suggested: manually
converting the content from/to the desired encodings.
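
A rough sketch of that conversion step, not the exact code I use. It
relies on the Encode module, which ships with perl 5.8; that is an
assumption of this example, and on 5.6 a separate conversion module from
CPAN would have to take its place. The recode() helper and the sample
string are hypothetical.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical helper: re-encode a byte string from one charset to another.
sub recode {
    my ($octets, $from, $to) = @_;
    return encode($to, decode($from, $octets));
}

my $utf8_in = "caf\xC3\xA9";    # UTF-8 bytes for "cafe" with e-acute

# Convert to ISO-8859-1 before handing the content to the parser.
my $latin1 = recode($utf8_in, 'UTF-8', 'ISO-8859-1');

# ... parse and process $latin1 here, producing ISO-8859-1 output ...

# Convert the result back to the encoding the caller wants.
my $utf8_out = recode($latin1, 'ISO-8859-1', 'UTF-8');

print length($latin1), " bytes in ISO-8859-1; ",
      ($utf8_out eq $utf8_in ? "round trip ok" : "round trip mismatch"), "\n";
```

Note that this only round-trips cleanly for characters that exist in
ISO-8859-1; anything outside that repertoire cannot survive the
intermediate step.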

-ccwf
-- 
Charles C. Fu                           ,--
Founder                ___  __ __. . ,-/--
Web i18n, LLC              (_,(_,|/|/ /
www.web-i18n.net                 ----'
