On 5/7/02 11:25 AM, Gisle Aas wrote: > John Siracusa <[EMAIL PROTECTED]> writes: >> On 5/7/02 10:58 AM, Paul Lindner wrote: >>> The output from your example looks like UTF-8 data (Ã is a >>> commonly seen UTF-8 escape sequence). XML::Parser converts all >>> incoming text into UTF-8. You will need to convert it back to >>> iso-8859-1. >>> >>> My favorite is Text::Iconv >>> >>> use Text::Iconv; >>> $utf8tolatin1 = Text::Iconv->new("UTF-8", "ISO8859-1"); >>> >>> my $buffer_latin1 = $converter->convert($buffer); >> >> So HTML::Entities only works with ISO8859-1 (or ASCII, presumably)? > > Not true. But the unicode support in perl-5.6.x has many bugs. With > 5.8 things will be better. It is a bad idea for XML::Parser to give > out strings with the UTF8 flag set.
Well, I'll let your guys figure it out (all fixed in 5.8, right? :) In the meantime, I guess I'll stick with the workaround(s) posted... :) -John