On Tue, 11 Jan 2005, Marc wrote: [UTF-8 problems]
>I have looked further into this. It seems that tv_grab_de_tvtoday >already spitzs out double-encoded data in some cases, and every >filter (such as tv_remove_* or tv_imdb) just make the problem worse. If you find an example where tv_grab_de_tvtoday puts out double-encoded data, please install the DB_File perl module and then you can say % tv_grab_de_tvtoday --cache >tv.xml Then send both the tv.xml and the generated tv_grab_de_tvtoday.cache (suitably compressed) to the mailing list. If you can narrow down the choice of days and channels, so much the better. Is the tv_grab_de_tvtoday output invalid, that is, does nsgmls report errors? (See <http://home.vr-web.de/stefan-siegl/xmltv/validating>). >I also experimented with binmode STDOUT in various scripts (binmode >STDIN makes some output latin1 for example). It seems that xmltv >does it's own encoding and then lets perl encode again, or sth. >similar. Until now xmltv has essentially ignored the question of encoding. I need to sit down and read <http://www.ahinea.com/en/tech/perl-unicode-struggle.html>, and anything else people can recommend, and work out what the right thing is. -- Ed Avis <[EMAIL PROTECTED]> -- To UNSUBSCRIBE, email to [EMAIL PROTECTED] with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]