On Tue, 11 Jan 2005, Marc wrote:

[UTF-8 problems]

>I have looked further into this. It seems that tv_grab_de_tvtoday
>already spits out double-encoded data in some cases, and every
>filter (such as tv_remove_* or tv_imdb) just makes the problem worse.

If you find an example where tv_grab_de_tvtoday puts out
double-encoded data, please install the DB_File perl module and then
you can say

% tv_grab_de_tvtoday --cache >tv.xml

Then send both the tv.xml and the generated tv_grab_de_tvtoday.cache
(suitably compressed) to the mailing list.  If you can narrow down the
choice of days and channels, so much the better.

Is the tv_grab_de_tvtoday output invalid, that is, does nsgmls report
errors?  (See <http://home.vr-web.de/stefan-siegl/xmltv/validating>).

>I also experimented with binmode STDOUT in various scripts (binmode
>STDIN makes some output latin1 for example). It seems that xmltv
>does its own encoding and then lets perl encode again, or something
>similar.

Until now xmltv has essentially ignored the question of encoding.  I
need to sit down and read
<http://www.ahinea.com/en/tech/perl-unicode-struggle.html>, and
anything else people can recommend, and work out what the right thing
is.
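
For illustration, the kind of double encoding described above can be
reproduced in a few lines.  This is only a sketch in Python, not xmltv
code; the sample string and the helper undo_double_encoding are made
up for the example.  It assumes the second pass treats the UTF-8 bytes
as Latin-1 characters, which is one common way this happens but not
necessarily what happens in the grabber.

```python
# Illustration only: how UTF-8 text ends up double-encoded.
# The sample string is hypothetical ("M", u-umlaut, "ller").
text = "M\u00fcller"

# Correct: encode the characters to UTF-8 exactly once.
once = text.encode("utf-8")

# Bug: the UTF-8 bytes are mistaken for Latin-1 characters
# and encoded to UTF-8 a second time.
twice = once.decode("latin-1").encode("utf-8")


def undo_double_encoding(data: bytes) -> bytes:
    """Reverse one extra UTF-8 pass if possible, else return data unchanged.

    Hypothetical helper: a heuristic, not a guaranteed repair.
    """
    try:
        # Map each decoded character back to a single byte ...
        candidate = data.decode("utf-8").encode("latin-1")
        # ... and accept the result only if it is itself valid UTF-8.
        candidate.decode("utf-8")
    except (UnicodeDecodeError, UnicodeEncodeError):
        return data
    return candidate


print(once)
print(twice)
print(undo_double_encoding(twice) == once)
```

The check is a heuristic: text that legitimately contains a sequence
such as "A-tilde, one-quarter" would be "repaired" by mistake, so it is
better suited to diagnosing a file than to use as an automatic filter.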

-- 
Ed Avis <[EMAIL PROTECTED]>



