On Sun, Jan 23, 2005 at 08:40:48PM +0000, Ed Avis <[EMAIL PROTECTED]> wrote:
> On Sun, 9 Jan 2005, Kenneth Pronovici wrote:
>
> >> <title lang="de">Das Geheimnis der S�dseeperlen</title>
> >>
> >>Output:
> >>
> >> <title lang="de">Das Geheimnis der Südseeperlen</title>
>
> Is the XML prolog (or whatever it's called) the same for both the
> input and output files?
The XML prolog says "iso-8859-1" for all files in question.
(I wasn't aware of that before, and it surprises me: XML parsers are only
required to support UTF-8 and UTF-16, not ISO-8859-1, so that is at least
a bit of an interoperability problem.)
> Are you sure the input isn't ISO-8859-1 and
> the output UTF-8?
I am sure that this *is* the case. That's the problem, in fact: the prolog
says latin1 while the output is UTF-8.
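
To make the mismatch concrete, here is a minimal Python sketch (an illustration only, not XMLTV code; the helper name read_as_declared is hypothetical). It assumes a reader that trusts the prolog and decodes the bytes as ISO-8859-1, and shows what happens when those bytes are really UTF-8:

```python
# Illustration only: what a prolog-trusting reader sees when the declared
# encoding (iso-8859-1) and the actual bytes (UTF-8) disagree.
title = "Das Geheimnis der S\u00fcdseeperlen"  # \u00fc is u-umlaut

latin1_bytes = title.encode("iso-8859-1")  # what the prolog promises
utf8_bytes = title.encode("utf-8")         # what was actually written


def read_as_declared(raw: bytes, declared: str = "iso-8859-1") -> str:
    """Decode the way a reader that believes the prolog would (hypothetical helper)."""
    return raw.decode(declared)


# ascii() keeps the printout independent of the terminal's own encoding.
print(ascii(read_as_declared(latin1_bytes)))  # umlaut survives as one character
print(ascii(read_as_declared(utf8_bytes)))    # umlaut becomes two characters
```

The u-umlaut is one byte (0xFC) in ISO-8859-1 but two bytes (0xC3 0xBC) in UTF-8, so a reader that believes the prolog turns each UTF-8 umlaut into two unrelated Latin-1 characters.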
> Does it also happen with tv_cat?
Yes, until I added "binmode STDIN; binmode STDOUT".
It's likely that XMLTV doesn't call binmode on output and so is sensitive
to the environment. Mine prefers UTF-8, so perl most likely writes UTF-8
bytes while XMLTV still declares encoding="iso-8859-1" in the prolog.
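
The general remedy can be sketched in Python as well (an analogy to the binmode workaround, not a description of how XMLTV or perl implement it). It assumes the writer bypasses any locale-dependent text layer and emits bytes encoded in exactly the encoding named in the prolog; DECLARED and write_document are hypothetical names:

```python
import io

DECLARED = "iso-8859-1"  # assumption: the encoding named in the prolog


def write_document(body: str, raw_stream) -> None:
    """Encode explicitly so the bytes match the prolog, whatever the locale prefers."""
    prolog = f'<?xml version="1.0" encoding="{DECLARED}"?>\n'
    raw_stream.write((prolog + body).encode(DECLARED))


# io.BytesIO stands in for a binary stdout such as sys.stdout.buffer.
buf = io.BytesIO()
write_document('<title lang="de">Das Geheimnis der S\u00fcdseeperlen</title>\n', buf)
print(buf.getvalue())
```

Because the encoding step is explicit, the output no longer depends on whether the environment prefers UTF-8.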
--
The choice of a
-----==- _GNU_
----==-- _ generation Marc Lehmann
---==---(_)__ __ ____ __ [EMAIL PROTECTED]
--==---/ / _ \/ // /\ \/ / http://schmorp.de/
-=====/_/_//_/\_,_/ /_/\_\ XX11-RIPE