On Fri, 21 Jan 2011, Erland Sommarskog wrote:
> "Jan Dubois" ([email protected]) writes:
> > You need to stack the I/O layers in the right order. The :encoding()
> > layer needs to come last (be at the bottom of the stack), *after* the
> > :crlf layer adds the additional carriage returns. The way to pop the
> > default :crlf layer is to start out with the :raw pseudo-layer:
> >
> > open(my $fh, ">:raw:encoding(UTF-16LE):crlf", $filename) or die $!;
>
> So this works. But this does not:
>
> use strict;
>
> open F, '>slask.out';
> binmode(F, ':raw:encoding(UTF16-LE):crlf');
> print F "Alfa\nBeta\nGamma\n";
>
> Looking at the file in a binary editor, I see:
>
> 41 00 6C 00 66 00 61 00 0D 0A 00 42 00 65 00 74
> 00 61 00 0D 0A 00 47 00 61 00 6D 00 6D 00 61 00
> 0D 0A 00
>
> In total 35 bytes. Which is a very odd number for a UTF16 file.
I've double-checked with Leon, who thinks that this is due to bug 38456:
http://rt.perl.org/rt3//Public/Bug/Display.html?id=38456
He made a patch to fix the bug, and the patch has been applied to
bleadperl already. I ran you sample script with 5.13.9 plus his
patch, and it generates a correct 38 bytes file. I'm not sure
if this change could/should be picked for a 5.12.4 release as
well, but I guess it probably won't. But 5.14 should be out
in April or May anyways...
It looks like there is still a lot of brokenness lurking in
the internals of the Perl I/O layer implementation. :(
Cheers,
-Jan