map well to the latest rendition of Windows codepage
1252; which is to say a combinitorial explosion that would be difficult to
verify for accuracy.
Peter Prymmer
> Bjoern Hoehrmann wrote:
> >
> > Hi,
> >
> >Something I _really_ miss, is a section about i18n in POD, e
On Thu, 12 Jul 2001, Nick Ing-Simmons wrote:
> Peter Prymmer <[EMAIL PROTECTED]> writes:
> >> Is someone currently working on incorporating ICU into Perl?
> >
> >I think Nick Ing-Simmons took some codepage data from ICU and
> >folded it into the Encode mo
erl
> internals which, I would guess, is the hard part.
A matter of perspective I suppose :-)
Peter Prymmer
DING, $string[, CHECK])
Encodes string from Perl's internal form into I and returns
a sequence of octets. For CHECK see L.
=item *
$string = decode(ENCODING, $bytes[, CHECK])
Decode sequence of octets assumed to be in I into Perl's
internal form and returns the resulting string. For CHECK see
L.
etcetera. It was the work of Nick Ing-Simmons and is in bleedperl.
Peter Prymmer
ling
list I see regular postings from a *BSD person and a MacOS Darwin person.
I have not tried it myself yet (sigh).
Peter Prymmer
iconv() repetoire
is typically done by ordering extra I18N tapes from IBM. I've not looked
much into the GPLed iconv implementations plug in architecture though I
suppose that it is reasonably straight forward.
Peter Prymmer
n a little facetious
here ;-).
How firmly established is the Tcl scheme? Is it still being hammered out?
I do think that it would be nice to avoid yet another gratuitous file
format incompatability if possible. So how do the Tcl folks plan to
handle surrogates or truly unrecognized characters?
Peter Prymmer
roblem with abondoning the (admittedly awkward)
special-ness of 0x for this purpose is that perl .enc files would then
become incompatable with Tcl .enc files.
Peter Prymmer
the letter '1' is 49 (31 hex)
The code point for the letter is now 241 (F1 hex)
The code point for the letter '8' is 56 (38 hex)
The code point for the letter is now 248 (F8 hex)
The code point for the letter '9' is 57 (39 hex)
The code point for the letter is now 249 (F9 hex)
So shall I go ahead with a cp1047.enc plus cp37.enc and posix-bc.enc
patch and perhaps some additions to t/lib/encode.t ?
Thanks for your help.
Peter Prymmer
le going
to handle UTF16 surrogates?
Thanks for any information.
Peter Prymmer
is wrong?
Try
perldoc Unicode::Map8
and perhaps use that module(?)
Peter Prymmer
t, or isn't there yet.
Thanks for saying that. So how do I turn on HINT_BYTE instead of
HINT_UTF8 throught the compile of perl? Note that the one-liner given
above was 5.005_03 not 5.6.0. I would like to be able to compile 5.6.0
again on OS/390.
Peter Prymmer
tion is,
>
> How does one get utf8 text into Perl 5.6 strings?
Try Gisle Aas's Unicode::Map8 module from CPAN. I think that it
requires that you install Unicode::String ahead of it.
Peter Prymmer
eaks).
BTW, I think there was a perli18n.pod doc distributed with an earlier
version of perl.
Peter Prymmer
--- perli18n.html.orig Fri Dec 17 14:15:27 1999
+++ perli18n.html Fri Dec 17 14:43:13 1999
@@ -72,7 +72,14 @@
Q1. I think that I'm a clever programmer. What's
14 matches
Mail list logo