IRI support in URI and URI::Escape modules

2005-01-31 Martin Duerst
Dear Perl Unicode Experts, I tried to have a look at how much would have to be done to get the URI and URI::Escape modules to support IRIs in a reasonable way. The IRI spec has just been published as an IETF Proposed Standard at http://www.ietf.org/rfc/rfc3987.txt. Also, a new version of the URI

Keeping byte-wise processing as an option

2004-01-02 Martin Duerst
Dear Perl Unicode experts, http://www.perldoc.com/perl5.8.0/pod/perlunicode.html says: "In future, Perl-level operations will be expected to work with characters rather than bytes." I very much appreciate all your hard work on the internationalization of Perl. However, recently I have been

Re: Keeping byte-wise processing as an option

2004-01-02 Martin Duerst
Hello Jarkko, Many thanks for your very quick answer. At 00:31 04/01/03 +0200, Jarkko Hietaniemi wrote: In future, Perl-level operations will be expected to work with characters rather than bytes. I very much appreciate all your hard work on the internationalization of Perl. However, recently

Re: UTF-16 - UTF-8

2001-11-20 Martin Duerst
At 17:25 01/11/20 +0100, Philip Newton wrote: PS: Does anyone know of - even an odd looking It would look really, really, odd. - Fixed pitch Unicode font including Western European, CJK, Cyrillic and Greek glyphs (ie: most Left to Right data) ? It's not for an end-user, it's for techies

Re: please test Text::Unicode::Normalize + Sort::UCA

2001-08-12 Martin Duerst
At 00:31 01/08/13 +0200, Bjoern Hoehrmann wrote: * Simon Cozens wrote: I'd suggested another name, but since Unicode::Normalize has already been taken I can only think of Unicode::UTR15 # not appropriate, since it doesn't cover KD and KC Well, probably you mean D and DK, which formally

Re: UTF-8 in web pages

2001-08-10 Martin Duerst
At 22:39 01/08/06 -0700, Brian Stell wrote: "Netscape 4.x and earlier did not use Unicode." Not completely true. Netscape 4 used Unicode, but only in a separate code path, i.e. legacy encodings didn't get converted to Unicode. Regards, Martin.

Re: UTF-8 in web pages

2001-08-06 Martin Duerst
At 10:02 01/08/06 -0700, Tim Walters wrote: Markus Kuhn wrote: Andrew McNaughton wrote on 2001-08-05 10:38 UTC: You can send Unicode directly to the Web browser. Just make sure you announce in the HTTP header that the body is encoded in UTF-8. Sounds nice, but in practice this

Re: [proposal] utility module for Hangul Syllables

2001-08-02 Martin Duerst
I think having two different methods, one to parse long names (HANGUL SYLLABLE GA) and one to parse short names (GA) may be cleaner. Regards, Martin. At 01:25 01/08/03 +0900, SADAHIRO Tomoyuki wrote: Sorry, there're some typos. On Fri, 03 Aug 2001 00:55:47 +0900 SADAHIRO Tomoyuki [EMAIL