Re: Unicode characters

2009-05-25 Thread Andreas J. Koenig
On Sun, 24 May 2009 10:09:25 +0200, Juerd Waalboer ju...@convolution.nl said: Although it's safe on output, it's better to get used to using :encoding(utf8) instead of :utf8. Using :utf8 on input can cause stability and security issues. That's new to me. Do you have a link that backs

Re: Unicode characters

2009-05-23 Thread Andreas J. Koenig
On Fri, 22 May 2009 20:49:24 +0530, Saravanan Balaji saravanan.bal...@morganstanley.com said: Could you please help to know what i am missing or doing wrong. I'll greatly appreciate the help. I think all you're missing is (1) that a script written in utf8 needs to declare that fact with

Re: utf8::encode causes can't find SWASHNEW error

2006-09-08 Thread Andreas J. Koenig
On Thu, 07 Sep 2006 15:20:34 -0400, Dan Ragle [EMAIL PROTECTED] said: Hi! I'm having trouble getting my head around this issue and I'm hoping someone can spare a few moments to explain what's going on. On a Perl 5.6.1 server, the following reduction: ---

Re: is it utf8 or unicode?

2005-03-12 Thread Andreas J Koenig
On Wed, 09 Mar 2005 20:03:08 +, [EMAIL PROTECTED] said: binmode FH1, :raw; Make this binmode FH1, :utf8; I don't understand what the [UTF8 \x{c4}] \x{x4} is valid perl notation for the Unicode character 0xc4. % perl -le ' my $data = \xC4; binmode STDOUT, :utf8; print $data ; '

Re: Keeping byte-wise processing as an option

2004-01-02 Thread Andreas J Koenig
On Fri, 02 Jan 2004 18:17:13 -0500, Martin Duerst [EMAIL PROTECTED] said: Jungshik has also reported that it fails with Perl 5.8.0 with an UTF-8 locale. Perl 5.8.0 was very broken with UTF-8 locales since it auto-PERL_UNICODEd. We saw (keep seeing) a lot of that since RedHat 8 and 9

Re: \W and [\W]

2004-01-01 Thread Andreas J Koenig
On Wed, 31 Dec 2003 16:21:36 +0100, Eric Cholet [EMAIL PROTECTED] said: Can anyone enlighten me as to why \W behaves differently depending on wether it's inside or outside of a character class, for certain characters: I have reported this as bug 18281 http://guest:[EMAIL

Re: UTF-8 case conversion

2003-09-03 Thread Andreas J Koenig
On Wed, 3 Sep 2003 13:05:21 +0200 (CEST), [EMAIL PROTECTED] said: I wrote a small script (see below), trying to transform RJAN LUNDSTRM into rjan Lundstrm, but it seems to fail, probably because of locale related problems. My question is then simply. How do I do this then? Do not

Re: Inverse of /\p{script}/

2003-08-29 Thread Andreas J Koenig
On Fri, 29 Aug 2003 11:08:33 +0100, Nick Ing-Simmons [EMAIL PROTECTED] said: But cyrillic glyphs are likely double width :-( This is one of reasons I want to do _something_ in this area. I don't want to even try and read a big 16-bit Japanese font just to get cyrillic (for SPAMer's

Re: issues with unpack function

2003-04-02 Thread Andreas J. Koenig
On Wed, 2 Apr 2003 11:16:10 +0530 , KORSAPATI,PRATEEP (HP-India,ex2) [EMAIL PROTECTED] said: unpack('U*', $string) This expression is resulting in a null string if the string does not have any unicode characters.. How to avoid this situation.. Please use the perlbug script to

Re: Encode 1.87 and later don't pass make test on static perl

2003-03-06 Thread Andreas J. Koenig
On Wed, 05 Mar 2003 11:10:54 -0800, Blair Zajac [EMAIL PROTECTED] said: If I can find tar.gz's of Encode 1.86 and 1.87, I could check again to ensure that 1.86 and 1.87 work and fail respectively. http://mirrors.develooper.com/perl/backpan/authors/id/D/DA/DANKOGAI/ -- andreas

Re: Odd regexp behavior

2003-02-27 Thread Andreas J. Koenig
On Wed, 26 Feb 2003 22:20:19 +0200, Jarkko Hietaniemi [EMAIL PROTECTED] said: A bug? Was seemingly broken still in 5.8.0, but 5.8.1-to-be seems to get this right. (I don't off-hand remember this particular kind of problem but there were some s/// fixes that might have helped.) I'll

Re: Sending a Unicode character in an e-mail subject line

2003-02-26 Thread Andreas J. Koenig
On Wed, 26 Feb 2003 08:03:32 +0100, Henning Mller-Nielsen [EMAIL PROTECTED] said: Hi I am new to this list, but I have tried to search through the archives, to no avail. I hope you can help me. I am trying to send a Unicode character in an e-mail subject line, without success.

Re: Problem displaying UTF8 characters in HTML generated by a Perl sc ript

2003-01-23 Thread Andreas J. Koenig
On Thu, 23 Jan 2003 15:00:33 -0500, Tay, William [EMAIL PROTECTED] said: I am developing a Web application using Perl and a database. The Perl script is responsible for getting data from the database and then generate a HTML page. The data to be retrieved is encoded in UTF8. Here's 

Re: CGI and UTF

2002-11-20 Thread Andreas J. Koenig
On Wed, 20 Nov 2002 15:57:43 -, Mark Proctor [EMAIL PROTECTED] said: I'm having some problems with XML/UTF8 and CGI variables in perl5.6.1 If you have any chance to upgrade to perl-5.8.0, please do it now. The Unicode model of 5.8.0 is much more mature than that of 5.6.* and the number

Re: Problems with XML - What exactly does Cannot decode string with wide characters mean?

2002-11-12 Thread Andreas J. Koenig
On Mon, 11 Nov 2002 23:37:12 -0800, Daisuke Maki [EMAIL PROTECTED] said: utf82euc( $xml-findvalue( 'foobar' ) ); where utf82euc() is a convenience function that I wrote which does: my $octets = decode( 'utf8', $text ); Decode doesn't return octets. return

Unicode::Normalize surprise with dotless i

2002-09-05 Thread Andreas J. Koenig
Hi, Tomoyuki, is it a bug in Unicode::Normalize or in my code: I expected that for combining a circumflex with a small letter i, I'd have to use the dotless i, but to my surprise, NFC refuses to combine with the dotless i. Here's a demo progam: % perl -le ' use Unicode::Normalize; use Encode;

Re: Encode 1.76 Released

2002-09-01 Thread Andreas J. Koenig
On Sat, 31 Aug 2002 23:34:09 +0900, Dan Kogai [EMAIL PROTECTED] said: On Friday, August 30, 2002, at 08:48 , Andreas J. Koenig wrote: Hi Dan, today I revisited enc2xs and found three things missing: Okay - enc2xs doesn't write a MANIFEST file: this would be handy

Re: Encode 1.76 Released

2002-08-30 Thread Andreas J. Koenig
Hi Dan, today I revisited enc2xs and found three things missing: - enc2xs doesn't write a MANIFEST file: this would be handy as the innocent user doesn't know which files need to be included in a distribution - no -h or --help option - no -v or --version option I'd volunteer to add all

Re: 2 Suprises w/5.8.0

2002-08-01 Thread Andreas J. Koenig
On Thu, 1 Aug 2002 06:33:07 +0300, Jarkko Hietaniemi [EMAIL PROTECTED] said: Pre-5.8 way of Unicode (or, even worse, pre-5.6 way of Unicode) simply is not compatible, and trying to bridge the gap is probably worse than its worth. I agree with Jarkko if you write new code. But for old

Re: 2 Suprises w/5.8.0

2002-08-01 Thread Andreas J. Koenig
On Thu, 01 Aug 2002 09:22:52 +0100, Nick Ing-Simmons [EMAIL PROTECTED] said: Andreas J. Koenig [EMAIL PROTECTED] writes: A scalar that is going to be passed to some extension, be it Compress::Zlib, Apache::Request or any extension that has no mention of Unicode in the manpage

Re: an api

2002-04-10 Thread Andreas J. Koenig
On Wed, 10 Apr 2002 04:04:42 +0900, Dan Kogai [EMAIL PROTECTED] said: On Wednesday, April 10, 2002, at 03:58 , Jarkko Hietaniemi wrote: Encode::Alias::resolve_alias(), to go with define_alias()? Well, be it resolve_alias(). Maybe it's nit-picking, but I find this one additional

Re: Text::Unidecode

2002-04-10 Thread Andreas J. Koenig
On Wed, 10 Apr 2002 02:01:34 -0600, Sean M. Burke [EMAIL PROTECTED] said: Random question: Has anyone besides me had occasion to use Text::Unidecode? Sorry, no. I was interested, but my decoding needs are both so limited and Text::Unidecode didn't meet my requirements, so I made up my own

Re: Encode seriously broken

2002-04-01 Thread Andreas J. Koenig
On Mon, 1 Apr 2002 23:38:57 +0900, Dan Kogai [EMAIL PROTECTED] said: Would you send me the application that is causing this? I've sent you in PM a testcase that exhibits the bug for me. If anybody else is interested: the sample is a 65kB text file and a 10 line perl script. If you're

Some more missing encodings (Was: patch 15589/Encode ...)

2002-03-29 Thread Andreas J. Koenig
On Sat, 30 Mar 2002 15:24:06 +0900, Dan Kogai [EMAIL PROTECTED] said: Okay. I've checked http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/ One more time and it seems that other missing encodings are available as well, such as korean. I'll look into that. I've already

Encode anomalies (Was: [Encode] 1.00 released at last!)

2002-03-29 Thread Andreas J. Koenig
All the warnings below seem bogus to me. % /usr/local/perl-5.7.3@15620/bin/perl -wle ' use Encode qw(from_to); $x = \x{df}; from_to($x,utf-8,iso8859-1); ' Use of uninitialized value in subroutine entry at