The input and pattern for pcre (regex) is UTF-8. So a character above
127 is the start of a multibyte character. Is this what you intended? If
not, open regex script and see rxutf8.
Try your tests again after doing rxutf8_jregex_ 0.
----- Original Message -----
From: "Henry Rich" <[EMAIL PROTECTED]>
To: "'Beta forum'" <[email protected]>
Sent: Tuesday, August 08, 2006 4:45 PM
Subject: [Jbeta] regex on characters above 127 doesn't work
I see, using the test line
I. (0 = [: #[: '</*st1[^>]*>'&rxmatches ,&'<st1:>')"0 a.
that regex stops processing strings that contain characters
greater than 127 (or the /NUL character). Is this the way
it's supposed to work? Looking at the PCRE documentation,
I don't see why characters above 127 should fail, unless some
option has been set to choose that behavior. I wish they
wouldn't.
Henry Rich
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm