The input and the pattern for PCRE (regex) are both treated as UTF-8, so a character above 127 is read as part of a multibyte character. Is this what you intended? If not, open the regex script and see rxutf8.

Try your tests again after doing rxutf8_jregex_ 0.
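
To illustrate the point about characters above 127, here is a minimal sketch in Python rather than J. It uses Python's strict UTF-8 decoder as a stand-in, on the assumption that PCRE's UTF-8 mode judges a lone high byte in a similar way; it does not call PCRE or the J regex script. The helper name is_valid_utf8 is made up for this example.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes as strict UTF-8 (Python's decoder, not PCRE)."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


# Check each byte value on its own, much as the test line applies
# the match to each element of a. separately.
invalid_alone = [b for b in range(256) if not is_valid_utf8(bytes([b]))]

# A character above 127 that has been encoded properly occupies two bytes.
encoded = "é".encode("utf-8")

print(invalid_alone[0], invalid_alone[-1], len(invalid_alone))
print(encoded, is_valid_utf8(encoded))
```

In this sketch every byte from 128 to 255 is rejected when it stands alone, while the two-byte encoding of the same kind of character is accepted. It says nothing about the NUL case mentioned in the original message.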

----- Original Message -----
From: "Henry Rich" <[EMAIL PROTECTED]>
To: "'Beta forum'" <[email protected]>
Sent: Tuesday, August 08, 2006 4:45 PM
Subject: [Jbeta] regex on characters above 127 doesn't work


I see, using the test line

I. (0 = [: #[: '</*st1[^>]*>'&rxmatches ,&'<st1:>')"0 a.


that regex stops processing strings that contain characters
greater than 127 (or the NUL character).  Is this the way
it's supposed to work?  Looking at the PCRE documentation,
I don't see why characters above 127 should fail, unless some
option has been set to choose that behavior.  I wish they
wouldn't.

Henry Rich

----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm
