In message <[EMAIL PROTECTED]>, Chan dramouli Kharidehal writes:
>How do I detect Unicode characters using the ORO package?
>For example, using \w I can detect all the ASCII characters.
>How about the Unicode characters beyond ASCII?

As I said before:

>\d matches based on Character.isDigit() and \w matches based on
>Character.isLetterOrDigit() or '_'.  So, you see, it's all Unicode based
>on Java's interpretation of how to classify characters with some attempt
>to remain true to Perl (e.g., the inclusion of '_' in \w).  Even though
>the source is somewhat inscrutable, these types of questions can be
>answered by looking at the source.

If you skimmed it the first time, please reread the first sentence of my
original reply, which explicitly answers your question.  Since
Character.isLetterOrDigit(), like any other Character.isFoo() method, is
not restricted to ASCII, neither are \w, \d, or any of the other character
set specifiers.  Also pay special attention to the last sentence of my
original reply :)

daniel
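
For anyone who wants to check the classification without reading the ORO source, here is a minimal sketch in plain Java. It does not call ORO at all; it only exercises the two java.lang.Character methods named above, on the assumption that \w and \d defer to them as described. The class name UnicodeWordCheck and the helpers isWordChar and isDigitChar are hypothetical names made up for this illustration.

```java
// Sketch only: mirrors the rule described in the reply, not ORO's actual code.
class UnicodeWordCheck {

    // Assumed \w rule: Character.isLetterOrDigit(c) or an underscore.
    static boolean isWordChar(char c) {
        return Character.isLetterOrDigit(c) || c == '_';
    }

    // Assumed \d rule: Character.isDigit(c).
    static boolean isDigitChar(char c) {
        return Character.isDigit(c);
    }

    public static void main(String[] args) {
        char[] samples = {
            'a',       // ASCII letter
            '_',       // underscore, kept for Perl compatibility
            '-',       // ASCII punctuation, not a word character
            '\u00e9',  // Latin small letter e with acute
            '\u0436',  // Cyrillic small letter zhe
            '\u4e2d',  // CJK ideograph
            '\u0663'   // Arabic-Indic digit three
        };
        for (char c : samples) {
            System.out.println("U+" + Integer.toHexString(c)
                + " word=" + isWordChar(c)
                + " digit=" + isDigitChar(c));
        }
    }
}
```

Every sample except '-' is reported as a word character, and the Arabic-Indic digit also counts as a digit, which is the non-ASCII behaviour the reply describes.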