Unicode characters counted as multiple characters

2014-05-23 Thread Ion Toloaca
Hello everyone, I have been trying for some time to get the start and end positions of the last matched rule, and I ran into trouble when I tried an example that contained Unicode. Here is a simplified version below that shows that the position (the return value of the read() method) is

Re: Unicode characters counted as multiple characters

2014-05-23 Thread Deyan Ginev
Interestingly enough, if you remove all calls to encode() you get a parse that prints out 1, as you would expect. Usually you only want to call encode() on strings that you don't type in the source yourself. Hope that helps, Deyan On Fri, May 23, 2014 at 5:18 PM, Ion Toloaca toloaca...@gmail.com wrote:
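The effect described above can be reproduced without any parser. This is a minimal sketch, not the example from the original message (which is elided here), and it assumes only the core Encode module; the variable names are illustrative. It shows that a character string holding U+2260 has length 1, while its UTF-8 encoded form has length 3, so any position counted over the encoded string would plausibly advance by 3 instead of 1.

```perl
use strict;
use warnings;
use Encode qw(encode);

# One character: NOT EQUAL TO (U+2260), written as an escape so the
# result does not depend on how this source file is saved.
my $chars = "\x{2260}";

# encode() turns the character string into a byte string (octets E2 89 A0).
my $bytes = encode('UTF-8', $chars);

my $char_count = length $chars;   # counts characters
my $byte_count = length $bytes;   # counts octets

print "characters: $char_count\n";
print "bytes:      $byte_count\n";
```

Dumping both strings with Data::Dumper is a quick way to check which of the two forms a program is actually holding.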

Re: Unicode characters counted as multiple characters

2014-05-23 Thread Durand Jean-Damien
I forgot the output! $VAR1 = \[ [ \x{2260} ] ]; ≠: OK, pos=1 Correct, isn't it? ;-) May I suggest you use Data::Dumper as well; I am pretty sure you think your string is Unicode when it is not. Jean-Damien. On Friday 23