Aki Yoshida created CAMEL-7584:
----------------------------------

             Summary: XML-Aware Tokenizer failing with utf-8 multibyte 
characters
                 Key: CAMEL-7584
                 URL: https://issues.apache.org/jira/browse/CAMEL-7584
             Project: Camel
          Issue Type: Bug
          Components: camel-core
            Reporter: Aki Yoshida
            Assignee: Aki Yoshida
             Fix For: 2.14.0


There is some issue in the underlining Stax reader's  
getLocation().getCharOffset() when the input data is an InputStream to the stax 
reader.

This issue was brought up in the woodstox community. But I believe fixing it 
seems to be non trivial as woodstox internally uses char/Reader and keeps the 
offset value to the character sequence and not to the original input stream.

We change the tokenzer to pass java.io.Reader to the woodstox parser instead of 
passing java.io.InputStream directly.






--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to