Hi!
Many many thanks for answer!

Problem solved with a StringBuffer! :jumping:


Michael Glavassevich-3 wrote:
> 
> 
> Hi,
> 
> Fuzzo <[EMAIL PROTECTED]> wrote on 10/22/2008 03:54:18 AM:
> 
>> Hi all!
>>
>> Let me explain the problem with an example.
>> I have to parse an XML in this form:
>>
>> <anomaly id="0012" severity="4">some_text_with_%_symbol</anomaly>
>>
>> With Xerces1 SAX parser, the element text (some_text_with_%A7_symbol) is
>> parsed in one solution with full length invoking the characters(char[]
> ch,
>> int start, int length) method.
>>
>> With Xerces2, the element text is parsed in 30 bytes slot and the method
> is
>> invoked some times until the text element is fully parsed.
>>
>> Now, in my application the text element is sometimes encoded with
>> java.net.URLEncoder class and then decoded with java.net.URLDecoder.
>>
>> With Xerces2, happens that the element substring can be in form of
>> first_part_of_text_% and URLDecoder can't handle correctly the final %
> char,
>> giving me a URLDecoder: Incomplete trailing escape (%) pattern because it
>> does not find the 2 following chars (ex.: %A7 means the ยง symbol in
> Cp1252
>> encoding).
>>
>> There is a way to configure Xerces2 to parse text elements in only one
>> solution?
> 
> No. characters() may be called multiple times [1][2] for contiguous text.
> You cannot assume it will only be called once. Your ContentHandler needs
> to
> accumulate the text returned in each call of characters() until you
> receive
> a callback that isn't characters.
> 
>> Many thanks!
>>
>>
>> --
>> View this message in context: http://www.nabble.com/Xerces2-vs-
>> Xerces1-Element-Text-Parsing-Implementation-tp20105730p20105730.html
>> Sent from the Xerces - J - Users mailing list archive at Nabble.com.
>>
>>
>> ---------------------------------------------------------------------
>> To unsubscribe, e-mail: [EMAIL PROTECTED]
>> For additional commands, e-mail: [EMAIL PROTECTED]
> 
> Thanks.
> 
> [1]
> http://xerces.apache.org/xerces2-j/javadocs/api/org/xml/sax/ContentHandler.html#characters(char[],%20int,%20int)
> [2] http://xerces.apache.org/xerces2-j/faq-sax.html#faq-2
> 
> Michael Glavassevich
> XML Parser Development
> IBM Toronto Lab
> E-mail: [EMAIL PROTECTED]
> E-mail: [EMAIL PROTECTED]
> 

-- 
View this message in context: 
http://www.nabble.com/Xerces2-vs-Xerces1-Element-Text-Parsing-Implementation-tp20105730p20197262.html
Sent from the Xerces - J - Users mailing list archive at Nabble.com.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to