Hi John,

characters() may be called multiple times for contiguous text. It's one of 
the finer details of SAX. This gets reported so often it has its own FAQ 
[1].

[1] http://xml.apache.org/xerces2-j/faq-sax.html#faq-2

John Wright <[EMAIL PROTECTED]> wrote on 05/12/2005 03:48:04 PM:

> Hi,
> 
> I am relatively new to parsing XML via Xerces and Sax, with Java.  I 
followed
> some examples I located on the web, and got my code working, with two
> exceptions. I'll deal with one of them here.
> 
> Some of the XML looks like:
> 
> <tag>first line
> 2nd line
> 3rd line</tag>
> 
> The start and end tags are noted (Sax events), but the char array 
contains
> only "first line" and loses the rest of the text. The variable 'length' 
in
> this case returns 10. The XML comes from an external source and is 
beyond my
> control. The files are also quite large, some over 60 MB, so I wasn't 
able to
> use my old method of importing the entire XML file as a string and 
letting
> digester do the work.
> 
> So, is this a known issue, and is there a fix or workaround?
> 
> let me know what additional information I need to provide.
> 
> Thanks,
> 
> -John
> 
> 
> 
> Discover Yahoo! 
> Find restaurants, movies, travel and more fun for the weekend. Check it 
out! 
> http://discover.yahoo.com/weekend.html 
> 
> 
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional commands, e-mail: [EMAIL PROTECTED]
> 

Michael Glavassevich
XML Parser Development
IBM Toronto Lab
E-mail: [EMAIL PROTECTED]
E-mail: [EMAIL PROTECTED]


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to