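Hi John,

characters() may be called multiple times for contiguous text, so a single callback is not guaranteed to carry all of an element's content. It's one of the finer details of SAX: the handler has to append each chunk to a buffer and only treat the text as complete once endElement() is called. This gets reported so often it has its own FAQ [1].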
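Here is a minimal sketch of the usual workaround, assuming you only care about the text inside <tag> as in your sample: buffer whatever each characters() call delivers and read the buffer in endElement(). The class name TagTextCollector and the collect() helper are made up for illustration, and the parser is obtained through the standard JAXP SAXParserFactory rather than any Xerces-specific class.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical handler: collects the complete text of every <tag> element.
class TagTextCollector extends DefaultHandler {
    private final StringBuilder buffer = new StringBuilder();
    private final List<String> texts = new ArrayList<>();
    private boolean insideTag = false;

    @Override
    public void startElement(String uri, String localName, String qName, Attributes attributes) {
        if ("tag".equals(qName)) {
            insideTag = true;
            buffer.setLength(0); // start a fresh buffer for this element
        }
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        // May be called several times for one run of text, so append
        // instead of treating a single call as the whole content.
        if (insideTag) {
            buffer.append(ch, start, length);
        }
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        if ("tag".equals(qName)) {
            texts.add(buffer.toString()); // the text is complete only here
            insideTag = false;
        }
    }

    List<String> getTexts() {
        return texts;
    }

    // Illustrative helper: parses an XML string and returns the text of each <tag>.
    static List<String> collect(String xml) {
        TagTextCollector handler = new TagTextCollector();
        try {
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new InputSource(new StringReader(xml)), handler);
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("Could not parse XML", e);
        }
        return handler.getTexts();
    }
}
```

For your 60 MB files you would pass a File or InputStream to SAXParser.parse() instead of a string, and process each value in endElement() rather than keeping them all in a list. The handler itself only ever holds one element's text in memory.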
[1] http://xml.apache.org/xerces2-j/faq-sax.html#faq-2

John Wright <[EMAIL PROTECTED]> wrote on 05/12/2005 03:48:04 PM:

> Hi,
>
> I am relatively new to parsing XML via Xerces and Sax, with Java. I followed
> some examples I located on the web, and got my code working, with two
> exceptions. I'll deal with one of them here.
>
> Some of the XML looks like:
>
> <tag>first line
> 2nd line
> 3rd line</tag>
>
> The start and end tags are noted (Sax events), but the char array contains
> only "first line" and loses the rest of the text. The variable 'length' in
> this case returns 10. The XML comes from an external source and is beyond my
> control. The files are also quite large, some over 60 MB, so I wasn't able to
> use my old method of importing the entire XML file as a string and letting
> digester do the work.
>
> So, is this a known issue, and is there a fix or workaround?
>
> let me know what additional information I need to provide.
>
> Thanks,
>
> -John

Michael Glavassevich
XML Parser Development
IBM Toronto Lab
E-mail: [EMAIL PROTECTED]
