[ https://issues.apache.org/jira/browse/CAMEL-4595?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13139467#comment-13139467 ]
Raul Kripalani edited comment on CAMEL-4595 at 10/29/11 10:57 PM: ------------------------------------------------------------------ Claus - I'm wondering whether it would make sense to look into leveraging libraries such as: * VTD-XML (http://vtd-xml.sourceforge.net) * Nux (http://acs.lbl.gov/software/nux) * Aalto (http://wiki.fasterxml.com/AaltoHome) What do you think? was (Author: raulvk): Claus - I'm wondering whether it would make sense to look into levering libraries such as: * VTD-XML (http://vtd-xml.sourceforge.net) * Nux (http://acs.lbl.gov/software/nux) * Aalto (http://wiki.fasterxml.com/AaltoHome) What do you think? > Add tokenize by pair to be able to split big XML files in streaming mode > ------------------------------------------------------------------------ > > Key: CAMEL-4595 > URL: https://issues.apache.org/jira/browse/CAMEL-4595 > Project: Camel > Issue Type: New Feature > Components: camel-core > Reporter: Claus Ibsen > Assignee: Claus Ibsen > Fix For: 2.9.0 > > > Using XPath to split big XML files is not optimal as the JDK XPath framework > doesn't support streaming mode yet. It may come in the future. > So instead we can introduce a tokenizer which can grab the xml content > between start/end tokens instead. Then we can parse big files with very low > memory footprint. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira