[ https://issues.apache.org/jira/browse/TIKA-1048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael McCandless updated TIKA-1048: ------------------------------------- Attachment: TIKA-1048.patch Patch w/ fix ... I added an optional boolean to TextContentHandler to have it add a space character before each element. > XMLParser should add whitespace between elements > ------------------------------------------------ > > Key: TIKA-1048 > URL: https://issues.apache.org/jira/browse/TIKA-1048 > Project: Tika > Issue Type: Bug > Components: parser > Reporter: Michael McCandless > Fix For: 1.3 > > Attachments: TIKA-1048.patch, TIKA-1048.patch > > > If the incoming XML is compact (ie doesn't have whitespace between elements), > I think we should somehow add whitespace between elements when extracting > text? -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira