Automatic whitespace for block elements in XHTMLContentHandler
--------------------------------------------------------------
Key: TIKA-188
URL: https://issues.apache.org/jira/browse/TIKA-188
Project: Tika
Issue Type: Improvement
Components: parser
Reporter: Jukka Zitting
Priority: Minor
As discussed in TIKA-171, it would be a good idea to make the
XHTMLContentHandler automatically add extra whitespace to separate block level
elements from each other. This would prevent extracted words to accidentally
get concatenated in clients that only care about the character events.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.