[ 
https://issues.apache.org/jira/browse/TIKA-188?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12664309#action_12664309
 ] 

Uwe Schindler commented on TIKA-188:
------------------------------------

Nice work!
Uwe

> Automatic whitespace for block elements in XHTMLContentHandler
> --------------------------------------------------------------
>
>                 Key: TIKA-188
>                 URL: https://issues.apache.org/jira/browse/TIKA-188
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>            Reporter: Jukka Zitting
>            Assignee: Jukka Zitting
>            Priority: Minor
>             Fix For: 0.3
>
>
> As discussed in TIKA-171, it would be a good idea to make the 
> XHTMLContentHandler automatically add extra whitespace to separate block 
> level elements from each other. This would prevent extracted words to 
> accidentally get concatenated in clients that only care about the character 
> events.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to