BoilerpipeContentHandler needs to emit full set of standard elements
--------------------------------------------------------------------

                 Key: TIKA-480
                 URL: https://issues.apache.org/jira/browse/TIKA-480
             Project: Tika
          Issue Type: Bug
    Affects Versions: 0.7
            Reporter: Ken Krugler
            Assignee: Ken Krugler
             Fix For: 0.8


Currently BoilerpipeContentHandler will call the provided delegate 
ContentHandler with:

<p>xxx</p>

for each block of text. But without the wrappers around these elements, things 
like BodyContentHandler can't be used.

In addition, current BoilerpipeContentHandler emits a <p> element with a null 
attributes value, which will cause a NPE for BodyContentHandler.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to