[ 
https://issues.apache.org/jira/browse/TIKA-1748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14935840#comment-14935840
 ] 

Andreas Beeker commented on TIKA-1748:
--------------------------------------

at HSLFExtractor:
- as the extraction is specific to HSLF, I would use the HSLF and not the 
common sl classes.
- in my patch (tika-1707) the method textRunsToText wraps paragraphs in divs 
... I don't know how the post-processing of the output works, but I wanted to 
point out, that you don't deal with just a list of textruns, but with separate 
paragraphs having textruns

at PowerPointParserTest:
- the above handling leads to a change in the tests

at XSLFPowerPointExtractorDecorator:
- use XSLF classes instead of common sl
- the new extractTable method is better

Andi.


> Upgrade to POI 3.13-final when available
> ----------------------------------------
>
>                 Key: TIKA-1748
>                 URL: https://issues.apache.org/jira/browse/TIKA-1748
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Minor
>         Attachments: TIKA-1748.patch
>
>
> Upgrade.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to