[ 
https://issues.apache.org/jira/browse/TIKA-712?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13114700#comment-13114700
 ] 

Nick Burch commented on TIKA-712:
---------------------------------

It looks like we only want to exclude the placeholder ones on the layout and 
master slides, and only then if they're not custom

Well, unless there isn't a matching placeholder on the slide itself....

Ideally we'll want to expand POI to have a full model for this. For now, I've 
got something roughly working in POI in XSLFPowerPointExtractor. If the logic 
in there seems ok, we can implement the same in Tika when we move to POI 3.8 
beta 5

> Master slide text isn't extracted
> ---------------------------------
>
>                 Key: TIKA-712
>                 URL: https://issues.apache.org/jira/browse/TIKA-712
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>            Reporter: Michael McCandless
>         Attachments: TIKA-712-master-slide.xml, TIKA-712.patch, 
> testPPT_masterFooter.ppt, testPPT_masterFooter.pptx, 
> testPPT_masterFooter2.ppt, testPPT_masterFooter2.pptx
>
>
> It looks like we are not getting text from the master slide for PPT
> and PPTX.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to