[ https://issues.apache.org/jira/browse/TIKA-2129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Seva Alekseyev updated TIKA-2129:
---------------------------------
    Attachment: 10.1056-NEJMra020100Figure01.ppt

> IllegalArgumentException/"Unknown shape type" on a valid Powerpoint file
> ------------------------------------------------------------------------
>
>                 Key: TIKA-2129
>                 URL: https://issues.apache.org/jira/browse/TIKA-2129
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>         Environment: Windows 7 x64, JVM 1.8.0_101
>            Reporter: Seva Alekseyev
>         Attachments: 10.1056-NEJMra020100Figure01.ppt
>
>
> The following valid Powerpoint file:
> https://dl.dropboxusercontent.com/u/92341073/10.1056-NEJMra020100Figure01.ppt
> when parsed with Tika, throws the following error:
> java.lang.IllegalArgumentException: Unknown shape type: 4095
>     at org.apache.poi.sl.usermodel.ShapeType.forId(ShapeType.java:314)
>     at org.apache.poi.hslf.usermodel.HSLFShapeFactory.createSimpleShape(HSLFShapeFactory.java:98)
>     at org.apache.poi.hslf.usermodel.HSLFShapeFactory.createShape(HSLFShapeFactory.java:62)
>     at org.apache.poi.hslf.usermodel.HSLFSheet.getShapes(HSLFSheet.java:173)
>     at org.apache.tika.parser.microsoft.HSLFExtractor.parse(HSLFExtractor.java:93)
>     at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:149)
>     at org.apache.tika.parser.microsoft.OfficeParser.parse(OfficeParser.java:117)

--
This message was sent by Atlassian JIRA (v6.3.4#6332)