[ https://issues.apache.org/jira/browse/TIKA-3164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17456737#comment-17456737 ]
Tim Allison commented on TIKA-3164: ----------------------------------- Y. Thank you [~pj.fanning]! That's exactly it. I can fix it on the Tika side for now by copy/pasting XSSFSheetXMLHandler. Three other points of interest: 1) I'm getting this on quite a few files in our regression set. Warnings are great, but is something else going on? org.apache.poi.hpsf.CodePageString String terminator (\0) for CodePageString property value occurred before the end of string. Trimming and hope for the best. 2) I'm getting a lot of these warnings. Should we be checking if an entry is a directory before adding them to the parts list: org.apache.poi.openxml4j.exceptions.InvalidFormatException: A part name shall not have a forward slash as the last character [M1.5]: /word/_rels/ 3) How can I avoid this and make sure that we are not vulnerable to xxe? org.apache.poi.util.XMLHelper SAX Feature unsupported [log suppressed for 5 minutes]http://javax.xml.XMLConstants/property/accessExternalSchema java.lang.IllegalArgumentException: Property 'http://javax.xml.XMLConstants/property/accessExternalSchema' is not recognized. > Upgrade to POI 5.0.0 when available > ----------------------------------- > > Key: TIKA-3164 > URL: https://issues.apache.org/jira/browse/TIKA-3164 > Project: Tika > Issue Type: Task > Reporter: Tim Allison > Priority: Major > -- This message was sent by Atlassian Jira (v8.20.1#820001)