[ 
https://issues.apache.org/jira/browse/TIKA-3164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17456737#comment-17456737
 ] 

Tim Allison commented on TIKA-3164:
-----------------------------------

Y. Thank you [~pj.fanning]!  That's exactly it.  I can fix it on the Tika side 
for now by copy/pasting XSSFSheetXMLHandler.

Three other points of interest:

1)  I'm getting this on quite a few files in our regression set.  Warnings are 
great, but is something else going on? org.apache.poi.hpsf.CodePageString 
String terminator (\0) for CodePageString property value occurred before the 
end of string. Trimming and hope for the best. 

2) I'm getting a lot of these warnings.  Should we be checking if an entry is a 
directory before adding them to the parts list: 
org.apache.poi.openxml4j.exceptions.InvalidFormatException: A part name shall 
not have a forward slash as the last character [M1.5]: /word/_rels/

3) How can I avoid this and make sure that we are not vulnerable to xxe? 
org.apache.poi.util.XMLHelper SAX Feature unsupported [log suppressed for 5 
minutes]http://javax.xml.XMLConstants/property/accessExternalSchema
java.lang.IllegalArgumentException: Property 
'http://javax.xml.XMLConstants/property/accessExternalSchema' is not recognized.

> Upgrade to POI 5.0.0 when available
> -----------------------------------
>
>                 Key: TIKA-3164
>                 URL: https://issues.apache.org/jira/browse/TIKA-3164
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to