[ 
https://issues.apache.org/jira/browse/TIKA-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15151036#comment-15151036
 ] 

Tim Allison commented on TIKA-1859:
-----------------------------------

Early diagnosis (could be wrong)...This xlsx file includes a namespace in the 
{{qname}} ("x:row", "x:c", "x:v").

POI's XSSFSheetXMLHandler tests literal string equality and is looking for 
"row" or "c" in {{startElement}} and elsewhere.

If this is correct, this is a serious problem with XSSFSheetXMLHandler.  I'll 
kick the tires some more and then open an issue in POI if I can confirm this.

> file poi reads tika does not bring the content
> ----------------------------------------------
>
>                 Key: TIKA-1859
>                 URL: https://issues.apache.org/jira/browse/TIKA-1859
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.12
>            Reporter: Movses
>            Priority: Blocker
>         Attachments: testing.Xlsx
>
>
> I have a file xlsx I'm able to read and process in using poi but in tika it 
> does not extract the content of the file



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to