[
https://issues.apache.org/jira/browse/TIKA-1859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15151036#comment-15151036
]
Tim Allison commented on TIKA-1859:
-----------------------------------
Early diagnosis (could be wrong)...This xlsx file includes a namespace in the
{{qname}} ("x:row", "x:c", "x:v").
POI's XSSFSheetXMLHandler tests literal string equality and is looking for
"row" or "c" in {{startElement}} and elsewhere.
If this is correct, this is a serious problem with XSSFSheetXMLHandler. I'll
kick the tires some more and then open an issue in POI if I can confirm this.
> file poi reads tika does not bring the content
> ----------------------------------------------
>
> Key: TIKA-1859
> URL: https://issues.apache.org/jira/browse/TIKA-1859
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 1.12
> Reporter: Movses
> Priority: Blocker
> Attachments: testing.Xlsx
>
>
> I have a file xlsx I'm able to read and process in using poi but in tika it
> does not extract the content of the file
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)