Tika can't parse XLSX when build with latest POI trunk version
--------------------------------------------------------------
Key: TIKA-348
URL: https://issues.apache.org/jira/browse/TIKA-348
Project: Tika
Issue Type: Bug
Affects Versions: 0.6
Reporter: Maxim Valyanskiy
Attachments: TIKA-348.patch
OOXMLParserTest fails:
org.apache.tika.exception.TikaException: Unexpected RuntimeException from
org.apache.tika.parser.microsoft.ooxml.ooxmlpar...@82d37
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:122)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:101)
at
org.apache.tika.parser.AutoDetectParser.parse(AutoDetectParser.java:114)
at
org.apache.tika.parser.microsoft.ooxml.OOXMLParserTest.testExcel(OOXMLParserTest.java:43)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at
com.intellij.rt.execution.junit.JUnitStarter.main(JUnitStarter.java:40)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at com.intellij.rt.execution.application.AppMain.main(AppMain.java:90)
Caused by: java.lang.IllegalStateException: Cannot get a text value from a
numeric formula cell
at
org.apache.poi.xssf.usermodel.XSSFCell.typeMismatch(XSSFCell.java:781)
at
org.apache.poi.xssf.usermodel.XSSFCell.checkFormulaCachedValueType(XSSFCell.java:286)
at
org.apache.poi.xssf.usermodel.XSSFCell.getRichStringCellValue(XSSFCell.java:274)
at
org.apache.poi.xssf.usermodel.XSSFCell.getRichStringCellValue(XSSFCell.java:63)
at
org.apache.tika.parser.microsoft.ooxml.XSSFExcelExtractorDecorator.buildXHTML(XSSFExcelExtractorDecorator.java:72)
at
org.apache.tika.parser.microsoft.ooxml.AbstractOOXMLExtractor.getXHTML(AbstractOOXMLExtractor.java:69)
at
org.apache.tika.parser.microsoft.ooxml.OOXMLParser.parse(OOXMLParser.java:49)
at
org.apache.tika.parser.CompositeParser.parse(CompositeParser.java:120)
... 26 more
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.