[jira] [Commented] (TIKA-521) OutOfMemoryError Parsing XSLX File

2011-05-26 Thread Maxim Valyanskiy (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13039609#comment-13039609 ] Maxim Valyanskiy commented on TIKA-521: --- Tika from trunk with POI from trunk parses th

[jira] [Commented] (TIKA-521) OutOfMemoryError Parsing XSLX File

2011-05-26 Thread Maxim Valyanskiy (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13039620#comment-13039620 ] Maxim Valyanskiy commented on TIKA-521: --- Sorry, I missed screenshot with stack trace.

Re: Towards 1.0

2011-05-26 Thread Steve Aulenbach
Hi Chris, I think your plan to improve the netCDF and HDF parsing is a great one. The richness of a full ncdump of netCDF metadata and a full ncdump HDF-EOS metadata would be an excellent addition to the 1.0 release of Tika. I have discussed Tika to several science data user and they usually ask

[jira] [Commented] (TIKA-521) OutOfMemoryError Parsing XSLX File

2011-05-26 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-521?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13039719#comment-13039719 ] Ken Krugler commented on TIKA-521: -- Tika CLI uses BoilerpipeContentHandler in regular (don'

[jira] [Commented] (TIKA-288) Support override parsers in AutoDetectParser

2011-05-26 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13039771#comment-13039771 ] Ken Krugler commented on TIKA-288: -- What about: AutoDetectParser.override(Class clazz, Par