[jira] Resolved: (TIKA-586) Parsing a ms access file (*.mdb) throws an error

2011-01-18 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-586. - Resolution: Fixed > Parsing a ms access file (*.mdb) throws an error >

[jira] Commented: (TIKA-586) Parsing a ms access file (*.mdb) throws an error

2011-01-18 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-586?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983205#action_12983205 ] Nick Burch commented on TIKA-586: - Thanks for this. I made a slight tweak to the patch to put

[jira] Resolved: (TIKA-416) Out-of-process text extraction

2011-01-18 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-416. Resolution: Fixed Fix Version/s: 0.9 Assignee: Jukka Zitting An initial version of th

[jira] Issue Comment Edited: (TIKA-416) Out-of-process text extraction

2011-01-18 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983229#action_12983229 ] Jukka Zitting edited comment on TIKA-416 at 1/18/11 10:35 AM: -- A

[jira] Commented: (TIKA-416) Out-of-process text extraction

2011-01-18 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-416?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983233#action_12983233 ] Chris A. Mattmann commented on TIKA-416: Awesome job Jukka! > Out-of-process text ex

[jira] Created: (TIKA-587) NullPointerException in OutlookExtractor on missing chunks

2011-01-18 Thread Tom Klonikowski (JIRA)
NullPointerException in OutlookExtractor on missing chunks -- Key: TIKA-587 URL: https://issues.apache.org/jira/browse/TIKA-587 Project: Tika Issue Type: Bug Components: parse

[jira] Commented: (TIKA-567) Temporary file leak in TikaInputStream

2011-01-18 Thread David Benson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983395#action_12983395 ] David Benson commented on TIKA-567: --- We upgraded from Tika 0.8 to 0.9-SNAPSHOT because of T

[jira] Commented: (TIKA-548) PDF content extracted as single line

2011-01-18 Thread Paul Pearcy (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-548?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983411#action_12983411 ] Paul Pearcy commented on TIKA-548: -- Just wanted to say that I don't believe there is a stabl

[jira] Commented: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures

2011-01-18 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983447#action_12983447 ] Dennis Adler commented on TIKA-577: --- Maxim, I edited my Classpath file to use the new POI b

[jira] Commented: (TIKA-583) Tika 0.8 line break removal is faulty (misses space when concatenating lines) for PDF file

2011-01-18 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983454#action_12983454 ] Dennis Adler commented on TIKA-583: --- Ken, I tried replacing the 3 PDFBox 1.3.1 JARs (fontbo

[jira] Commented: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures

2011-01-18 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983497#action_12983497 ] Dennis Adler commented on TIKA-577: --- Re-verified that the bug repro's with 3.8-beta1-201101

[jira] Updated: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures

2011-01-18 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Adler updated TIKA-577: -- Attachment: X'd Out Doc for Tika.doc Here's the Word document that causes the exception. After my hex-edi