[jira] Resolved: (TIKA-583) Tika 0.8 line break removal is faulty (misses space when concatenating lines) for PDF file

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-583. Resolution: Duplicate Assignee: Jukka Zitting This is a duplicate of TIKA-548, fixed in trunk.

[jira] Resolved: (TIKA-567) Temporary file leak in TikaInputStream

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-567. Resolution: Fixed I've added a TemporaryFiles class that can be used with TikaInputStream when the c

[jira] Resolved: (TIKA-587) NullPointerException in OutlookExtractor on missing chunks

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-587. Resolution: Fixed Fix Version/s: 0.9 Assignee: Jukka Zitting Good point, thanks! Fixe

[jira] Resolved: (TIKA-585) AudioParser Fails with NPE on fileFormat.properties

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-585?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-585. Resolution: Fixed Fix Version/s: 0.9 Assignee: Jukka Zitting Good point, thanks! Fixe

[jira] Resolved: (TIKA-584) Tika parse of some PDF files removes all spaces between words

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-584. Resolution: Duplicate Assignee: Jukka Zitting Like TIKA-583, this is a duplicate of TIKA-548, f

[jira] Commented: (TIKA-375) Improve code quality metrics

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-375?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983695#action_12983695 ] Jukka Zitting commented on TIKA-375: Thanks! Patch committed in revision 1060801. > Impr

[jira] Resolved: (TIKA-582) Lithuanian language identification

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-582. Resolution: Fixed Fix Version/s: 0.9 Assignee: Jukka Zitting (was: Ken Krugler) Than

[jira] Commented: (TIKA-576) OutofMemory issues while building Tika

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983702#action_12983702 ] Jukka Zitting commented on TIKA-576: The bundle plugin we're using to produce the tika-ap

[jira] Commented: (TIKA-581) Parser fails on files that parsed with v0.7

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983711#action_12983711 ] Jukka Zitting commented on TIKA-581: The HTML parsing problem is caused by double close t

[jira] Resolved: (TIKA-578) XMLParser ContentHandler: multiple endDocument calls

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-578. Resolution: Fixed Fix Version/s: 0.9 Assignee: Jukka Zitting Good point, thanks! Fixe

[jira] Resolved: (TIKA-551) Unit test failures in org.apache.tika.parser.image.ImageParserTest on JDK 1.6.0_05

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-551. Resolution: Won't Fix Agreed, resolved as Won't Fix. > Unit test failures in org.apache.tika.parser.

[jira] Commented: (TIKA-567) Temporary file leak in TikaInputStream

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983784#action_12983784 ] Jukka Zitting commented on TIKA-567: To clarify, there's normally no need to use the Temp

[jira] Commented: (TIKA-567) Temporary file leak in TikaInputStream

2011-01-19 Thread David Benson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983789#action_12983789 ] David Benson commented on TIKA-567: Still seeing some temporary files get created. I've up

[jira] Commented: (TIKA-567) Temporary file leak in TikaInputStream

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983799#action_12983799 ] Jukka Zitting commented on TIKA-567: These +~JFxxx.tmp files are a result of a bug in jav

[jira] Commented: (TIKA-567) Temporary file leak in TikaInputStream

2011-01-19 Thread David Benson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983802#action_12983802 ] David Benson commented on TIKA-567: We're running: java version "1.6.0_21" Java(TM) SE Run

[jira] Commented: (TIKA-567) Temporary file leak in TikaInputStream

2011-01-19 Thread David Benson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983834#action_12983834 ] David Benson commented on TIKA-567: Still seeing the temporary files getting created on la

[jira] Commented: (TIKA-567) Temporary file leak in TikaInputStream

2011-01-19 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983841#action_12983841 ] Jukka Zitting commented on TIKA-567: Can you check if files stay around also after you te

[jira] Reopened: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures

2011-01-19 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Adler reopened TIKA-577: Reopened after I posted the sample file that repro's the bug > IndexOutOfBounds Exception looking for Pict