[jira] Commented: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures

2011-01-18 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983447#action_12983447 ] Dennis Adler commented on TIKA-577: --- Maxim, I edited my Classpath file to use the new POI

[jira] Commented: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures

2011-01-18 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12983497#action_12983497 ] Dennis Adler commented on TIKA-577: --- Re-verified that the bug repro's with

[jira] Created: (TIKA-583) Tika 0.8 line break removal is faulty (misses space when concatenating lines) for PDF file

2011-01-13 Thread Dennis Adler (JIRA)
Tika 0.8 line break removal is faulty (misses space when concatenating lines) for PDF file -- Key: TIKA-583 URL: https://issues.apache.org/jira/browse/TIKA-583

[jira] Updated: (TIKA-583) Tika 0.8 line break removal is faulty (misses space when concatenating lines) for PDF file

2011-01-13 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Adler updated TIKA-583: -- Attachment: Savchuk v. Jerde.pdf Original PDF; parsed with tika-app-0.7 and tika-app-0.8 (release).

[jira] Commented: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures

2011-01-12 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-577?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12980861#action_12980861 ] Dennis Adler commented on TIKA-577: --- Just checked out latest POI (Version 1058255, svn

[jira] Created: (TIKA-581) Parser fails on files that parsed with v0.7

2011-01-03 Thread Dennis Adler (JIRA)
Parser fails on files that parsed with v0.7 --- Key: TIKA-581 URL: https://issues.apache.org/jira/browse/TIKA-581 Project: Tika Issue Type: Bug Components: parser Affects Versions: 0.8

[jira] Updated: (TIKA-581) Parser fails on files that parsed with v0.7

2011-01-03 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Adler updated TIKA-581: -- Attachment: Tika-581 files.zip ZIP with the three repro files Parser fails on files that parsed with

[jira] Resolved: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio

2010-12-22 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Adler resolved TIKA-522. --- Resolution: Cannot Reproduce Fix Version/s: 0.8 I'm moving my project to 0.8 and have put in

[jira] Created: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures

2010-12-21 Thread Dennis Adler (JIRA)
IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures -- Key: TIKA-577 URL: https://issues.apache.org/jira/browse/TIKA-577 Project: Tika

[jira] Updated: (TIKA-577) IndexOutOfBounds Exception looking for Picture in Word 03 doc that has no pictures

2010-12-21 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Adler updated TIKA-577: -- Description: When cracking a Word 03 document (which, unfortunately, I cannot upload -- it has

[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio

2010-10-10 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12919685#action_12919685 ] Dennis Adler commented on TIKA-522: --- As soon as I can develop a reliable repro case I would

[jira] Issue Comment Edited: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio

2010-10-05 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12917814#action_12917814 ] Dennis Adler edited comment on TIKA-522 at 10/5/10 3:54 PM: Hi

[jira] Updated: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio

2010-10-05 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dennis Adler updated TIKA-522: -- Attachment: Tika MimeTypes bug repro case.htm Local copy of the network file that is mis-detected.

[jira] Issue Comment Edited: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio

2010-10-05 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12918323#action_12918323 ] Dennis Adler edited comment on TIKA-522 at 10/5/10 7:40 PM: Local

[jira] Issue Comment Edited: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio

2010-10-05 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12918323#action_12918323 ] Dennis Adler edited comment on TIKA-522 at 10/5/10 7:41 PM: Local

[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio

2010-10-05 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12918337#action_12918337 ] Dennis Adler commented on TIKA-522: --- Sigh... I stopped the debug session in Eclipse, set a

[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio

2010-10-04 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12917781#action_12917781 ] Dennis Adler commented on TIKA-522: --- I've further traced the problem. It happens in

[jira] Commented: (TIKA-522) AutoDetectParser treats HTML/XML files as Audio

2010-10-04 Thread Dennis Adler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12917814#action_12917814 ] Dennis Adler commented on TIKA-522: --- Hi Nick, Pardon the alias on the GMail addr... I