[jira] [Updated] (TIKA-605) Tika GDAL parser

2014-10-10 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated TIKA-605: --- Attachment: TIKA-605.Mattmann.100914.2.patch.txt - ok here is a fully working complete test.

Review Request 26542: Tika GDAL parser

2014-10-10 Thread Chris Mattmann
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/26542/ --- Review request for tika, Lewis McGibbney and Tyler Palsulich. Bugs: TIKA-605

[jira] [Commented] (TIKA-605) Tika GDAL parser

2014-10-10 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14166452#comment-14166452 ] Chris A. Mattmann commented on TIKA-605: https://reviews.apache.org/r/26542 Tika

[jira] [Commented] (TIKA-1427) PDF Images don't appear in structured view

2014-10-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14166714#comment-14166714 ] Tim Allison commented on TIKA-1427: --- [~tilman], well, sure, but you actually know what

[jira] [Created] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-10-10 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1442: - Summary: Upgrade to PDFBox 1.8.8 Key: TIKA-1442 URL: https://issues.apache.org/jira/browse/TIKA-1442 Project: Tika Issue Type: Improvement Reporter:

[jira] [Commented] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-10-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14166717#comment-14166717 ] Tim Allison commented on TIKA-1442: --- Thank you [~tilman]! Upgrade to PDFBox 1.8.8

[jira] [Created] (TIKA-1443) Add a junk text detector to Tika

2014-10-10 Thread Tim Allison (JIRA)
Tim Allison created TIKA-1443: - Summary: Add a junk text detector to Tika Key: TIKA-1443 URL: https://issues.apache.org/jira/browse/TIKA-1443 Project: Tika Issue Type: Wish Reporter:

[jira] [Commented] (TIKA-1419) Upgrade to PDFBox 1.8.7

2014-10-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1419?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14166736#comment-14166736 ] Tim Allison commented on TIKA-1419: --- Let's move discussion over to TIKA-1442. I also

[jira] [Commented] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-10-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14166755#comment-14166755 ] Tim Allison commented on TIKA-1442: --- This is in response to our discussion on TIKA-1419.

[jira] [Commented] (TIKA-1435) Update rome dependency to 1.5

2014-10-10 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14166894#comment-14166894 ] Chris A. Mattmann commented on TIKA-1435: - Thanks [~jotomo] just let me know if you

[jira] [Commented] (TIKA-1422) org.apache.tika.parser.mail.RFC822ParserTest fails

2014-10-10 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1422?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14166903#comment-14166903 ] Chris A. Mattmann commented on TIKA-1422: - ok figured it out. It was _this_ part of

[jira] [Commented] (TIKA-1302) Let's run Tika against a large batch of docs nightly

2014-10-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14166931#comment-14166931 ] Tim Allison commented on TIKA-1302: --- I just transitioned development on TIKA-1302

[jira] [Commented] (TIKA-1435) Update rome dependency to 1.5

2014-10-10 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14167014#comment-14167014 ] Johannes Mockenhaupt commented on TIKA-1435: Hey Chris, I have both variants

[jira] [Commented] (TIKA-1435) Update rome dependency to 1.5

2014-10-10 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14167016#comment-14167016 ] Johannes Mockenhaupt commented on TIKA-1435: (See the attached diff for

[jira] [Comment Edited] (TIKA-1435) Update rome dependency to 1.5

2014-10-10 Thread Johannes Mockenhaupt (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14167016#comment-14167016 ] Johannes Mockenhaupt edited comment on TIKA-1435 at 10/10/14 3:52 PM:

[jira] [Commented] (TIKA-1442) Upgrade to PDFBox 1.8.8

2014-10-10 Thread Tilman Hausherr (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1442?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14167194#comment-14167194 ] Tilman Hausherr commented on TIKA-1442: --- Do you want the junk list in some format?