[jira] [Reopened] (TIKA-973) PDF form data isn't included in extracted content.

2013-12-13 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reopened TIKA-973: -- Assignee: Tim Allison In hindsight, would prefer to use test documents that are unequivocally

Jenkins build is back to normal : Tika-trunk #1050

2013-12-13 Thread Apache Jenkins Server
See https://builds.apache.org/job/Tika-trunk/1050/changes

Jenkins build is back to normal : Tika-trunk ยป Apache Tika application #1050

2013-12-13 Thread Apache Jenkins Server
See https://builds.apache.org/job/Tika-trunk/org.apache.tika$tika-app/1050/

[jira] [Commented] (TIKA-1208) Migrate Any23 mime contributions to Tika

2013-12-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13847729#comment-13847729 ] Lewis John McGibbney commented on TIKA-1208: I've started work on this one to

[jira] [Commented] (TIKA-1208) Migrate Any23 mime contributions to Tika

2013-12-13 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13847734#comment-13847734 ] Lewis John McGibbney commented on TIKA-1208: Hey [~p_ansell], can you please

[jira] [Created] (TIKA-1209) Upgrade Tika tests to JUnit 4.X

2013-12-13 Thread Lewis John McGibbney (JIRA)
Lewis John McGibbney created TIKA-1209: -- Summary: Upgrade Tika tests to JUnit 4.X Key: TIKA-1209 URL: https://issues.apache.org/jira/browse/TIKA-1209 Project: Tika Issue Type:

Support for marks in InputStream passed to Tika.detect

2013-12-13 Thread Lewis John Mcgibbney
Hi, I am wondering whether the concept of 'purifying' [0][1] is something which may be of interest to the detect API in Tika. Basically we have an interface which defines some logic which should be performed prior to MIMEType detection taking place. The only implementation we have right now is a