[jira] [Commented] (TIKA-1706) Bring back commons-io to tika-core

2015-08-15 Thread Yaniv Kunda (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698477#comment-14698477 ] Yaniv Kunda commented on TIKA-1706: --- I've separated all the related changes besides addin

[jira] [Updated] (TIKA-1710) Replace usages of classes in org.apache.tika.io with current alternatives

2015-08-15 Thread Yaniv Kunda (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yaniv Kunda updated TIKA-1710: -- Attachment: TIKA-1710.patch A patch for the described changes > Replace usages of classes in org.apache.

[jira] [Issue Comment Deleted] (TIKA-1706) Bring back commons-io to tika-core

2015-08-15 Thread Yaniv Kunda (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yaniv Kunda updated TIKA-1706: -- Comment: was deleted (was: A patch to bring back commons-io to tika-core and replace all formerly inline

[jira] [Updated] (TIKA-1706) Bring back commons-io to tika-core

2015-08-15 Thread Yaniv Kunda (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yaniv Kunda updated TIKA-1706: -- Attachment: (was: TIKA-1706.patch) > Bring back commons-io to tika-core > ---

[jira] [Created] (TIKA-1710) Replace usages of classes in org.apache.tika.io with current alternatives

2015-08-15 Thread Yaniv Kunda (JIRA)
Yaniv Kunda created TIKA-1710: - Summary: Replace usages of classes in org.apache.tika.io with current alternatives Key: TIKA-1710 URL: https://issues.apache.org/jira/browse/TIKA-1710 Project: Tika

[jira] [Updated] (TIKA-1699) Integrate the GROBID PDF extractor in Tika

2015-08-15 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated TIKA-1699: Attachment: TIKA-1699.restgrobid.MattmannWIP081515.patch.txt - here's a WIP patch to convert

[jira] [Commented] (TIKA-1707) Upgrade to Apache POI 3.13 Beta 2

2015-08-15 Thread Andreas Beeker (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698406#comment-14698406 ] Andreas Beeker commented on TIKA-1707: -- The affected test cases are ok now ... I haven

[jira] [Commented] (TIKA-1706) Bring back commons-io to tika-core

2015-08-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698401#comment-14698401 ] Hudson commented on TIKA-1706: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #826 (See [https://b

[jira] [Commented] (TIKA-1707) Upgrade to Apache POI 3.13 Beta 2

2015-08-15 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1707?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698383#comment-14698383 ] Nick Burch commented on TIKA-1707: -- The build is hopefully working again now. If you could

[jira] [Commented] (TIKA-1699) Integrate the GROBID PDF extractor in Tika

2015-08-15 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698375#comment-14698375 ] Chris A. Mattmann commented on TIKA-1699: - To use this patch, follow the instructio

[jira] [Commented] (TIKA-1699) Integrate the GROBID PDF extractor in Tika

2015-08-15 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698371#comment-14698371 ] Nick Burch commented on TIKA-1699: -- {quote}Tika-app is ~48MB it seems so closer to 30% act

[jira] [Comment Edited] (TIKA-1699) Integrate the GROBID PDF extractor in Tika

2015-08-15 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698161#comment-14698161 ] Chris A. Mattmann edited comment on TIKA-1699 at 8/15/15 5:46 PM: ---

[jira] [Updated] (TIKA-1699) Integrate the GROBID PDF extractor in Tika

2015-08-15 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated TIKA-1699: Attachment: TIKA-1699.grobid-core.MattmannShah.081515.patch.txt - here's the patch that Nick

[jira] [Commented] (TIKA-1699) Integrate the GROBID PDF extractor in Tika

2015-08-15 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698340#comment-14698340 ] Chris A. Mattmann commented on TIKA-1699: - All filed issues to publish all grobid-c

[jira] [Commented] (TIKA-1699) Integrate the GROBID PDF extractor in Tika

2015-08-15 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698317#comment-14698317 ] Hudson commented on TIKA-1699: -- FAILURE: Integrated in tika-trunk-jdk1.7 #825 (See [https://b

tika-trunk-jdk1.7 - Build # 825 - Still Failing

2015-08-15 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-trunk-jdk1.7 (build #825). Status: Still Failing. Check console output at https://builds.apache.org/job/tika-trunk-jdk1.7/825/ to view the results.

[jira] [Commented] (TIKA-1699) Integrate the GROBID PDF extractor in Tika

2015-08-15 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698315#comment-14698315 ] Chris A. Mattmann commented on TIKA-1699: - bq. I've tried to exclude the grobid tra

[jira] [Comment Edited] (TIKA-1706) Bring back commons-io to tika-core

2015-08-15 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698313#comment-14698313 ] Uwe Schindler edited comment on TIKA-1706 at 8/15/15 3:19 PM: --

[jira] [Commented] (TIKA-1706) Bring back commons-io to tika-core

2015-08-15 Thread Uwe Schindler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698313#comment-14698313 ] Uwe Schindler commented on TIKA-1706: - Yes, you can add the maven property {{false" to

[jira] [Commented] (TIKA-1706) Bring back commons-io to tika-core

2015-08-15 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698307#comment-14698307 ] Nick Burch commented on TIKA-1706: -- [~thetaphi] We currently have the forbidden apis check

[jira] [Commented] (TIKA-1699) Integrate the GROBID PDF extractor in Tika

2015-08-15 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698304#comment-14698304 ] Nick Burch commented on TIKA-1699: -- I've tried to exclude the grobid transient dependencie

Re: [DISCUSS] A more modular parser project

2015-08-15 Thread Bob Paulin
Hi, So just to understand the breakdowns. When you say: tika-classic-parser-bundle/ Tika-office-parser-bundle/ (including microsoft, opendocument, pst, rtf, iwork? Has dependency on html/text) Tika-pdf-parser-bundle/ Tika-text-parser-bundle (including txt, chm, rfc822,

tika-trunk-jdk1.7 - Build # 824 - Still Failing

2015-08-15 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-trunk-jdk1.7 (build #824). Status: Still Failing. Check console output at https://builds.apache.org/job/tika-trunk-jdk1.7/824/ to view the results.

[jira] [Reopened] (TIKA-1699) Integrate the GROBID PDF extractor in Tika

2015-08-15 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch reopened TIKA-1699: -- > Integrate the GROBID PDF extractor in Tika > -- > >

[jira] [Commented] (TIKA-1699) Integrate the GROBID PDF extractor in Tika

2015-08-15 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14698161#comment-14698161 ] Nick Burch commented on TIKA-1699: -- A build from trunk is now failing for me: {code} [ERRO