[jira] [Updated] (TIKA-1816) Lenient testing for NamedEntityParser

2016-02-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann updated TIKA-1816: Fix Version/s: (was: 1.12) 1.13 > Lenient testing for

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134502#comment-15134502 ] Tim Allison commented on TIKA-1851: --- Back to normal-ish exceptions. Thank you, [~bobpaulin]! I'll take

[jira] [Comment Edited] (TIKA-1824) Tika 2.0 - Create Initial Parser Modules

2016-02-05 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15133821#comment-15133821 ] Konstantin Gribov edited comment on TIKA-1824 at 2/5/16 8:22 AM: - I'm on

[jira] [Commented] (TIKA-1824) Tika 2.0 - Create Initial Parser Modules

2016-02-05 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15133821#comment-15133821 ] Konstantin Gribov commented on TIKA-1824: - I'm on vacation now, so reveiwed this topic only

[jira] [Comment Edited] (TIKA-1824) Tika 2.0 - Create Initial Parser Modules

2016-02-05 Thread Konstantin Gribov (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15133821#comment-15133821 ] Konstantin Gribov edited comment on TIKA-1824 at 2/5/16 8:24 AM: - I'm on

[jira] [Created] (TIKA-1854) Include the storage class ID of documents embedded in MS Office documents

2016-02-05 Thread Daniel Bonniot de Ruisselet (JIRA)
Daniel Bonniot de Ruisselet created TIKA-1854: - Summary: Include the storage class ID of documents embedded in MS Office documents Key: TIKA-1854 URL: https://issues.apache.org/jira/browse/TIKA-1854

[jira] [Commented] (TIKA-1854) Include the storage class ID of documents embedded in MS Office documents

2016-02-05 Thread Daniel Bonniot de Ruisselet (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15133916#comment-15133916 ] Daniel Bonniot de Ruisselet commented on TIKA-1854: --- By the way, the Content-Type of the

[jira] [Updated] (TIKA-1854) Include the storage class ID of documents embedded in MS Office documents

2016-02-05 Thread Daniel Bonniot de Ruisselet (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Bonniot de Ruisselet updated TIKA-1854: -- Attachment: class-id.patch > Include the storage class ID of documents

[jira] [Commented] (TIKA-1836) Convertion DOC->TXT failed due to POI issue

2016-02-05 Thread Jorge Spinsanti (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134049#comment-15134049 ] Jorge Spinsanti commented on TIKA-1836: --- Great news! Thanks for helping. > Convertion DOC->TXT

[jira] [Reopened] (TIKA-1816) Lenient testing for NamedEntityParser

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reopened TIKA-1816: --- Assignee: (was: Tim Allison) Reopening until this works in 2.x. > Lenient testing for

[jira] [Commented] (TIKA-1854) Include the storage class ID of documents embedded in MS Office documents

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134083#comment-15134083 ] Tim Allison commented on TIKA-1854: --- Will commit shortly. Thank you for the patch and test case! First,

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134070#comment-15134070 ] Tim Allison commented on TIKA-1851: --- Hi Ken, [~thammegowda] is working on making the opennlp test suite

[jira] [Assigned] (TIKA-1854) Include the storage class ID of documents embedded in MS Office documents

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reassigned TIKA-1854: - Assignee: Tim Allison > Include the storage class ID of documents embedded in MS Office documents

[jira] [Resolved] (TIKA-1854) Include the storage class ID of documents embedded in MS Office documents

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved TIKA-1854. --- Resolution: Fixed Committed in trunk and 2.x with small mods. Thank you! > Include the storage class

[jira] [Comment Edited] (TIKA-1854) Include the storage class ID of documents embedded in MS Office documents

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134083#comment-15134083 ] Tim Allison edited comment on TIKA-1854 at 2/5/16 1:37 PM: --- Will commit shortly.

[jira] [Commented] (TIKA-1854) Include the storage class ID of documents embedded in MS Office documents

2016-02-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134197#comment-15134197 ] Hudson commented on TIKA-1854: -- SUCCESS: Integrated in tika-trunk-jdk1.7 #907 (See

[jira] [Comment Edited] (TIKA-1854) Include the storage class ID of documents embedded in MS Office documents

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134083#comment-15134083 ] Tim Allison edited comment on TIKA-1854 at 2/5/16 1:13 PM: --- Will commit shortly.

[jira] [Commented] (TIKA-1854) Include the storage class ID of documents embedded in MS Office documents

2016-02-05 Thread Daniel Bonniot de Ruisselet (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134180#comment-15134180 ] Daniel Bonniot de Ruisselet commented on TIKA-1854: --- The documents I'm processing

[jira] [Commented] (TIKA-1854) Include the storage class ID of documents embedded in MS Office documents

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134252#comment-15134252 ] Tim Allison commented on TIKA-1854: --- Got it. This is very helpful. Thank you. bq. Is the same

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135473#comment-15135473 ] Hudson commented on TIKA-1851: -- FAILURE: Integrated in tika-2.x #21 (See

tika-2.x - Build # 21 - Failure

2016-02-05 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x (build #21) Status: Failure Check console output at https://builds.apache.org/job/tika-2.x/21/ to view the results.

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135483#comment-15135483 ] Lewis John McGibbney commented on TIKA-1851: If you can build locally then can you try a manual

[jira] [Comment Edited] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135480#comment-15135480 ] Tim Allison edited comment on TIKA-1851 at 2/6/16 2:35 AM: --- Build now works

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135480#comment-15135480 ] Tim Allison commented on TIKA-1851: --- Build now works locally for me after manual download of ner models.

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135506#comment-15135506 ] Tim Allison commented on TIKA-1851: --- Ah, ok, thank you. Y, I was inclined to put the test resources in

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135511#comment-15135511 ] Bob Paulin commented on TIKA-1851: -- Ah I was following [~lewismc] lead from any23 using the test-jar. I

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135514#comment-15135514 ] Tim Allison commented on TIKA-1851: --- K. Moving everything back to test now. > Tika 2.0 - Move test

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135521#comment-15135521 ] Lewis John McGibbney commented on TIKA-1851: Ack -- *Lewis* > Tika 2.0 - Move test

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Thamme Gowda N (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135434#comment-15135434 ] Thamme Gowda N commented on TIKA-1851: -- Hi [~talli...@mitre.org] [~chrismattmann] and [~bobpaulin]

[jira] [Reopened] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison reopened TIKA-1851: --- https://builds.apache.org/job/tika-2.x/20/#showFailuresLink For these two: {noformat}

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Thamme Gowda N (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135458#comment-15135458 ] Thamme Gowda N commented on TIKA-1851: -- Shell script was the initial version we had for manual setup

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135475#comment-15135475 ] Lewis John McGibbney commented on TIKA-1851: All dependencies should always come from first

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135490#comment-15135490 ] Tim Allison commented on TIKA-1851: --- Apologies for the following display of ignorance...but I _think_ the

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135496#comment-15135496 ] Lewis John McGibbney commented on TIKA-1851: Can you check which modules failed. If something

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135504#comment-15135504 ] Bob Paulin commented on TIKA-1851: -- I see the issue. It looks like the test-jar was removed so that the

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135526#comment-15135526 ] Tim Allison commented on TIKA-1851: --- Looks like I have to move the MockParserTest out of the

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135544#comment-15135544 ] Tim Allison commented on TIKA-1851: --- So, we're zipping _all_ the test files into a jar, and then each

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135342#comment-15135342 ] Ken Krugler commented on TIKA-1851: --- Hmm, now the top-level build fails on the tika parser text module,

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135336#comment-15135336 ] Ken Krugler commented on TIKA-1851: --- I did a top-level "mvn clean install", which failed with: [ERROR]

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135373#comment-15135373 ] Chris A. Mattmann commented on TIKA-1851: - We could potentially refactor so that we assumeTrue on

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135348#comment-15135348 ] Lewis John McGibbney commented on TIKA-1851: I'm the same Ken. We have been using cTAKES and

[jira] [Comment Edited] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135544#comment-15135544 ] Tim Allison edited comment on TIKA-1851 at 2/6/16 3:52 AM: --- So, we're zipping

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135548#comment-15135548 ] Chris A. Mattmann commented on TIKA-1851: - Yep it may make sense to move the test-docs into sub

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135559#comment-15135559 ] Lewis John McGibbney commented on TIKA-1851: In all honesty, if one takes a step back,

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135550#comment-15135550 ] Tim Allison commented on TIKA-1851: --- K. Just pushed the "undo" moving all tika-test-resources back to

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Bob Paulin (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135551#comment-15135551 ] Bob Paulin commented on TIKA-1851: -- I like the idea of putting the docs in each module. I tried to do

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135589#comment-15135589 ] Hudson commented on TIKA-1851: -- FAILURE: Integrated in tika-2.x #22 (See

tika-2.x - Build # 22 - Still Failing

2016-02-05 Thread Apache Jenkins Server
The Apache Jenkins build system has built tika-2.x (build #22) Status: Still Failing Check console output at https://builds.apache.org/job/tika-2.x/22/ to view the results.

[jira] [Commented] (TIKA-1851) Tika 2.0 - Move test resources from core to test-resources

2016-02-05 Thread Lewis John McGibbney (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135594#comment-15135594 ] Lewis John McGibbney commented on TIKA-1851: Regression in Tika advanced module and also