[
https://issues.apache.org/jira/browse/TIKA-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Chris A. Mattmann updated TIKA-1816:
Fix Version/s: (was: 1.12)
1.13
> Lenient testing for
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134502#comment-15134502
]
Tim Allison commented on TIKA-1851:
---
Back to normal-ish exceptions. Thank you, [~bobpaulin]! I'll take
[
https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15133821#comment-15133821
]
Konstantin Gribov edited comment on TIKA-1824 at 2/5/16 8:22 AM:
-
I'm on
[
https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15133821#comment-15133821
]
Konstantin Gribov commented on TIKA-1824:
-
I'm on vacation now, so reveiwed this topic only
[
https://issues.apache.org/jira/browse/TIKA-1824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15133821#comment-15133821
]
Konstantin Gribov edited comment on TIKA-1824 at 2/5/16 8:24 AM:
-
I'm on
Daniel Bonniot de Ruisselet created TIKA-1854:
-
Summary: Include the storage class ID of documents embedded in MS
Office documents
Key: TIKA-1854
URL: https://issues.apache.org/jira/browse/TIKA-1854
[
https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15133916#comment-15133916
]
Daniel Bonniot de Ruisselet commented on TIKA-1854:
---
By the way, the Content-Type of the
[
https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Daniel Bonniot de Ruisselet updated TIKA-1854:
--
Attachment: class-id.patch
> Include the storage class ID of documents
[
https://issues.apache.org/jira/browse/TIKA-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134049#comment-15134049
]
Jorge Spinsanti commented on TIKA-1836:
---
Great news! Thanks for helping.
> Convertion DOC->TXT
[
https://issues.apache.org/jira/browse/TIKA-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reopened TIKA-1816:
---
Assignee: (was: Tim Allison)
Reopening until this works in 2.x.
> Lenient testing for
[
https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134083#comment-15134083
]
Tim Allison commented on TIKA-1854:
---
Will commit shortly. Thank you for the patch and test case!
First,
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134070#comment-15134070
]
Tim Allison commented on TIKA-1851:
---
Hi Ken,
[~thammegowda] is working on making the opennlp test suite
[
https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reassigned TIKA-1854:
-
Assignee: Tim Allison
> Include the storage class ID of documents embedded in MS Office documents
[
https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison resolved TIKA-1854.
---
Resolution: Fixed
Committed in trunk and 2.x with small mods. Thank you!
> Include the storage class
[
https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134083#comment-15134083
]
Tim Allison edited comment on TIKA-1854 at 2/5/16 1:37 PM:
---
Will commit shortly.
[
https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134197#comment-15134197
]
Hudson commented on TIKA-1854:
--
SUCCESS: Integrated in tika-trunk-jdk1.7 #907 (See
[
https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134083#comment-15134083
]
Tim Allison edited comment on TIKA-1854 at 2/5/16 1:13 PM:
---
Will commit shortly.
[
https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134180#comment-15134180
]
Daniel Bonniot de Ruisselet commented on TIKA-1854:
---
The documents I'm processing
[
https://issues.apache.org/jira/browse/TIKA-1854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15134252#comment-15134252
]
Tim Allison commented on TIKA-1854:
---
Got it. This is very helpful. Thank you.
bq. Is the same
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135473#comment-15135473
]
Hudson commented on TIKA-1851:
--
FAILURE: Integrated in tika-2.x #21 (See
The Apache Jenkins build system has built tika-2.x (build #21)
Status: Failure
Check console output at https://builds.apache.org/job/tika-2.x/21/ to view the
results.
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135483#comment-15135483
]
Lewis John McGibbney commented on TIKA-1851:
If you can build locally then can you try a manual
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135480#comment-15135480
]
Tim Allison edited comment on TIKA-1851 at 2/6/16 2:35 AM:
---
Build now works
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135480#comment-15135480
]
Tim Allison commented on TIKA-1851:
---
Build now works locally for me after manual download of ner models.
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135506#comment-15135506
]
Tim Allison commented on TIKA-1851:
---
Ah, ok, thank you. Y, I was inclined to put the test resources in
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135511#comment-15135511
]
Bob Paulin commented on TIKA-1851:
--
Ah I was following [~lewismc] lead from any23 using the test-jar. I
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135514#comment-15135514
]
Tim Allison commented on TIKA-1851:
---
K. Moving everything back to test now.
> Tika 2.0 - Move test
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135521#comment-15135521
]
Lewis John McGibbney commented on TIKA-1851:
Ack
--
*Lewis*
> Tika 2.0 - Move test
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135434#comment-15135434
]
Thamme Gowda N commented on TIKA-1851:
--
Hi [~talli...@mitre.org] [~chrismattmann] and [~bobpaulin]
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison reopened TIKA-1851:
---
https://builds.apache.org/job/tika-2.x/20/#showFailuresLink
For these two:
{noformat}
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135458#comment-15135458
]
Thamme Gowda N commented on TIKA-1851:
--
Shell script was the initial version we had for manual setup
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135475#comment-15135475
]
Lewis John McGibbney commented on TIKA-1851:
All dependencies should always come from first
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135490#comment-15135490
]
Tim Allison commented on TIKA-1851:
---
Apologies for the following display of ignorance...but I _think_ the
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135496#comment-15135496
]
Lewis John McGibbney commented on TIKA-1851:
Can you check which modules failed. If something
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135504#comment-15135504
]
Bob Paulin commented on TIKA-1851:
--
I see the issue. It looks like the test-jar was removed so that the
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135526#comment-15135526
]
Tim Allison commented on TIKA-1851:
---
Looks like I have to move the MockParserTest out of the
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135544#comment-15135544
]
Tim Allison commented on TIKA-1851:
---
So, we're zipping _all_ the test files into a jar, and then each
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135342#comment-15135342
]
Ken Krugler commented on TIKA-1851:
---
Hmm, now the top-level build fails on the tika parser text module,
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135336#comment-15135336
]
Ken Krugler commented on TIKA-1851:
---
I did a top-level "mvn clean install", which failed with:
[ERROR]
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135373#comment-15135373
]
Chris A. Mattmann commented on TIKA-1851:
-
We could potentially refactor so that we assumeTrue on
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135348#comment-15135348
]
Lewis John McGibbney commented on TIKA-1851:
I'm the same Ken. We have been using cTAKES and
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135544#comment-15135544
]
Tim Allison edited comment on TIKA-1851 at 2/6/16 3:52 AM:
---
So, we're zipping
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135548#comment-15135548
]
Chris A. Mattmann commented on TIKA-1851:
-
Yep it may make sense to move the test-docs into sub
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135559#comment-15135559
]
Lewis John McGibbney commented on TIKA-1851:
In all honesty, if one takes a step back,
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135550#comment-15135550
]
Tim Allison commented on TIKA-1851:
---
K. Just pushed the "undo" moving all tika-test-resources back to
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135551#comment-15135551
]
Bob Paulin commented on TIKA-1851:
--
I like the idea of putting the docs in each module. I tried to do
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135589#comment-15135589
]
Hudson commented on TIKA-1851:
--
FAILURE: Integrated in tika-2.x #22 (See
The Apache Jenkins build system has built tika-2.x (build #22)
Status: Still Failing
Check console output at https://builds.apache.org/job/tika-2.x/22/ to view the
results.
[
https://issues.apache.org/jira/browse/TIKA-1851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15135594#comment-15135594
]
Lewis John McGibbney commented on TIKA-1851:
Regression in Tika advanced module and also
49 matches
Mail list logo