[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14272570#comment-14272570
]
Chris A. Mattmann commented on TIKA-1445:
-
yeesh, caught up on all this great work.
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271201#comment-14271201
]
Tim Allison commented on TIKA-1445:
---
No major problems found via quick and dirty govdocs1
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271266#comment-14271266
]
Tim Allison commented on TIKA-1445:
---
Might have been neater, but you figured out how to
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14271599#comment-14271599
]
Nick Burch commented on TIKA-1445:
--
Please open a ticket for the Excel 3 issue, and if you
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14269765#comment-14269765
]
Nick Burch commented on TIKA-1445:
--
If we're going to close this for 1.7, then we need to
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14269768#comment-14269768
]
Tim Allison commented on TIKA-1445:
---
Completely agree! Opening new issues now.
Figure
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14269800#comment-14269800
]
Tyler Palsulich commented on TIKA-1445:
---
Thanks guys! [~tallison], let me know once
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14269454#comment-14269454
]
Tim Allison commented on TIKA-1445:
---
I'll have time to rerun trunk against govdocs1 and
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267553#comment-14267553
]
Nick Burch commented on TIKA-1445:
--
I wonder if it wouldn't be better to do the is
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267584#comment-14267584
]
Nick Burch commented on TIKA-1445:
--
As of r1650051, I think we're correctly handling the
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267586#comment-14267586
]
Hudson commented on TIKA-1445:
--
UNSTABLE: Integrated in tika-trunk-jdk1.7 #411 (See
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267643#comment-14267643
]
Nick Burch commented on TIKA-1445:
--
Ah, true, I hadn't thought so much about the system
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267756#comment-14267756
]
Nick Burch commented on TIKA-1445:
--
I've no idea why the fork parser is failing when run
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267626#comment-14267626
]
Hudson commented on TIKA-1445:
--
UNSTABLE: Integrated in tika-trunk-jdk1.7 #412 (See
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267766#comment-14267766
]
Tim Allison commented on TIKA-1445:
---
Y, and why did the tests work before and how does it
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267773#comment-14267773
]
Nick Burch commented on TIKA-1445:
--
The only other parser that uses ExternalParser is
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267786#comment-14267786
]
Luis Filipe Nassif commented on TIKA-1445:
--
It is not related directly to this
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267792#comment-14267792
]
Nick Burch commented on TIKA-1445:
--
[~lfcnassif] Longer term we'll have different config
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267724#comment-14267724
]
Tim Allison commented on TIKA-1445:
---
Not to repeat Jenkins, well, apologies for repeating
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267854#comment-14267854
]
Tim Allison commented on TIKA-1445:
---
[~gagravarr], see if you have success with r1650117.
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267871#comment-14267871
]
Hudson commented on TIKA-1445:
--
SUCCESS: Integrated in tika-trunk-jdk1.6 #399 (See
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267879#comment-14267879
]
Tyler Palsulich commented on TIKA-1445:
---
All tests pass with and without Tesseract
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267840#comment-14267840
]
Tim Allison commented on TIKA-1445:
---
Fixed the tika-server test failure with r1650111.
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14268021#comment-14268021
]
Hudson commented on TIKA-1445:
--
SUCCESS: Integrated in tika-trunk-jdk1.6 #401 (See
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14268003#comment-14268003
]
Hudson commented on TIKA-1445:
--
SUCCESS: Integrated in tika-trunk-jdk1.7 #416 (See
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14268006#comment-14268006
]
Tyler Palsulich commented on TIKA-1445:
---
Done. I made some small changes and split
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267618#comment-14267618
]
Tim Allison commented on TIKA-1445:
---
Yes, that's a great idea. I was disturbed by the
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267892#comment-14267892
]
Tim Allison commented on TIKA-1445:
---
Thank you! Do you mind doing a quick code review of
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267934#comment-14267934
]
Hudson commented on TIKA-1445:
--
SUCCESS: Integrated in tika-trunk-jdk1.7 #415 (See
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14267161#comment-14267161
]
Tim Allison commented on TIKA-1445:
---
Looking into this a bit more...we aren't even
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14252952#comment-14252952
]
Nick Burch commented on TIKA-1445:
--
For 1.7, how about we just have the Tesseract Parser
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14252956#comment-14252956
]
Tyler Palsulich commented on TIKA-1445:
---
+1, Nick. That sounds good to me. I'll
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14252973#comment-14252973
]
Nick Burch commented on TIKA-1445:
--
In r1646624 I've added what I think should do the
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14252985#comment-14252985
]
Hudson commented on TIKA-1445:
--
UNSTABLE: Integrated in tika-trunk-jdk1.7 #371 (See
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14253057#comment-14253057
]
Hudson commented on TIKA-1445:
--
SUCCESS: Integrated in tika-trunk-jdk1.7 #372 (See
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14253076#comment-14253076
]
Hudson commented on TIKA-1445:
--
SUCCESS: Integrated in tika-trunk-jdk1.6 #356 (See
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222510#comment-14222510
]
Nick Burch commented on TIKA-1445:
--
I quite like Tim's idea. We can have things like
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14222512#comment-14222512
]
Chris A. Mattmann commented on TIKA-1445:
-
Yep I like the idea too. Time to figure
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14217685#comment-14217685
]
Dave Meikle commented on TIKA-1445:
---
bq. Hey Guys, to be honest, the way I see that we
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14217965#comment-14217965
]
Tim Allison commented on TIKA-1445:
---
How about using the order of parsers as specified in
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14216351#comment-14216351
]
Tim Allison commented on TIKA-1445:
---
[~gagravarr], thank you for explaining the original
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14216365#comment-14216365
]
Tim Allison commented on TIKA-1445:
---
Copied from dev discussion to record points on this
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14216444#comment-14216444
]
Nick Burch commented on TIKA-1445:
--
I think it's fairly common for people to have 4-5
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14216451#comment-14216451
]
Chris A. Mattmann commented on TIKA-1445:
-
Hey Guys, to be honest, the way I see
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14216466#comment-14216466
]
Nick Burch commented on TIKA-1445:
--
Anyone using tika-parser OOTB has two parsers services
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14216960#comment-14216960
]
Chris A. Mattmann commented on TIKA-1445:
-
Hi Nick:
I think we need to be careful
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14217100#comment-14217100
]
Lewis John McGibbney commented on TIKA-1445:
We can run many extractors against
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14217407#comment-14217407
]
Lewis John McGibbney commented on TIKA-1445:
OK so in Any23, if we were to take
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14214668#comment-14214668
]
Tim Allison commented on TIKA-1445:
---
This might muddy results, initially, but users could
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14215170#comment-14215170
]
Luis Filipe Nassif commented on TIKA-1445:
--
+1 to respect the order of parsers in
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14215292#comment-14215292
]
Nick Burch commented on TIKA-1445:
--
+1 to respect the order of parsers in the service
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14215303#comment-14215303
]
Chris A. Mattmann commented on TIKA-1445:
-
Hey [~talli...@apache.org]:
Here are my
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14213858#comment-14213858
]
Chris A. Mattmann commented on TIKA-1445:
-
Tim, I wonder if it's possible to clone
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14212246#comment-14212246
]
Tim Allison commented on TIKA-1445:
---
The AutoDetectParser was doing its regular lookup
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14212258#comment-14212258
]
Tim Allison commented on TIKA-1445:
---
This is what we're currently doing in
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14211277#comment-14211277
]
Tyler Palsulich commented on TIKA-1445:
---
[~talli...@apache.org], what was the system
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14185574#comment-14185574
]
Tim Allison commented on TIKA-1445:
---
I played with this a bit with a png test file.
The
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14185644#comment-14185644
]
Tyler Palsulich commented on TIKA-1445:
---
bq. Doh! Send in a DefaultHandler instead of
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14183873#comment-14183873
]
Tyler Palsulich commented on TIKA-1445:
---
I've been trying my hand at this some time
[
https://issues.apache.org/jira/browse/TIKA-1445?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14169090#comment-14169090
]
Hong-Thai Nguyen commented on TIKA-1445:
Interesting question!
For me, parser's