[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16007416#comment-16007416
]
Tim Allison commented on TIKA-2359:
---
Sorry, took me a while to dig into this. I hadn't seen our
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16007276#comment-16007276
]
Luis Filipe Nassif commented on TIKA-2359:
--
In the past I was against enabling tesseract by
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006750#comment-16006750
]
Eugen Mayer commented on TIKA-2359:
---
any informations how the binaries are called and how to disable
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006697#comment-16006697
]
Tim Allison commented on TIKA-2359:
---
IIRC, might also want to check ExifTool and Strings...which I think
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006692#comment-16006692
]
Eugen Mayer commented on TIKA-2359:
---
Anyways, case closed, thank you for the quick response
> Extreme
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006689#comment-16006689
]
Eugen Mayer commented on TIKA-2359:
---
oh holy..seriously? By default OCR by simply having a lib installed
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006680#comment-16006680
]
Tim Allison commented on TIKA-2359:
---
y, Tika will call tesseract on every image file in your document,
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006667#comment-16006667
]
Eugen Mayer commented on TIKA-2359:
---
interestingly, no, i get an option list:
tesseract
Usage:
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006665#comment-16006665
]
Tim Allison commented on TIKA-2359:
---
Doh, right, tika-app. Thank you.
To confirm, if you type
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006661#comment-16006661
]
Eugen Mayer commented on TIKA-2359:
---
thats my call
java -jar tika.jar Sample-doc-file-2000kb.doc
So to
Eugen Mayer created TIKA-2359:
-
Summary: Extreme slow parsing on the attachment attached
Key: TIKA-2359
URL: https://issues.apache.org/jira/browse/TIKA-2359
Project: Tika
Issue Type: Bug
[
https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Eugen Mayer updated TIKA-2359:
--
Attachment: Sample-doc-file-2000kb.doc
> Extreme slow parsing on the attachment attached
>
12 matches
Mail list logo