[jira] [Commented] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16007416#comment-16007416 ] Tim Allison commented on TIKA-2359: --- Sorry, took me a while to dig into this. I hadn't seen our

[jira] [Commented] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16007276#comment-16007276 ] Luis Filipe Nassif commented on TIKA-2359: -- In the past I was against enabling tesseract by

[jira] [Commented] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Eugen Mayer (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006750#comment-16006750 ] Eugen Mayer commented on TIKA-2359: --- any informations how the binaries are called and how to disable

[jira] [Commented] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006697#comment-16006697 ] Tim Allison commented on TIKA-2359: --- IIRC, might also want to check ExifTool and Strings...which I think

[jira] [Commented] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Eugen Mayer (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006692#comment-16006692 ] Eugen Mayer commented on TIKA-2359: --- Anyways, case closed, thank you for the quick response > Extreme

[jira] [Commented] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Eugen Mayer (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006689#comment-16006689 ] Eugen Mayer commented on TIKA-2359: --- oh holy..seriously? By default OCR by simply having a lib installed

[jira] [Commented] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006680#comment-16006680 ] Tim Allison commented on TIKA-2359: --- y, Tika will call tesseract on every image file in your document,

[jira] [Commented] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Eugen Mayer (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006667#comment-16006667 ] Eugen Mayer commented on TIKA-2359: --- interestingly, no, i get an option list: tesseract Usage:

[jira] [Commented] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006665#comment-16006665 ] Tim Allison commented on TIKA-2359: --- Doh, right, tika-app. Thank you. To confirm, if you type

[jira] [Commented] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Eugen Mayer (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16006661#comment-16006661 ] Eugen Mayer commented on TIKA-2359: --- thats my call java -jar tika.jar Sample-doc-file-2000kb.doc So to

[jira] [Created] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Eugen Mayer (JIRA)
Eugen Mayer created TIKA-2359: - Summary: Extreme slow parsing on the attachment attached Key: TIKA-2359 URL: https://issues.apache.org/jira/browse/TIKA-2359 Project: Tika Issue Type: Bug

[jira] [Updated] (TIKA-2359) Extreme slow parsing on the attachment attached

2017-05-11 Thread Eugen Mayer (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2359?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eugen Mayer updated TIKA-2359: -- Attachment: Sample-doc-file-2000kb.doc > Extreme slow parsing on the attachment attached >