Re: Tesseract OCR always activeated parser for images

2014-10-07 Thread Tyler Palsulich
++ -Original Message- From: Tyler Palsulich tpalsul...@gmail.com Reply-To: dev@tika.apache.org dev@tika.apache.org Date: Tuesday, October 7, 2014 at 1:49 AM To: dev@tika.apache.org dev@tika.apache.org Subject: Re: Tesseract OCR always

Tesseract OCR always activeated parser for images

2014-10-06 Thread Lewis John Mcgibbney
Hi Folks, Now, once I install Tesseract, it is run for every image I pass through Tika server or Tika app. This is not okay as it does not give me the type of MD I am looking for. This is a just a note to folks, to say that AFAIK you would need to unregister the the parser from [0] then rebuild

Re: Tesseract OCR always activeated parser for images

2014-10-06 Thread Tyler Palsulich
Confirmed. This is why we ran into TIKA-1422. But, Chris' patch may provide the backwards compatibility you're looking for. What do you think? Tyler On Mon, Oct 6, 2014 at 7:47 PM, Lewis John Mcgibbney lewis.mcgibb...@gmail.com wrote: Hi Folks, Now, once I install Tesseract, it is run for