As you know, Nutch 2.x uses Tika 1.2. If this bug was fixed and the fix
made it into the 1.2 release, then the problem is related to something
else. If it did not, you will need to either patch and build the
dependency yourself and put it on your classpath, or use the
appropriate development distribution.
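
One way to see which jars you are actually picking up is to ask the JVM
where a class was loaded from. The snippet below is only a rough sketch:
JarLocator is a hypothetical helper (not part of Nutch or Tika), and the
two default class names are my assumption about what you would want to
check. Run it with the same classpath as your Nutch job.

```java
import java.security.CodeSource;

// Hypothetical helper, not part of Nutch or Tika: reports which jar
// (or directory) a given class was loaded from.
final class JarLocator {

    static String locate(String className) {
        try {
            // Load without initializing, so no static initializers run.
            Class<?> cls = Class.forName(className, false,
                    JarLocator.class.getClassLoader());
            CodeSource src = cls.getProtectionDomain().getCodeSource();
            // Classes loaded by the bootstrap loader have no code source.
            return src == null
                    ? "(no code source)"
                    : String.valueOf(src.getLocation());
        } catch (ClassNotFoundException | LinkageError e) {
            return "(not on classpath)";
        }
    }

    public static void main(String[] args) {
        // Assumed class names to check; pass your own as arguments.
        String[] names = args.length > 0
                ? args
                : new String[] {"org.ccil.cowan.tagsoup.Parser",
                                "org.apache.tika.Tika"};
        for (String name : names) {
            System.out.println(name + " -> " + locate(name));
        }
    }
}
```

The jar file name printed for each class normally carries the version,
which you can then compare against the release that contains the fix.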

hth

Lewis

On Tue, Nov 6, 2012 at 11:16 PM, kiran chitturi
<[email protected]> wrote:
> Hi,
>
> I was trying to parse HTML files and it threw this error for one
> particular HTML file. I guess we are using tagsoup for parsing, and
> someone has already fixed this in the Tika code. (
> https://github.com/jukka/tagsoup/commit/9cfe7b48745173faafa419f540538a0b6309b699
> )
>
> Can someone tell me whether this revision is included in the Tika that
> ships with Nutch-2.x? Should I use the latest tika-dev to get the fix
> and change the libraries in ivy.xml?
>
> Thank you,
>
> --
> Kiran Chitturi



-- 
Lewis
