Hello,

I am trying crawl a website using nutch trunk along with the latest tika
It gives me an error:

Can't retrieve Tika parser for mime-type text/aspdotnet

But when I try to parse the same url using the tika-app-1.10.jar using the
command

$ java -jar tika-app-1.10.jar -m url

It prints the metadata.

I also tried using tika-mimetypes.xml in my conf and used it in
nutch-site.xml

how shall I fix this issue?

Thanks,

Manali

Reply via email to