Hello, I am trying crawl a website using nutch trunk along with the latest tika It gives me an error:
Can't retrieve Tika parser for mime-type text/aspdotnet But when I try to parse the same url using the tika-app-1.10.jar using the command $ java -jar tika-app-1.10.jar -m url It prints the metadata. I also tried using tika-mimetypes.xml in my conf and used it in nutch-site.xml how shall I fix this issue? Thanks, Manali