To debug what's going on, you need the indexchecker and parsechecker tools.

> Hi,
> I'm using Nutch 1.3 to crawl.
>
> It works fine with HTML, but for PDF files I don't see any data for title
> in Solr when I index into Solr (trunk version 4.0).
>
> Does any mapping/config need to be done for PDF files?
> 
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/title-is-missing-when-crawling-pdf-file-tp3694141p3694141.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
