Hi, I'm using nutch 0.8.1 to index several thousand text files (source code) and I use intranet crawling method to create an index.
Everything looks fine, but when I try to search something, it often doesn't find what it should. I'm sure that the term is in several pages, but I got result only for some of them. I tried to set limits in properties like page sizes, number of links etc. but nothing helped. There aren't any error messages in logfile during crawl. Is there any way how to find a reason for this behavior ? How to make nutch more reliable in results? Thanks for any hint. Libor ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
