We ran a web crawl in Nutch over a number of sites, including, for example, www.apple.com and www.luggagepensgifts.com. When we then run the query "apple", the results include pages from www.apple.com. If we restrict the search to that domain with "apple site:www.apple.com", we get only pages from www.apple.com, and the number of results can be taken as the number of pages crawled at www.apple.com.

However, if we search for either "luggagepensgifts" or "luggagepensgifts site:www.luggagepensgifts.com", no results are returned. The site is definitely in the index, because searching for other words specific to its pages returns results. The same happens with, e.g., www.nycexoticcarrentals.com.

What could be causing this behavior, and how can we retrieve the pages in a domain such as www.luggagepensgifts.com with a query like "luggagepensgifts"?
_______________________________________________
Nutch-general mailing list
https://lists.sourceforge.net/lists/listinfo/nutch-general
