We perform a web crawl in Nutch with a number of pages including for
example www.apple.com and www.luggagepensgifts.com. 
 
After that we run a query "apple" and obtain some results including
the pages from www.apple.com. If we specify the domain for search
"apple site:www.apple.com" we get only the pages from www.apple.com.
The number of resulting pages may be considered the number of the
pages crawled at the domain www.apple.com.    
 
But if we search either for "luggagepensgifts" or for
"luggagepensgifts site:www.luggagepensgifts.com" there are no results
returned. This site is included in the search for sure because
searching for other words specific for its pages returns results. The
same is, e.g. for www.nycexoticcarrentals.com.    
 
What may be the matter for this behavior and how can we obtain the
pages with “luggagepensgifts” in domain? 


-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys - and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to