Good day, We are running Nutch using the Eclipse environment, the first time we use it to crawl for a site (e.g. http://inquirer.net ) there seems to be nothing wrong with it. But when we attempt to crawl a *differen*t site (e.g. www.mb.com.ph), we encounter the following errors:
*Generator: 0 records selected for fetching, exiting ... Stopping at depth=0 - no more URLs to fetch. No URLs to fetch - check your seed list and URL filters. crawl finished: crawl.te8* However these errors do not occur when we crawl the first site for the second time. *Our further testing have made us to conclude that only the FIRST site to be crawled can be crawled without encountering the errors specified above.* Can you please tell us why these errors occurs? Thanks, -- Emily Marie A. Tabugadir Academic Affairs Committee Head, UP CURSOR BS Computer Science University of the Philippines, Diliman
