URL Fetch Error

Marie Tabugadir Wed, 20 Aug 2008 14:12:00 -0700

Good day,

We are running Nutch in the Eclipse environment. The first time we use it
to crawl a site (e.g. http://inquirer.net ), nothing seems to be wrong.
But when we attempt to crawl a *different* site (e.g. www.mb.com.ph), we
encounter the following errors:


*Generator: 0 records selected for fetching, exiting ...
Stopping at depth=0 - no more URLs to fetch.
No URLs to fetch - check your seed list and URL filters.
crawl finished: crawl.te8*

However, these errors do not occur when we crawl the first site a
second time. *Our further testing has led us to conclude that only the
FIRST site to be crawled can be crawled without encountering the errors
specified above.*

Can you please tell us why these errors occur?


Thanks,
-- 
Emily Marie A. Tabugadir
Academic Affairs Committee Head, UP CURSOR
BS Computer Science
University of the Philippines, Diliman
