Hi,

On Sat, Aug 23, 2008 at 4:48 AM, MaRiE16 <[EMAIL PROTECTED]> wrote:
>
> Good day,
>
> We are running Nutch using the Eclipse environment, the first time we use it
> to crawl for a site (e.g. http://inquirer.net ) there seems to be nothing
> wrong with it.
> But when we attempt to crawl a different site (e.g. www.mb.com.ph), we
> encounter the following errors:
>
> Generator: 0 records selected for fetching, exiting ...
> Stopping at depth=0 - no more URLs to fetch.
> No URLs to fetch - check your seed list and URL filters.
> crawl finished: crawl.te8
>
> However these errors do not occur when we crawl the first site for the
> second time. Our further testing have made us to conclude that only the
> FIRST site to be crawled can be crawled without encountering the errors
> specified above.
>
> Can you please tell us why these errors occurs?

For some reason, the generator did not generate any URLs. Possibly
something is wrong with your URL filters. I would recommend checking
conf/crawl-urlfilter.txt (if you are doing a local crawl) or
conf/regex-urlfilter.txt.

>
> --
> View this message in context:
> http://www.nabble.com/URL-Fetch-Error-tp19117909p19117909.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
>

--
Doğacan Güney
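
In case it helps when reading those filter files: each rule line starts with + (accept) or - (reject) followed by a regular expression, the first matching rule decides, and a URL that matches no rule is ignored. The sketch below imitates that behaviour in plain Java so you can test a rule set against a URL outside of Nutch. It is not Nutch's own filter code. The class name UrlFilterSketch and the rule list are made up for illustration, and the guess that your filter file still whitelists only the first site's domain is an assumption, not something visible in your log.

```java
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

/** Illustrative sketch only; this is not Nutch's own filter class. */
final class UrlFilterSketch {

    /**
     * Applies the rules in order. Each rule is '+' (accept) or '-' (reject)
     * followed by a regular expression. The first rule whose expression is
     * found in the URL decides; a URL that matches no rule is rejected.
     */
    static boolean accepts(List<String> rules, String url) {
        for (String rule : rules) {
            String line = rule.trim();
            if (line.isEmpty() || line.startsWith("#")) {
                continue; // skip blank lines and comments
            }
            char sign = line.charAt(0);
            if (sign != '+' && sign != '-') {
                throw new IllegalArgumentException("Rule must start with + or -: " + line);
            }
            if (Pattern.compile(line.substring(1)).matcher(url).find()) {
                return sign == '+';
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Hypothetical rule set: only the first site's domain is whitelisted.
        List<String> rules = Arrays.asList(
            "-\\.(gif|jpg|png|css|js)$",
            "+^http://([a-z0-9]*\\.)*inquirer\\.net/",
            "-.");
        System.out.println(accepts(rules, "http://inquirer.net/"));  // prints true
        System.out.println(accepts(rules, "http://www.mb.com.ph/")); // prints false
    }
}
```

With a rule set like this one, every URL from the second site is filtered out before the generator runs, which would produce "0 records selected for fetching". Adding an accept line for the second domain ahead of the final "-." rule would let it through. Note also that the rule in this sketch expects a URL beginning with http://, so a seed written as a bare www.mb.com.ph would not match it either.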
