Re: URL Fetch Error

Doğacan Güney Sat, 23 Aug 2008 02:43:18 -0700

Hi,

On Sat, Aug 23, 2008 at 4:48 AM, MaRiE16 <[EMAIL PROTECTED]> wrote:


>
> Good day,
>
> We are running Nutch from the Eclipse environment. The first time we use it
> to crawl a site (e.g. http://inquirer.net ), nothing seems to be wrong.
> But when we attempt to crawl a different site (e.g. www.mb.com.ph), we
> encounter the following errors:
>
> Generator: 0 records selected for fetching, exiting ...
> Stopping at depth=0 - no more URLs to fetch.
> No URLs to fetch - check your seed list and URL filters.
> crawl finished: crawl.te8
>
> However, these errors do not occur when we crawl the first site a
> second time. Further testing has led us to conclude that only the
> FIRST site to be crawled can be crawled without encountering the errors
> specified above.
>
> Can you please tell us why these errors occur?


For some reason, the generator did not generate any URLs. Possibly something is
wrong with your URL filters. I would recommend checking
conf/crawl-urlfilter.txt (if you are doing a local crawl) or
conf/regex-urlfilter.txt.
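
As a rough illustration of how a filter could produce this symptom, here is a
small Python sketch that mimics the regex URL filter rules outside of Nutch.
It assumes the usual rule format: each line is '+' or '-' followed by a regex,
the first matching rule decides, and a URL matching no rule is rejected. The
sample rules are hypothetical, not taken from your configuration, and the
helper names parse_rules and accepts are made up for this example.

```python
import re


def parse_rules(text):
    """Parse '+regex' / '-regex' lines, skipping blanks and '#' comments."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        sign, pattern = line[0], line[1:]
        if sign not in "+-":
            raise ValueError("rule must start with '+' or '-': " + line)
        rules.append((sign == "+", re.compile(pattern)))
    return rules


def accepts(rules, url):
    """Return the verdict of the first matching rule (assumed semantics)."""
    for allow, pattern in rules:
        if pattern.search(url):
            return allow
    return False  # assumption: a URL that matches no rule is rejected


# Hypothetical filter that only lets the first site through.
SAMPLE_RULES = r"""
# accept URLs on inquirer.net and its subdomains
+^http://([a-z0-9]*\.)*inquirer.net/
# reject everything else
-.
"""

rules = parse_rules(SAMPLE_RULES)
for url in ["http://inquirer.net/", "http://www.mb.com.ph/"]:
    print(url, "accepted" if accepts(rules, url) else "rejected")
```

If your filter file still names only the first site in its '+' rule, every
seed URL for the second site would be dropped before the generator runs,
which would match the "0 records selected for fetching" message.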




-- 
Doğacan Güney
