Hi,
This is not quite the same thing, but Nutch 1.5 adds a new generator
parameter that lets you restrict URL selection by status.
Cheers
-------- Original Message --------
Subject: Re: Make Nutch to crawl internal urls only
Date: Thu, 10 May 2012 02:24:18 -0700 (PDT)
From: Greg Fields <gracken...@gmail.com>
To: user@nutch.apache.org
Reply-To: user@nutch.apache.org
I have a similar problem. Is there a way I can force the fetcher to
only take URLs from the unfetched URL list?
--
Markus Jelsma - CTO - Openindex