Re: Simple crawl fails to find any URLs

bhupal Tue, 29 Jan 2008 01:54:36 -0800

Hi,

Look at your conf/nutch-default.xml.
I think you have not added the URL filter plugin (urlfilter-regex) to the
plugin.includes property.
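
A quick way to check this is to read the property out of the config file. The sketch below is only an illustration, assuming the file uses the usual Hadoop-style <configuration>/<property>/<name>/<value> layout and that the property is called plugin.includes. The sample XML and the function names are made up for the example and are not taken from any real install.

```python
import re
import xml.etree.ElementTree as ET

# Illustrative sample only: a cut-down Hadoop-style configuration file.
# The plugin list here is an assumption, not a copy of any real install.
SAMPLE_CONF = """<?xml version="1.0"?>
<configuration>
  <property>
    <name>plugin.includes</name>
    <value>protocol-http|urlfilter-regex|parse-(text|html)|index-basic</value>
  </property>
</configuration>
"""


def plugin_includes(conf_xml):
    """Return the value of the plugin.includes property, or None if absent."""
    root = ET.fromstring(conf_xml)
    for prop in root.iter("property"):
        if prop.findtext("name", "").strip() == "plugin.includes":
            return prop.findtext("value", "").strip()
    return None


def has_url_filter(conf_xml):
    """True if the plugin.includes value mentions any urlfilter-* plugin."""
    value = plugin_includes(conf_xml)
    return value is not None and re.search(r"urlfilter-", value) is not None


if __name__ == "__main__":
    print(plugin_includes(SAMPLE_CONF))
    print(has_url_filter(SAMPLE_CONF))
```

To try it on a real setup, pass in the text of conf/nutch-default.xml. If conf/nutch-site.xml also defines the property, check that file too, since site settings override the defaults.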


bhupal.


Barry Haddow wrote:
> 
> Hi
> 
> I'm trying to get the nutch/hadoop example from 
> http://wiki.apache.org/nutch/NutchHadoopTutorial
> running.  
> 
> I've set up the urllist.txt and the crawl-urlfilter.txt as instructed in the
> tutorial, but whenever I run the crawl it reports either
> 
> Generator: 0 records selected for fetching, exiting ...
> Stopping at depth=1 - no more URLs to fetch.
> 
> or
> 
> Generator: 0 records selected for fetching, exiting ...
> Stopping at depth=0 - no more URLs to fetch.
> 
> 
> I can't tell if the crawler has managed to fetch any data. How can I extract
> whatever data it has downloaded?
> 
> thanks,
> Barry
> 
> 

-- 
View this message in context: 
http://www.nabble.com/Simple-crawl-fails-to-find-any-URLs-tp15143487p15155976.html
Sent from the Nutch - User mailing list archive at Nabble.com.
