Hi, Look at your conf/nutch-default.xml. I think you have not added crawl-urlfilter plugin in plugin-include property.
bhupal. Barry Haddow wrote: > > Hi > > I'm try to get the nutch/hadoop example from > http://wiki.apache.org/nutch/NutchHadoopTutorial > running. > > I've set up the urllist.txm and the crawl-urlfilter.xml as instructed in > the > tutorial, but whenever I run the crawl it either reports > > Generator: 0 records selected for fetching, exiting ... > Stopping at depth=1 - no more URLs to fetch. > > or > > Generator: 0 records selected for fetching, exiting ... > Stopping at depth=0 - no more URLs to fetch. > > > I can't tell if the crawler has managed to fetch any data. How can I > extract > whatever data is has downloaded? > > thanks, > Barry > > -- View this message in context: http://www.nabble.com/Simple-crawl-fails-to-find-any-URLs-tp15143487p15155976.html Sent from the Nutch - User mailing list archive at Nabble.com.