Hi Nutch guys
I used org.apache.nutch.parse.rss.RSSParser , for parsing RSS feeds. It is
showing urls on console.Now i want to crawl those urls.
How will i do this? Does RSSPrser class store it in crawldb or
i need to send to all URLs to crawldb.Then run the crawl command.
Is there another approach?
--
View this message in context:
http://www.nabble.com/How-to-crawl-URLs-getting-from-RSSParser-tp24404179p24404179.html
Sent from the Nutch - User mailing list archive at Nabble.com.