No - you won't be able to crawl this page. Nutch will follow robots
directive of the domain - see http://search.yahoo.com/robots.txt.


-Devang.


-----Original Message-----
From: Kim Theng Chong [mailto:kimthe...@yahoo.com] 
Sent: Tuesday, March 30, 2010 10:00 PM
To: nutch-user@lucene.apache.org
Subject: Crawl yahoo search result page

Hi all,

Can Nutch crawl Yahoo search result page? eg :
http://search.yahoo.com/search?rd=&fp_ip=my&p=ontology&toggle=1&cop=mss&ei=U
TF-8&fr=yfp-t-892 (put as seed url) . I was not able to fetch the results in
this page. Can someone guide me on this?

Thank you.

Best regards,
Kim


      

Reply via email to