1)How i can make Nutch not follow the external links ?.
For example, i only want to fetch http://www.nutch.org and not the external links on this site which points to other sites such as http://labs.yahoo.com/demo/nutch/ .
2)I am also unable to save "Anchors" when i fetch the sites,the site cache page are saved but without anchors.
Thanks.
Do you Yahoo!?
Friends. Fun. Try the all-new Yahoo! Messenger
