Hi,
this question is difficult to answer and may be there more experts in
the nutch user list than in the developer list.
In nutch 0.8 you can use the new scoring api to change the scoring of
a page for being scheduled for crawling based on the it's scores.
Have a look to the opic score plu
Hi,
I have successfully configured nutch 0.7.2. Ran the crawler a few times all
working fine. Now i wanted to know is there a way i can run the crawler so
that if it finds certain keyword in a website only then it indexes it
otherwise not. Also after i have the index created is it possible that i