A set of properties including the ones above, based on my understanding, are part of the scoring component in Nutch.
What I don't quite understand is the nature/purpose of such scoring: 1. Does it work as a sorting function for the frontier of the crawling? --> this seems reasonable. 2. If it is some PageRank style ranking algorithm intended for retrieval, shouldn't it be part of solr, instead of the crawler? Which one is true?

