Hi Joe, You've more or less answered your own question I think. Please also see the wiki entries for the LinkRank scoring. http://wiki.apache.org/nutch/NewScoring Thanks Lewis
On Sun, Jun 16, 2013 at 12:14 PM, Joe Zhang <[email protected]> wrote: > Is scoring-link preferred over scoring-opic? I saw some disucssion of > deficiencies of opic. > > > On Sun, Jun 16, 2013 at 12:12 PM, Lewis John Mcgibbney < > [email protected]> wrote: > > > Yes Joe this is correct. > > > > > > On Sun, Jun 16, 2013 at 12:03 PM, Joe Zhang <[email protected]> > wrote: > > > > > Thanks. > > > > > > with regards to (2), is this score the "boost" we see in solr index? > > > > > > > > > On Sun, Jun 16, 2013 at 10:38 AM, Ahme Emre Aladağ > > > <[email protected]>wrote: > > > > > > > Note: I'm a newbie. > > > > > > > > As far as I know, new scoring and scoring-link corresponds to > LinkRank. > > > > It's implemented in the scoring.webgraph package. The code in the > > > > scoring-link might be linking the scoring plugin system to the > LinkRank > > > > class in webgraph. > > > > > > > > 1) Yes it works for sorting the pages. The topN most > important-seeming > > > > pages are fetched in the next cycles according to this scoring. > > > > 2) Relevance in retrieval is affected due to (1). It calculates the > > > scores > > > > and gives them to Solr. Solr will rank the search results according > to > > > > these scores and some other external custom scores. > > > > > > > > > > > > ----- Orijinal Mesaj ----- > > > > Kimden: "Joe Zhang" <[email protected]> > > > > Kime: "user" <[email protected]> > > > > Gönderilenler: 15 Haziran Cumartesi 2013 23:41:33 > > > > Konu: Nutch scoring question again > > > > > > > > The plugins directory only contains two scoring plugs: scoring-link > and > > > > scoring-opic. What about the newscoring, linkrank, etc.? Where are > they > > > > available? > > > > > > > > Again, I'm confused about the nature/purpose of such scoring: > > > > > > > > 1. Does it work as a sorting function for the frontier of the > crawling? > > > --> > > > > this seems reasonable. > > > > 2. Or does it affect relevance in retrieval? If so, why is it handled > > in > > > > the crawler, but not solr? > > > > > > > > I'd greatly appreciate any enlightment. > > > > > > > > > > > > > > > -- > > *Lewis* > > > -- *Lewis*

