and scoring-link != LinkRank?
On Sun, Jun 16, 2013 at 12:20 PM, Joe Zhang <[email protected]> wrote:

> So this is a stand-alone process after the crawling.
>
>
> On Sun, Jun 16, 2013 at 12:17 PM, Markus Jelsma <[email protected]> wrote:
>
>> Hi Joe,
>>
>> You don't need a scoring filter for LinkRank. Just follow the wiki and
>> run the webgraph tool on your segments. Then you can run the linkrank
>> tool on the webgraph you just created from your segments. Finally, use
>> the scoreupdater tool to write the scores back to your crawldb.
>>
>> Cheers
>>
>> https://wiki.apache.org/nutch/NewScoring
>>
>>
>> -----Original message-----
>> > From: Joe Zhang <[email protected]>
>> > Sent: Sun 16-Jun-2013 21:14
>> > To: user <[email protected]>
>> > Subject: Re: Nutch scoring question again
>> >
>> > Is scoring-link preferred over scoring-opic? I saw some discussion of
>> > deficiencies of opic.
>> >
>> >
>> > On Sun, Jun 16, 2013 at 12:12 PM, Lewis John Mcgibbney <[email protected]> wrote:
>> >
>> > > Yes Joe this is correct.
>> > >
>> > >
>> > > On Sun, Jun 16, 2013 at 12:03 PM, Joe Zhang <[email protected]> wrote:
>> > >
>> > > > Thanks.
>> > > >
>> > > > With regard to (2), is this score the "boost" we see in the Solr index?
>> > > >
>> > > >
>> > > > On Sun, Jun 16, 2013 at 10:38 AM, Ahme Emre Aladağ <[email protected]> wrote:
>> > > >
>> > > > > Note: I'm a newbie.
>> > > > >
>> > > > > As far as I know, new scoring and scoring-link correspond to LinkRank.
>> > > > > It's implemented in the scoring.webgraph package. The code in
>> > > > > scoring-link might be linking the scoring plugin system to the
>> > > > > LinkRank class in webgraph.
>> > > > >
>> > > > > 1) Yes, it works for sorting the pages. The topN most important-seeming
>> > > > > pages are fetched in the next cycles according to this scoring.
>> > > > > 2) Relevance in retrieval is affected due to (1). It calculates the
>> > > > > scores and gives them to Solr. Solr will rank the search results
>> > > > > according to these scores and some other external custom scores.
>> > > > >
>> > > > >
>> > > > > ----- Original Message -----
>> > > > > From: "Joe Zhang" <[email protected]>
>> > > > > To: "user" <[email protected]>
>> > > > > Sent: Saturday, 15 June 2013 23:41:33
>> > > > > Subject: Nutch scoring question again
>> > > > >
>> > > > > The plugins directory only contains two scoring plugins: scoring-link
>> > > > > and scoring-opic. What about the newscoring, linkrank, etc.? Where are
>> > > > > they available?
>> > > > >
>> > > > > Again, I'm confused about the nature/purpose of such scoring:
>> > > > >
>> > > > > 1. Does it work as a sorting function for the frontier of the crawling?
>> > > > > --> this seems reasonable.
>> > > > > 2. Or does it affect relevance in retrieval? If so, why is it handled
>> > > > > in the crawler, but not Solr?
>> > > > >
>> > > > > I'd greatly appreciate any enlightenment.
>> > >
>> > > --
>> > > *Lewis*
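
For reference, the three post-crawl steps Markus describes (webgraph, then linkrank, then scoreupdater) could be scripted roughly as below. This is only a sketch: the `bin/nutch` sub-command names and flags are assumptions recalled from typical Nutch 1.x usage, and the `crawl/` directory layout and the helper names `linkrank_commands` and `run_linkrank` are hypothetical. Check everything against the NewScoring wiki page and your own Nutch version before running it.

```python
import subprocess


def linkrank_commands(crawl_dir, nutch="bin/nutch"):
    """Build the three post-crawl commands as argument lists.

    ASSUMPTION: the sub-command names and flags below follow common
    Nutch 1.x usage; they are not taken from the thread itself.
    """
    webgraphdb = f"{crawl_dir}/webgraphdb"
    return [
        # 1. Build the webgraph from all segments under crawl_dir/segments.
        [nutch, "webgraph", "-segmentDir", f"{crawl_dir}/segments",
         "-webgraphdb", webgraphdb],
        # 2. Run the LinkRank analysis on the webgraph just created.
        [nutch, "linkrank", "-webgraphdb", webgraphdb],
        # 3. Write the computed scores back into the crawldb.
        [nutch, "scoreupdater", "-crawldb", f"{crawl_dir}/crawldb",
         "-webgraphdb", webgraphdb],
    ]


def run_linkrank(crawl_dir, nutch="bin/nutch"):
    """Run the steps in order; check=True stops at the first failure."""
    for cmd in linkrank_commands(crawl_dir, nutch):
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Print the commands instead of running them, so the sketch is
    # safe to try without a Nutch installation.
    for cmd in linkrank_commands("crawl"):
        print(" ".join(cmd))
```

Because this runs separately from the fetch cycle, it matches Joe's reading that LinkRank is a stand-alone process after the crawl, with no scoring filter involved.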

