Hi !

Doing some test With Webgraph and Linkrank i can not figure out how
totalInlinkScore is initialized (totalInlinkScore at
LinkRank.Analyzer$reduce):
If i understand well LinkScore is computed as follow :

   - Counter  : count the  number of nodes.
   - Initializer : initialize score to 1 (by default).


Repeat for a fixed number of iterations :

   - Inverter : invert outlink into inlink and compute the score
   transmitted by each link.
   - Analyzer : update the score of each node by summing the score of each
   inlinks (in a variable called totalInlinkScore) and apply the damping
   factor.


My question is : during Analyzer, why is the sum of inlinks scores
initialized to 1/N where N is the result of Counter(see totalInlinkScore at
LinkRank.Analyzer$reduce). Tell me if i am wrong but In a normal PageRank
this sum would be initialized to zero. I know only two variant of PageRank :

    Normalized version of PageRank in which all vertices score is set to
1/N during Initializer and the update formula becomes (1-damping)/N
+damping*totalInlinkScore.
    non-Normalized PageRank which is close to what nutch is doing right
know but do not justify the initialization of Inlink Score at 1/N. If i
understand well, the advantage of non-Normalized pageRank is that the total
number of node is remove from PageRank Formula so there is no need to run
Counter.

In both case i do not see why totalInlinkScore would be initialized to
something other than zero.

Thanks you for any help.

------

Arthur

Reply via email to