Re: bugs in Spark PageRank implementation

2015-06-25 Thread Sean Owen
#2 is not a bug. Have a search through JIRA. It is merely unformalized. I think that is how (one of?) the original PageRank papers does it. On Thu, Jun 25, 2015, 7:39 AM Kelly, Terence P (HP Labs Researcher) terence.p.ke...@hp.com wrote: Hi, Colleagues and I have found that the PageRank

bugs in Spark PageRank implementation

2015-06-24 Thread Kelly, Terence P (HP Labs Researcher)
Hi, Colleagues and I have found that the PageRank implementation bundled with Spark is incorrect in several ways. The code in question is in Apache Spark 1.2 distribution's examples directory, called SparkPageRank.scala. Consider the example graph presented in the colorful figure on the

Re: bugs in Spark PageRank implementation

2015-06-24 Thread Tarek Auel
Hi Terence, which implementation are you using? I tested it and the results look very good id --- result value -percentage --- percentage (wikipedia) 2: 3.5658816369034536 (38.43986817970977 %), 38.4% 3: 3.1809909923039688 (34.29078328331496 %), 34.3% 5: 0.7503491964913347