[ https://issues.apache.org/jira/browse/SPARK-10945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15156639#comment-15156639 ]
Robin East commented on SPARK-10945: ------------------------------------ It's not obvious how to reproduce this from the datasets available at the download site. You mentioned that 'dataset format was converted to edge-list, no edge weights at all.'. Can you share the code that converts from the WebGraph format to edge-list? Alternatively can you make the input file available? > GraphX computes Pagerank with NaN (with some datasets) > ------------------------------------------------------ > > Key: SPARK-10945 > URL: https://issues.apache.org/jira/browse/SPARK-10945 > Project: Spark > Issue Type: Bug > Components: GraphX > Affects Versions: 1.3.0 > Environment: Linux > Reporter: Khaled Ammar > Labels: test > > Hi, > I run GraphX in a medium size standalone Spark 1.3.0 installation. The > pagerank typically works fine, except with one dataset (Twitter: > http://law.di.unimi.it/webdata/twitter-2010). This is a public dataset that > is commonly used in research papers. > I found that many vertices have an NaN values. This is true, even if the > algorithm run for 1 iteration only. > Thanks, > -Khaled -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org