[jira] [Commented] (SPARK-10945) GraphX computes Pagerank with NaN (with some datasets)

Robin East (JIRA) Mon, 22 Feb 2016 00:52:37 -0800

    [ 
https://issues.apache.org/jira/browse/SPARK-10945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15156639#comment-15156639
 ]


Robin East commented on SPARK-10945:
------------------------------------

It's not obvious how to reproduce this from the datasets available at the 
download site. You mentioned that the 'dataset format was converted to 
edge-list, no edge weights at all'. Can you share the code that converts from 
the WebGraph format to an edge list? Alternatively, can you make the input 
file available?
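
While the actual input file is unavailable, a quick sanity check of a 
converted edge list might look like the sketch below. It is only an 
illustration and is not GraphX code: summarize_edge_list is a hypothetical 
helper, and it assumes one whitespace-separated "src dst" pair of integer 
vertex IDs per line with '#' comment lines skipped, which is the layout 
GraphLoader.edgeListFile reads. Treating any line without exactly two fields 
as malformed is a stricter rule chosen here for the check. Nothing it reports 
is known to be the cause of the NaN values.

```python
def summarize_edge_list(lines):
    """Summarize an unweighted edge list given as an iterable of text lines.

    Hypothetical helper for illustration. Assumes "src dst" integer pairs,
    one per line; '#' lines are comments and blank lines are ignored.
    """
    stats = {"edges": 0, "malformed": 0, "self_loops": 0, "comments": 0}
    vertices = set()
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # ignore blank lines
        if line.startswith("#"):
            stats["comments"] += 1
            continue
        parts = line.split()
        if len(parts) != 2:
            # Stricter than a loader needs to be: flags weights or missing IDs.
            stats["malformed"] += 1
            continue
        try:
            src, dst = int(parts[0]), int(parts[1])
        except ValueError:
            stats["malformed"] += 1  # non-integer vertex ID
            continue
        stats["edges"] += 1
        if src == dst:
            stats["self_loops"] += 1
        vertices.add(src)
        vertices.add(dst)
    stats["vertices"] = len(vertices)
    return stats


if __name__ == "__main__":
    import sys

    # Usage: python check_edges.py path/to/edges.txt
    with open(sys.argv[1]) as handle:
        print(summarize_edge_list(handle))
```

Running it on the converted file and posting the counts alongside the 
conversion code would make it easier to compare inputs.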

> GraphX computes Pagerank with NaN (with some datasets)
> ------------------------------------------------------
>
>                 Key: SPARK-10945
>                 URL: https://issues.apache.org/jira/browse/SPARK-10945
>             Project: Spark
>          Issue Type: Bug
>          Components: GraphX
>    Affects Versions: 1.3.0
>         Environment: Linux
>            Reporter: Khaled Ammar
>              Labels: test
>
> Hi,
> I run GraphX on a medium-size standalone Spark 1.3.0 installation. 
> PageRank typically works fine, except with one dataset (Twitter: 
> http://law.di.unimi.it/webdata/twitter-2010). This is a public dataset that 
> is commonly used in research papers.
> I found that many vertices have NaN values. This is true even if the 
> algorithm runs for only 1 iteration.
> Thanks,
> -Khaled



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
