Re: GraphX Connected Components

2016-11-08 Thread Robineast
in context: http://apache-spark-user-list.1001560.n3.nabble.com/GraphX-Connected-Components-tp10869p28049.html Sent from the Apache Spark User List mailing list archive at Nabble.com.

RE: Question about GraphX connected-components

2015-10-12 Thread John Lilley
be needed. Arriving at the answer through experimentation isn’t a good approach, because that assumes -- chicken-and-egg problem -- that we have already arrived at an optimal configuration. -- Does GraphX connected-components performance degrade slowly or catastrophically when that memory limit

Re: Question about GraphX connected-components

2015-10-10 Thread Igor Berman
t 00:13, John Lilley <john.lil...@redpoint.net> wrote:
> Greetings,
>
> We are looking into using the GraphX connected-components algorithm on
> Hadoop for grouping operations. Our typical data is on the order of
> 50-200M vertices with an edge:vertex ratio between 2 and 30. Whi

Question about GraphX connected-components

2015-10-09 Thread John Lilley
Greetings, We are looking into using the GraphX connected-components algorithm on Hadoop for grouping operations. Our typical data is on the order of 50-200M vertices with an edge:vertex ratio between 2 and 30. While there are pathological cases of very large groups, they tend to be small. I

Re: SparkR -Graphx Connected components

2015-08-11 Thread Robineast
respectively. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/SparkR-Graphx-Connected-components-tp24165p24209.html

Re: SparkR -Graphx Connected components

2015-08-10 Thread smagadi
been 7,3 (5,3)-OK (2,0)-OK -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/SparkR-Graphx-Connected-components-tp24165p24190.html

SparkR -Graphx Connected components

2015-08-07 Thread smagadi
on how to get strongly connected nodes. Please help in completing this code. -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/SparkR-Graphx-Connected-components-tp24165.html

Re: SparkR -Graphx Connected components

2015-08-07 Thread Robineast
Manning Publications Co. http://www.manning.com/malak/ -- View this message in context: http://apache-spark-user-list.1001560.n3.nabble.com/SparkR-Graphx-Connected-components-tp24165p24166.html

java.nio.channels.CancelledKeyException in Graphx Connected Components

2014-08-18 Thread Jeffrey Picard
Hey all, I’m trying to run connected components in GraphX on about 400GB of data on 50 m3.xlarge nodes on EMR. I keep getting java.nio.channels.CancelledKeyException when it gets to “mapPartitions at VertexRDD.scala:347”. I haven’t been able to find much about this online, and nothing that

GraphX Connected Components

2014-07-29 Thread Jeffrey Picard
Hey all, I’m currently trying to run connected components using GraphX on a large graph (~1.8b vertices and ~3b edges, most of them self-edges, where the only edge that exists for vertex v is v-v) on EMR using 50 m3.xlarge nodes. As the program runs I’m seeing each iteration take longer and