in context:
http://apache-spark-user-list.1001560.n3.nabble.com/GraphX-Connected-Components-tp10869p28049.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
-
To unsubscribe e-mail: user-unsubscr
be needed. Arriving at the answer through experimentation
isn’t a good approach, because that assumes -- chicken-and-egg problem -- that
we have already arrived at an optimal configuration.
-- Does GraphX connected-components performance degrade slowly or
catastrophically when that memory limit
t 00:13, John Lilley <john.lil...@redpoint.net> wrote:
> Greetings,
>
> We are looking into using the GraphX connected-components algorithm on
> Hadoop for grouping operations. Our typical data is on the order of
> 50-200M vertices with an edge:vertex ratio between 2 and 30. Whi
Greetings,
We are looking into using the GraphX connected-components algorithm on Hadoop
for grouping operations. Our typical data is on the order of 50-200M vertices
with an edge:vertex ratio between 2 and 30. While there are pathological cases
of very large groups, they tend to be small. I
respectively.
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/SparkR-Graphx-Connected-components-tp24165p24209.html
Sent from the Apache Spark User List mailing list archive at Nabble.com
been 7,3
(5,3)-OK
(2,0)-OK
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/SparkR-Graphx-Connected-components-tp24165p24190.html
Sent from the Apache Spark User List mailing list archive at Nabble.com
on how to get
stronglyconnected nodes .Pls help in completing this code/
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/SparkR-Graphx-Connected-components-tp24165.html
Sent from the Apache Spark User List mailing list archive at Nabble.com
Manning Publications Co.
http://www.manning.com/malak/ http://www.manning.com/malak/
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/SparkR-Graphx-Connected-components-tp24165p24166.html
Sent from the Apache Spark User List mailing list archive at Nabble.com
Hey all,
I’m trying to run connected components in graphx on about 400GB of data on 50
m3.xlarge nodes on emr. I keep getting java.nio.channels.CancelledKeyException
when it gets to mapPartitions at VertexRDD.scala:347”. I haven’t been able to
find much about this online, and nothing that
Hey all,
I’m currently trying to run connected components using GraphX on a large graph
(~1.8b vertices and ~3b edges, most of them are self edges where the only edge
that exists for vertex v is v-v) on emr using 50 m3.xlarge nodes. As the
program runs I’m seeing each iteration take longer and
10 matches
Mail list logo