[jira] [Commented] (SPARK-1750) EdgePartition is not serialized properly
[ https://issues.apache.org/jira/browse/SPARK-1750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14008988#comment-14008988 ] Ankur Dave commented on SPARK-1750: --- Resolved in PR #742: https://github.com/apache/spark/pull/742 > EdgePartition is not serialized properly > > > Key: SPARK-1750 > URL: https://issues.apache.org/jira/browse/SPARK-1750 > Project: Spark > Issue Type: Bug > Components: GraphX >Affects Versions: 0.9.0, 1.0.0, 0.9.1 >Reporter: Ankur Dave > Fix For: 1.0.0 > > > The GraphX design attempts to avoid moving edges across the network, instead > shipping the vertices to the edge partitions. However, Spark sometimes needs > to move the edges, such as for straggler mitigation. > All EdgePartition fields are currently declared transient, so the edges will > not be serialized properly. Even if they are not marked transient, Kryo is > unable to serialize the EdgePartition, failing with the following error: > {code} > java.lang.IllegalArgumentException: Can not set final > org.apache.spark.graphx.util.collection.PrimitiveKeyOpenHashMap field > org.apache.spark.graphx.impl.EdgePartition.index to > scala.collection.immutable.$colon$colon > {code} > A workaround is to discourage Spark from moving the edges by setting > {{spark.locality.wait}} to a high value such as 10. -- This message was sent by Atlassian JIRA (v6.2#6252)
[jira] [Commented] (SPARK-1750) EdgePartition is not serialized properly
[ https://issues.apache.org/jira/browse/SPARK-1750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13994766#comment-13994766 ] Joseph E. Gonzalez commented on SPARK-1750: --- I believe this issue is resolved with PR #724. > EdgePartition is not serialized properly > > > Key: SPARK-1750 > URL: https://issues.apache.org/jira/browse/SPARK-1750 > Project: Spark > Issue Type: Bug > Components: GraphX >Affects Versions: 0.9.0, 1.0.0, 0.9.1 >Reporter: Ankur Dave > > The GraphX design attempts to avoid moving edges across the network, instead > shipping the vertices to the edge partitions. However, Spark sometimes needs > to move the edges, such as for straggler mitigation. > All EdgePartition fields are currently declared transient, so the edges will > not be serialized properly. Even if they are not marked transient, Kryo is > unable to serialize the EdgePartition, failing with the following error: > {code} > java.lang.IllegalArgumentException: Can not set final > org.apache.spark.graphx.util.collection.PrimitiveKeyOpenHashMap field > org.apache.spark.graphx.impl.EdgePartition.index to > scala.collection.immutable.$colon$colon > {code} > A workaround is to discourage Spark from moving the edges by setting > {{spark.locality.wait}} to a high value such as 10. -- This message was sent by Atlassian JIRA (v6.2#6252)