Reynold Xin created SPARK-2758: ---------------------------------- Summary: UnionRDD's UnionPartition should not reference parent RDDs Key: SPARK-2758 URL: https://issues.apache.org/jira/browse/SPARK-2758 Project: Spark Issue Type: Improvement Affects Versions: 1.0.0, 1.0.1, 1.0.2 Reporter: Reynold Xin Assignee: Reynold Xin
UnionPartition has a non-transient field referencing the parent RDD, to be used in compute (iterator). That causes some trouble with task size because partition objects are supposed to be small. -- This message was sent by Atlassian JIRA (v6.2#6252)