Bill Chambers created SPARK-14708: ------------------------------------- Summary: Repl Serialization Issue Key: SPARK-14708 URL: https://issues.apache.org/jira/browse/SPARK-14708 Project: Spark Issue Type: Bug Components: Spark Core Reporter: Bill Chambers Priority: Critical
Run this code 6 times with the :paste command in Spark. You'll see exponential slow downs. class IntWrapper(val i: Int) extends Serializable { } var pairs = sc.parallelize(Array((0, new IntWrapper(0)))) for (_ <- 0 until 3) { val wrapper = pairs.values.reduce((x,_) => x) pairs = pairs.mapValues(_ => wrapper) } val result = pairs.collect() https://forums.databricks.com/questions/7729/delays-when-running-program-multiple-times-in-note.html -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org