Sean Zhong created SPARK-17306: ---------------------------------- Summary: Memory leak in QuantileSummaries Key: SPARK-17306 URL: https://issues.apache.org/jira/browse/SPARK-17306 Project: Spark Issue Type: Bug Reporter: Sean Zhong
compressThreshold was not referenced anywhere {code} class QuantileSummaries( val compressThreshold: Int, val relativeError: Double, val sampled: ArrayBuffer[Stats] = ArrayBuffer.empty, private[stat] var count: Long = 0L, val headSampled: ArrayBuffer[Double] = ArrayBuffer.empty) extends Serializable {code} And, it causes memory leak, QuantileSummaries takes unbounded memory {code} val summary = new QuantileSummaries(10000, relativeError = 0.001) // Results in creating an array of size 100000000 !!! (1 to 100000000).foreach(summary.insert(_)) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org