[GitHub] spark pull request #21133: [SPARK-24013][SQL] Remove unneeded compress in Ap...

mgaido91 Fri, 27 Apr 2018 03:43:35 -0700

Github user mgaido91 commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21133#discussion_r184652876
  
    --- Diff: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/ApproximatePercentile.scala
 ---
    @@ -238,12 +238,6 @@ object ApproximatePercentile {
           summaries = summaries.insert(value)
           // The result of QuantileSummaries.insert is un-compressed
           isCompressed = false
    -
    -      // Currently, QuantileSummaries ignores the construction parameter 
compressThresHold,
    -      // which may cause QuantileSummaries to occupy unbounded memory. We 
have to hack around here
    -      // to make sure QuantileSummaries doesn't occupy infinite memory.
    -      // TODO: Figure out why QuantileSummaries ignores construction 
parameter compressThresHold
    -      if (summaries.sampled.length >= compressThresHoldBufferLength) 
compress()
    --- End diff --
    
    Yes, the TODO was resolved in SPARK-17439. I thought I clearly stated it in 
the description, but if this is not the case or you have any suggestion about 
how to improve the description, I am happy to improve it.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request #21133: [SPARK-24013][SQL] Remove unneeded compress in Ap...

Reply via email to