spark git commit: [SPARK-22957] ApproxQuantile breaks if the number of rows exceeds MaxInt

2018-01-04 Thread wenchen
Repository: spark Updated Branches: refs/heads/branch-2.3 ea9da6152 -> 158f7e6a9 [SPARK-22957] ApproxQuantile breaks if the number of rows exceeds MaxInt ## What changes were proposed in this pull request? 32bit Int was used for row rank. That overflowed in a dataframe with more than 2B

spark git commit: [SPARK-22957] ApproxQuantile breaks if the number of rows exceeds MaxInt

2018-01-04 Thread wenchen
Repository: spark Updated Branches: refs/heads/master 0428368c2 -> df7fc3ef3 [SPARK-22957] ApproxQuantile breaks if the number of rows exceeds MaxInt ## What changes were proposed in this pull request? 32bit Int was used for row rank. That overflowed in a dataframe with more than 2B rows.