[GitHub] spark issue #15136: [SPARK-17581] [SQL] Invalidate Statistics After Some ALT...

2016-09-18 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15136 **[Test build #65554 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65554/consoleFull)** for PR 15136 at commit

[GitHub] spark pull request #15136: [SPARK-17581] [SQL] Invalidate Statistics After S...

2016-09-18 Thread gatorsmile
GitHub user gatorsmile opened a pull request: https://github.com/apache/spark/pull/15136 [SPARK-17581] [SQL] Invalidate Statistics After Some ALTER TABLE Commands ### What changes were proposed in this pull request? In the recent statistics-related work, our focus is on how to

[GitHub] spark issue #14971: [SPARK-17410] [SPARK-17284] Move Hive-generated Stats In...

2016-09-18 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14971 **[Test build #65553 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/65553/consoleFull)** for PR 14971 at commit

[GitHub] spark issue #14971: [SPARK-17410] [SPARK-17284] Move Hive-generated Stats In...

2016-09-18 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/14971 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark pull request #15131: [SPARK-17577][SparkR] SparkR support add files to...

2016-09-18 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/15131#discussion_r79298408 --- Diff: R/pkg/R/context.R --- @@ -225,6 +225,37 @@ setCheckpointDir <- function(sc, dirName) { invisible(callJMethod(sc, "setCheckpointDir",

[GitHub] spark pull request #15131: [SPARK-17577][SparkR] SparkR support add files to...

2016-09-18 Thread HyukjinKwon
Github user HyukjinKwon commented on a diff in the pull request: https://github.com/apache/spark/pull/15131#discussion_r79298358 --- Diff: R/pkg/R/context.R --- @@ -225,6 +225,37 @@ setCheckpointDir <- function(sc, dirName) { invisible(callJMethod(sc, "setCheckpointDir",

[GitHub] spark issue #15131: [SPARK-17577][SparkR] SparkR support add files to Spark ...

2016-09-18 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/15131 I just took a look. The problematic code is here, [SparkContext.scala#L1429](https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/SparkContext.scala#L1429).

[GitHub] spark issue #15135: [pyspark][group]pyspark GroupedData can't apply agg func...

2016-09-18 Thread petermaxlee
Github user petermaxlee commented on the issue: https://github.com/apache/spark/pull/15135 Isn't it as simple as ``` cols = [x for x in df.columns if x != "key] df.groupby("key").agg([F.min(x) for x in cols] + [F.max(x) for x in cols]) ``` --- If your project is set

[GitHub] spark issue #15135: [pyspark][group]pyspark GroupedData can't apply agg func...

2016-09-18 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15135 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request #15135: [pyspark][group]pyspark GroupedData can't apply a...

2016-09-18 Thread citoubest
GitHub user citoubest opened a pull request: https://github.com/apache/spark/pull/15135 [pyspark][group]pyspark GroupedData can't apply agg functions on all left numeric columns. ## What changes were proposed in this pull request? With pyspark dataframe, the agg method just

<    1   2   3   4