[GitHub] spark issue #16566: [SPARK-18821][SparkR]: Bisecting k-means wrapper in Spar...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16566 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71675/ Test FAILed. ---

[GitHub] spark issue #16533: [SPARK-19160][PYTHON][SQL][WIP] Add udf decorator

2017-01-19 Thread rdblue
Github user rdblue commented on the issue: https://github.com/apache/spark/pull/16533 Yeah, I added the `isinstance(f, DataType)` trick to my local tests. I'd add it if it were up to me, but I'm fine requiring `returnType` to be a keyword arg if you feel strongly about it.

[GitHub] spark issue #16534: [SPARK-19161][PYTHON][SQL] Improving UDF Docstrings

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16534 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16534: [SPARK-19161][PYTHON][SQL] Improving UDF Docstrings

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16534 **[Test build #71680 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71680/testReport)** for PR 16534 at commit

[GitHub] spark issue #16534: [SPARK-19161][PYTHON][SQL] Improving UDF Docstrings

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16534 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71680/ Test FAILed. ---

[GitHub] spark issue #16534: [SPARK-19161][PYTHON][SQL] Improving UDF Docstrings

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16534 **[Test build #71680 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71680/testReport)** for PR 16534 at commit

[GitHub] spark issue #16536: [SPARK-19163][PYTHON][SQL] Delay _judf initialization to...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16536 **[Test build #71679 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71679/testReport)** for PR 16536 at commit

[GitHub] spark pull request #15467: [SPARK-17912][SQL] Refactor code generation to ge...

2017-01-19 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/15467 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #15467: [SPARK-17912][SQL] Refactor code generation to get data ...

2017-01-19 Thread davies
Github user davies commented on the issue: https://github.com/apache/spark/pull/15467 Merging this into master, thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #16533: [SPARK-19160][PYTHON][SQL][WIP] Add udf decorator

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16533 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16533: [SPARK-19160][PYTHON][SQL][WIP] Add udf decorator

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16533 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71678/ Test FAILed. ---

[GitHub] spark issue #16533: [SPARK-19160][PYTHON][SQL][WIP] Add udf decorator

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16533 **[Test build #71678 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71678/testReport)** for PR 16533 at commit

[GitHub] spark issue #16537: [SPARK-19165][PYTHON][SQL][WIP] UserDefinedFunction.__ca...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16537 **[Test build #71676 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71676/testReport)** for PR 16537 at commit

[GitHub] spark issue #12004: [SPARK-7481] [build] Add spark-cloud module to pull in o...

2017-01-19 Thread nchammas
Github user nchammas commented on the issue: https://github.com/apache/spark/pull/12004 > Does a build of Spark + Hadoop 2.7 right now have no ability at all to read from S3 out of the box, or just not full / ideal support? No ability at all, as far as I can tell. People

[GitHub] spark issue #16533: [SPARK-19160][PYTHON][SQL][WIP] Add udf decorator

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16533 **[Test build #71678 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71678/testReport)** for PR 16533 at commit

[GitHub] spark issue #16535: [SPARK-19162][PYTHON][SQL] UserDefinedFunction should va...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16535 **[Test build #71677 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71677/testReport)** for PR 16535 at commit

[GitHub] spark issue #16533: [SPARK-19160][PYTHON][SQL][WIP] Add udf decorator

2017-01-19 Thread zero323
Github user zero323 commented on the issue: https://github.com/apache/spark/pull/16533 @rdblue Good question. I vaguely remember I had some motivation to avoid this but now I cannot recall why. In general I really liked the initial approach because it allowed us to write:

[GitHub] spark pull request #16645: [SPARK-19290][SQL] add a new extending interface ...

2017-01-19 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/16645#discussion_r96976865 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveSessionState.scala --- @@ -62,15 +62,17 @@ private[hive] class

[GitHub] spark pull request #16645: [SPARK-19290][SQL] add a new extending interface ...

2017-01-19 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/16645#discussion_r96976789 --- Diff: sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveSessionState.scala --- @@ -62,15 +62,17 @@ private[hive] class

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16639 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71673/ Test PASSed. ---

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16639 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16643: [SPARK-17724][Streaming][WebUI] Unevaluated new lines in...

2017-01-19 Thread ajbozarth
Github user ajbozarth commented on the issue: https://github.com/apache/spark/pull/16643 I checked this out and tested it and it runs well, code LGTM as well, but I agree that the unrelated changes should be removed. --- If your project is set up for it, you can reply to this email

[GitHub] spark pull request #16649: [SPARK-19295] [SQL] IsolatedClientLoader's downlo...

2017-01-19 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/16649 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16639 **[Test build #71673 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71673/testReport)** for PR 16639 at commit

[GitHub] spark issue #16566: [SPARK-18821][SparkR]: Bisecting k-means wrapper in Spar...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16566 **[Test build #71675 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71675/testReport)** for PR 16566 at commit

[GitHub] spark issue #16649: [SPARK-19295] [SQL] IsolatedClientLoader's downloadVersi...

2017-01-19 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/16649 Cool I am merging this to master. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #16566: [SPARK-18821][SparkR]: Bisecting k-means wrapper ...

2017-01-19 Thread wangmiao1981
Github user wangmiao1981 commented on a diff in the pull request: https://github.com/apache/spark/pull/16566#discussion_r96971840 --- Diff: mllib/src/main/scala/org/apache/spark/ml/r/BisectingKMeansWrapper.scala --- @@ -0,0 +1,141 @@ +/* + * Licensed to the Apache Software

[GitHub] spark issue #16649: [SPARK-19295] [SQL] IsolatedClientLoader's downloadVersi...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16649 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71674/ Test PASSed. ---

[GitHub] spark pull request #16566: [SPARK-18821][SparkR]: Bisecting k-means wrapper ...

2017-01-19 Thread wangmiao1981
Github user wangmiao1981 commented on a diff in the pull request: https://github.com/apache/spark/pull/16566#discussion_r96971280 --- Diff: R/pkg/R/mllib_clustering.R --- @@ -38,6 +45,146 @@ setClass("KMeansModel", representation(jobj = "jobj")) #' @note LDAModel since 2.1.0

[GitHub] spark issue #16649: [SPARK-19295] [SQL] IsolatedClientLoader's downloadVersi...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16649 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16649: [SPARK-19295] [SQL] IsolatedClientLoader's downloadVersi...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16649 **[Test build #71674 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71674/testReport)** for PR 16649 at commit

[GitHub] spark issue #16649: [SPARK-19295] [SQL] IsolatedClientLoader's downloadVersi...

2017-01-19 Thread liancheng
Github user liancheng commented on the issue: https://github.com/apache/spark/pull/16649 LGTM pending Jenkins. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #16645: [SPARK-19290][SQL] add a new extending interface in Anal...

2017-01-19 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/16645 My main concern of this pr is that if people will think it is recommended to add new batches to force those rules running in a certain ordering. For these resolution rules, we can also use conditions

[GitHub] spark issue #16473: [SPARK-19069] [CORE] Expose task 'status' and 'duration'...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16473 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16473: [SPARK-19069] [CORE] Expose task 'status' and 'duration'...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16473 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71672/ Test PASSed. ---

[GitHub] spark issue #16473: [SPARK-19069] [CORE] Expose task 'status' and 'duration'...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16473 **[Test build #71672 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71672/testReport)** for PR 16473 at commit

[GitHub] spark issue #16649: [SPARK-19295] [SQL] IsolatedClientLoader's downloadVersi...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16649 **[Test build #71674 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71674/testReport)** for PR 16649 at commit

[GitHub] spark pull request #16649: [SPARK-19295] [SQL] IsolatedClientLoader's downlo...

2017-01-19 Thread yhuai
GitHub user yhuai opened a pull request: https://github.com/apache/spark/pull/16649 [SPARK-19295] [SQL] IsolatedClientLoader's downloadVersion should log the location of downloaded metastore client jars ## What changes were proposed in this pull request? This will help the

[GitHub] spark issue #16581: [SPARK-18589] [SQL] Fix Python UDF accessing attributes ...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16581 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71671/ Test FAILed. ---

[GitHub] spark issue #16581: [SPARK-18589] [SQL] Fix Python UDF accessing attributes ...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16581 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16581: [SPARK-18589] [SQL] Fix Python UDF accessing attributes ...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16581 **[Test build #71671 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71671/testReport)** for PR 16581 at commit

[GitHub] spark issue #16557: [SPARK-18693][ML][MLLIB] ML Evaluators should use weight...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16557 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16557: [SPARK-18693][ML][MLLIB] ML Evaluators should use weight...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16557 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71670/ Test PASSed. ---

[GitHub] spark issue #16581: [SPARK-18589] [SQL] Fix Python UDF accessing attributes ...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16581 **[Test build #3542 has finished](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/3542/testReport)** for PR 16581 at commit

[GitHub] spark issue #16557: [SPARK-18693][ML][MLLIB] ML Evaluators should use weight...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16557 **[Test build #71670 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71670/testReport)** for PR 16557 at commit

[GitHub] spark issue #16473: [SPARK-19069] [CORE] Expose task 'status' and 'duration'...

2017-01-19 Thread squito
Github user squito commented on the issue: https://github.com/apache/spark/pull/16473 ah, sorry I didn't notice that before my comment. well looks good now, I'll merge once tests pass. --- If your project is set up for it, you can reply to this email and have your reply appear on

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16639 **[Test build #71673 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71673/testReport)** for PR 16639 at commit

[GitHub] spark pull request #16645: [SPARK-19290][SQL] add a new extending interface ...

2017-01-19 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/16645#discussion_r96941832 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -106,6 +106,13 @@ class Analyzer( */

[GitHub] spark issue #16473: [SPARK-19069] [CORE] Expose task 'status' and 'duration'...

2017-01-19 Thread paragpc
Github user paragpc commented on the issue: https://github.com/apache/spark/pull/16473 Thanks @squito. Actually I have resolved the merge conflicts and started a new test build about half an hour ago. Currently, I don't see any merge conflicts. --- If your project is set up for it,

[GitHub] spark pull request #11867: [SPARK-14049] [CORE] Add functionality in spark h...

2017-01-19 Thread squito
Github user squito commented on a diff in the pull request: https://github.com/apache/spark/pull/11867#discussion_r96939526 --- Diff: core/src/main/scala/org/apache/spark/status/api/v1/ApplicationListResource.scala --- @@ -43,11 +45,24 @@ private[v1] class

[GitHub] spark issue #16533: [SPARK-19160][PYTHON][SQL][WIP] Add udf decorator

2017-01-19 Thread rdblue
Github user rdblue commented on the issue: https://github.com/apache/spark/pull/16533 @zero323, I think that the decorator and existing UDF factory method should be the same, but that we can't break existing code. Can you explain why this necessarily breaks code that relies on

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16639 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71668/ Test FAILed. ---

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16639 Build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16639 **[Test build #71668 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71668/testReport)** for PR 16639 at commit

[GitHub] spark issue #16473: [SPARK-19069] [CORE] Expose task 'status' and 'duration'...

2017-01-19 Thread squito
Github user squito commented on the issue: https://github.com/apache/spark/pull/16473 thanks @paragpc . lgtm, but now there are merge conflicts. Can you merge with the latest master? You might need to regenerate those test expectation files. --- If your project is set up for it,

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16639 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16639 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71669/ Test FAILed. ---

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16639 **[Test build #71669 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71669/testReport)** for PR 16639 at commit

[GitHub] spark issue #16473: [SPARK-19069] [CORE] Expose task 'status' and 'duration'...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16473 **[Test build #71672 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71672/testReport)** for PR 16473 at commit

[GitHub] spark issue #16607: [SPARK-19247][ML] Save large word2vec models

2017-01-19 Thread Krimit
Github user Krimit commented on the issue: https://github.com/apache/spark/pull/16607 @srowen @jkbradley updated with comments. I used the spark version to sniff the version as suggested by @jkbradley, although I'm happy to continue the conversation about the best way to handle

[GitHub] spark issue #14547: [SPARK-16718][MLlib] gbm-style treeboost

2017-01-19 Thread jkbradley
Github user jkbradley commented on the issue: https://github.com/apache/spark/pull/14547 I'd recommend overriding setImpurity in the relevant concrete classes. In those, you can add warnings in the Scala doc and also add logWarning messages about deprecation. That's almost as good

[GitHub] spark issue #15730: [SPARK-18218][ML][MLLib] Reduce shuffled data size of Bl...

2017-01-19 Thread jkbradley
Github user jkbradley commented on the issue: https://github.com/apache/spark/pull/15730 The API looks good to me. I have not reviewed the internals carefully. One comment: Let's add a check to verify that numMidDimSplits is > 0. --- If your project is set up for it, you

[GitHub] spark issue #7379: [SPARK-8682][SQL][WIP] Range Join

2017-01-19 Thread saj1th
Github user saj1th commented on the issue: https://github.com/apache/spark/pull/7379 Facing huge performance issues with range joins. Hoping to see this implemented --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If

[GitHub] spark issue #16581: [SPARK-18589] [SQL] Fix Python UDF accessing attributes ...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16581 **[Test build #71671 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71671/testReport)** for PR 16581 at commit

[GitHub] spark issue #16417: [SPARK-19014][SQL] support complex aggregate buffer in H...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16417 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71667/ Test FAILed. ---

[GitHub] spark issue #16417: [SPARK-19014][SQL] support complex aggregate buffer in H...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16417 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16417: [SPARK-19014][SQL] support complex aggregate buffer in H...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16417 **[Test build #71667 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71667/testReport)** for PR 16417 at commit

[GitHub] spark issue #16647: [SPARK-19292][SQL] filter with partition columns should ...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16647 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16647: [SPARK-19292][SQL] filter with partition columns should ...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16647 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71666/ Test PASSed. ---

[GitHub] spark issue #16647: [SPARK-19292][SQL] filter with partition columns should ...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16647 **[Test build #71666 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71666/testReport)** for PR 16647 at commit

[GitHub] spark issue #15192: [SPARK-14536] [SQL] fix to handle null value in array ty...

2017-01-19 Thread sureshthalamati
Github user sureshthalamati commented on the issue: https://github.com/apache/spark/pull/15192 @gatorsmile Verified on the master , problem still exist. Resolved the conflicts, when u get a chance can you please review. Error stack without the fix

[GitHub] spark issue #16557: [SPARK-18693][ML][MLLIB] ML Evaluators should use weight...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16557 **[Test build #71670 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71670/testReport)** for PR 16557 at commit

[GitHub] spark issue #16344: [SPARK-18929][ML] Add Tweedie distribution in GLM

2017-01-19 Thread actuaryzhang
Github user actuaryzhang commented on the issue: https://github.com/apache/spark/pull/16344 Could anybody help me understand what's causing this test to fail? I see several other ML PR failing as well, with the same error message like below: > Error instrumenting

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16639 **[Test build #71669 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71669/testReport)** for PR 16639 at commit

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16639 **[Test build #71668 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71668/testReport)** for PR 16639 at commit

[GitHub] spark issue #16639: [SPARK-19276][CORE] Fetch Failure handling robust to use...

2017-01-19 Thread squito
Github user squito commented on the issue: https://github.com/apache/spark/pull/16639 thanks for the feedback @mridulm , all good points. I pushed an update to address some of the points, also have some follow up discussion --- If your project is set up for it, you can reply to

[GitHub] spark pull request #16646: [SPARK-19291][SPARKR][ML] spark.gaussianMixture s...

2017-01-19 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/16646#discussion_r96910742 --- Diff: mllib/src/main/scala/org/apache/spark/ml/r/GaussianMixtureWrapper.scala --- @@ -91,7 +92,10 @@ private[r] object GaussianMixtureWrapper

[GitHub] spark pull request #16639: [SPARK-19276][CORE] Fetch Failure handling robust...

2017-01-19 Thread squito
Github user squito commented on a diff in the pull request: https://github.com/apache/spark/pull/16639#discussion_r96910697 --- Diff: core/src/main/scala/org/apache/spark/executor/Executor.scala --- @@ -405,6 +415,13 @@ private[spark] class Executor(

[GitHub] spark pull request #16639: [SPARK-19276][CORE] Fetch Failure handling robust...

2017-01-19 Thread squito
Github user squito commented on a diff in the pull request: https://github.com/apache/spark/pull/16639#discussion_r96910555 --- Diff: core/src/main/scala/org/apache/spark/executor/Executor.scala --- @@ -405,6 +415,13 @@ private[spark] class Executor(

[GitHub] spark pull request #16646: [SPARK-19291][SPARKR][ML] spark.gaussianMixture s...

2017-01-19 Thread felixcheung
Github user felixcheung commented on a diff in the pull request: https://github.com/apache/spark/pull/16646#discussion_r96910423 --- Diff: mllib/src/main/scala/org/apache/spark/ml/r/GaussianMixtureWrapper.scala --- @@ -91,7 +92,10 @@ private[r] object GaussianMixtureWrapper

[GitHub] spark pull request #16639: [SPARK-19276][CORE] Fetch Failure handling robust...

2017-01-19 Thread squito
Github user squito commented on a diff in the pull request: https://github.com/apache/spark/pull/16639#discussion_r96909723 --- Diff: core/src/main/scala/org/apache/spark/shuffle/FetchFailedException.scala --- @@ -45,6 +45,12 @@ private[spark] class FetchFailedException(

[GitHub] spark issue #16645: [SPARK-19290][SQL] add a new extending interface in Anal...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16645 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71663/ Test PASSed. ---

[GitHub] spark issue #16645: [SPARK-19290][SQL] add a new extending interface in Anal...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16645 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16645: [SPARK-19290][SQL] add a new extending interface in Anal...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16645 **[Test build #71663 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71663/testReport)** for PR 16645 at commit

[GitHub] spark issue #16630: [SPARK-19270][ML] Add summary table to GLM summary

2017-01-19 Thread actuaryzhang
Github user actuaryzhang commented on the issue: https://github.com/apache/spark/pull/16630 Jenkins, test this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark pull request #16643: [SPARK-17724][Streaming][WebUI] Unevaluated new l...

2017-01-19 Thread keypointt
Github user keypointt commented on a diff in the pull request: https://github.com/apache/spark/pull/16643#discussion_r96896268 --- Diff: core/src/main/scala/org/apache/spark/ui/jobs/JobProgressListener.scala --- @@ -19,7 +19,7 @@ package org.apache.spark.ui.jobs import

[GitHub] spark issue #16593: [SPARK-19153][SQL]DataFrameWriter.saveAsTable work with ...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16593 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16593: [SPARK-19153][SQL]DataFrameWriter.saveAsTable work with ...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16593 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71664/ Test PASSed. ---

[GitHub] spark issue #16593: [SPARK-19153][SQL]DataFrameWriter.saveAsTable work with ...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16593 **[Test build #71664 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71664/testReport)** for PR 16593 at commit

[GitHub] spark issue #16417: [SPARK-19014][SQL] support complex aggregate buffer in H...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16417 **[Test build #71667 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71667/testReport)** for PR 16417 at commit

[GitHub] spark issue #16648: [SPARK-18016][SQL][CATALYST] Code Generation: Constant P...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16648 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request #16648: [SPARK-18016][SQL][CATALYST] Code Generation: Con...

2017-01-19 Thread bdrillard
GitHub user bdrillard opened a pull request: https://github.com/apache/spark/pull/16648 [SPARK-18016][SQL][CATALYST] Code Generation: Constant Pool Limit [class_splitting] increasing stack size for Catalyst tests ## What changes were proposed in this pull request?

[GitHub] spark issue #15505: [SPARK-18890][CORE] Move task serialization from the Tas...

2017-01-19 Thread squito
Github user squito commented on the issue: https://github.com/apache/spark/pull/15505 This might be a really dumb question, but please humor me -- can you explain to me the threads involved, both before and after this change? I must be missing something, because it seems to me it

[GitHub] spark issue #16417: [SPARK-19014][SQL] support complex aggregate buffer in H...

2017-01-19 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/16417 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #16646: [SPARK-19291][SPARKR][ML] spark.gaussianMixture supports...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16646 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16646: [SPARK-19291][SPARKR][ML] spark.gaussianMixture supports...

2017-01-19 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/16646 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/71665/ Test PASSed. ---

[GitHub] spark issue #16646: [SPARK-19291][SPARKR][ML] spark.gaussianMixture supports...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16646 **[Test build #71665 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71665/testReport)** for PR 16646 at commit

[GitHub] spark pull request #15505: [SPARK-18890][CORE] Move task serialization from ...

2017-01-19 Thread squito
Github user squito commented on a diff in the pull request: https://github.com/apache/spark/pull/15505#discussion_r96889501 --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala --- @@ -602,6 +619,20 @@ class

[GitHub] spark issue #16647: [SPARK-19292][SQL] filter with partition columns should ...

2017-01-19 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16647 **[Test build #71666 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/71666/testReport)** for PR 16647 at commit

[GitHub] spark issue #16647: [SPARK-19292][SQL] filter with partition columns should ...

2017-01-19 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/16647 cc @yhuai @gatorsmile @ericl --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

<    1   2   3   4   5   >