[GitHub] spark pull request #17367: [MINOR][PYSPARK] Remove _inferSchema in context.p...

2017-03-20 Thread zjffdu
GitHub user zjffdu opened a pull request: https://github.com/apache/spark/pull/17367 [MINOR][PYSPARK] Remove _inferSchema in context.py ## What changes were proposed in this pull request? _inferSchema is not used in context.py, all the things have been moved to

[GitHub] spark issue #17347: [SPARK-19980][SQL] Add NULL checks in Bean serializer

2017-03-20 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/17347 okay! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark issue #17014: [SPARK-18608][ML] Fix double-caching in ML algorithms

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17014 **[Test build #74932 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74932/testReport)** for PR 17014 at commit

[GitHub] spark issue #17347: [SPARK-19980][SQL] Add NULL checks in Bean serializer

2017-03-20 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/17347 can you send a new PR for branch 2.1? thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #17293: [SPARK-19950][SQL] Fix to ignore nullable when df.load()...

2017-03-20 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/17293 Uh.. I am actually not sure if we want the non-nullability per https://github.com/apache/spark/pull/14124#issuecomment-259615392. I am willing to help test and verify if it is okay for

[GitHub] spark pull request #17191: [SPARK-14471][SQL] Aliases in SELECT could be use...

2017-03-20 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/17191#discussion_r107066194 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -836,17 +836,30 @@ class Analyzer(

[GitHub] spark issue #17136: [SPARK-19783][SQL] Treat longer lengths of tokens as mal...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17136 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74924/ Test FAILed. ---

[GitHub] spark issue #17136: [SPARK-19783][SQL] Treat longer lengths of tokens as mal...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17136 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17136: [SPARK-19783][SQL] Treat longer lengths of tokens as mal...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17136 **[Test build #74924 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74924/testReport)** for PR 17136 at commit

[GitHub] spark issue #17347: [SPARK-19980][SQL] Add NULL checks in Bean serializer

2017-03-20 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/17347 thanks, merging to master! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark pull request #17347: [SPARK-19980][SQL] Add NULL checks in Bean serial...

2017-03-20 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/17347 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #17276: [SPARK-19937] Collect metrics of block sizes when shuffl...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17276 **[Test build #74931 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74931/testReport)** for PR 17276 at commit

[GitHub] spark issue #17312: [SPARK-19973] Display num of executors for the stage.

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17312 **[Test build #74930 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74930/testReport)** for PR 17312 at commit

[GitHub] spark issue #17276: [SPARK-19937] Collect metrics of block sizes when shuffl...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17276 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74917/ Test FAILed. ---

[GitHub] spark issue #17276: [SPARK-19937] Collect metrics of block sizes when shuffl...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17276 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17276: [SPARK-19937] Collect metrics of block sizes when shuffl...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17276 **[Test build #74917 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74917/testReport)** for PR 17276 at commit

[GitHub] spark issue #17014: [SPARK-18608][ML] Fix double-caching in ML algorithms

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17014 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74925/ Test PASSed. ---

[GitHub] spark issue #17014: [SPARK-18608][ML] Fix double-caching in ML algorithms

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17014 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #17342: [SPARK-12868][SQL] Allow adding jars from hdfs

2017-03-20 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/17342#discussion_r107064584 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/internal/SharedState.scala --- @@ -148,6 +149,8 @@ private[sql] class SharedState(val

[GitHub] spark issue #17014: [SPARK-18608][ML] Fix double-caching in ML algorithms

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17014 **[Test build #74925 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74925/testReport)** for PR 17014 at commit

[GitHub] spark pull request #17342: [SPARK-12868][SQL] Allow adding jars from hdfs

2017-03-20 Thread jerryshao
Github user jerryshao commented on a diff in the pull request: https://github.com/apache/spark/pull/17342#discussion_r107064456 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -2767,3 +2767,24 @@ private[spark] class CircularBuffer(sizeInBytes: Int = 10240)

[GitHub] spark pull request #17342: [SPARK-12868][SQL] Allow adding jars from hdfs

2017-03-20 Thread weiqingy
Github user weiqingy commented on a diff in the pull request: https://github.com/apache/spark/pull/17342#discussion_r107063651 --- Diff: core/src/main/scala/org/apache/spark/util/Utils.scala --- @@ -2767,3 +2767,24 @@ private[spark] class CircularBuffer(sizeInBytes: Int = 10240)

[GitHub] spark pull request #16626: [SPARK-19261][SQL] Alter add columns for Hive ser...

2017-03-20 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/16626#discussion_r107063564 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalogSuite.scala --- @@ -455,8 +455,10 @@ abstract class

[GitHub] spark pull request #16626: [SPARK-19261][SQL] Alter add columns for Hive ser...

2017-03-20 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/16626#discussion_r107063558 --- Diff: sql/catalyst/src/test/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalogSuite.scala --- @@ -455,8 +455,10 @@ abstract class

[GitHub] spark pull request #16626: [SPARK-19261][SQL] Alter add columns for Hive ser...

2017-03-20 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/16626#discussion_r107063379 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalog.scala --- @@ -343,15 +343,11 @@ class SessionCatalog(

[GitHub] spark issue #17014: [SPARK-18608][ML] Fix double-caching in ML algorithms

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17014 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74921/ Test PASSed. ---

[GitHub] spark issue #17014: [SPARK-18608][ML] Fix double-caching in ML algorithms

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17014 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #16626: [SPARK-19261][SQL] Alter add columns for Hive serde and ...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16626 **[Test build #74929 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74929/testReport)** for PR 16626 at commit

[GitHub] spark issue #17014: [SPARK-18608][ML] Fix double-caching in ML algorithms

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17014 **[Test build #74921 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74921/testReport)** for PR 17014 at commit

[GitHub] spark pull request #16626: [SPARK-19261][SQL] Alter add columns for Hive ser...

2017-03-20 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/16626#discussion_r107063198 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalog.scala --- @@ -296,6 +311,47 @@ class SessionCatalog(

[GitHub] spark issue #16626: [SPARK-19261][SQL] Alter add columns for Hive serde and ...

2017-03-20 Thread xwu0226
Github user xwu0226 commented on the issue: https://github.com/apache/spark/pull/16626 @cloud-fan @gatorsmile Thanks again! I updated the code based @cloud-fan 's review. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well.

[GitHub] spark pull request #17302: [SPARK-19959][SQL] Fix to throw NullPointerExcept...

2017-03-20 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/17302#discussion_r107062991 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/ExistingRDD.scala --- @@ -70,7 +70,20 @@ object RDDConversions { object ExternalRDD

[GitHub] spark issue #17312: [SPARK-19973] Display num of executors for the stage.

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17312 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17312: [SPARK-19973] Display num of executors for the stage.

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17312 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74916/ Test FAILed. ---

[GitHub] spark issue #17312: [SPARK-19973] Display num of executors for the stage.

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17312 **[Test build #74916 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74916/testReport)** for PR 17312 at commit

[GitHub] spark pull request #17339: [SPARK-20010][SQL] Sort information is lost after...

2017-03-20 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/17339 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #17339: [SPARK-20010][SQL] Sort information is lost after sort m...

2017-03-20 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/17339 thanks, merging to master! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #17353: [SPARK-17080][SQL][FOLLOWUP] Improve documentation and n...

2017-03-20 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/17353 Yeah. The counts can help us understand the pruning rate of the search space. When CBO join reordering is very slow, we can check the counts. --- If your project is set up for it, you can reply

[GitHub] spark issue #14038: [SPARK-16317][SQL] Add a new interface to filter files i...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14038 **[Test build #74927 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74927/testReport)** for PR 14038 at commit

[GitHub] spark issue #14038: [SPARK-16317][SQL] Add a new interface to filter files i...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14038 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74927/ Test FAILed. ---

[GitHub] spark issue #14038: [SPARK-16317][SQL] Add a new interface to filter files i...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14038 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17354: [SPARK-20024] [SQL] [test-maven] SessionCatalog reset ne...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17354 **[Test build #74928 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74928/testReport)** for PR 17354 at commit

[GitHub] spark issue #17354: [SPARK-20024] [SQL] [test-maven] SessionCatalog reset ne...

2017-03-20 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/17354 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #17354: [SPARK-20024] [SQL] [test-maven] SessionCatalog reset ne...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17354 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17354: [SPARK-20024] [SQL] [test-maven] SessionCatalog reset ne...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17354 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74915/ Test FAILed. ---

[GitHub] spark issue #17354: [SPARK-20024] [SQL] [test-maven] SessionCatalog reset ne...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17354 **[Test build #74915 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74915/testReport)** for PR 17354 at commit

[GitHub] spark issue #17312: [SPARK-19973] Display num of executors for the stage.

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17312 **[Test build #74926 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74926/testReport)** for PR 17312 at commit

[GitHub] spark issue #14038: [SPARK-16317][SQL] Add a new interface to filter files i...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14038 **[Test build #74927 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74927/testReport)** for PR 14038 at commit

[GitHub] spark pull request #17314: [SPARK-15790][MLlib] Audit @Since annotations in ...

2017-03-20 Thread ehsun7b
Github user ehsun7b commented on a diff in the pull request: https://github.com/apache/spark/pull/17314#discussion_r107061443 --- Diff: mllib/src/main/scala/org/apache/spark/ml/r/AFTSurvivalRegressionWrapper.scala --- @@ -30,6 +32,7 @@ import

[GitHub] spark issue #17334: [SPARK-19998][Block Manager]BlockRDD block not found Exc...

2017-03-20 Thread jerryshao
Github user jerryshao commented on the issue: https://github.com/apache/spark/pull/17334 "Change the exception log to add RDD id of the related the block". --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark issue #17312: [SPARK-19973] Display num of executors for the stage.

2017-03-20 Thread jinxing64
Github user jinxing64 commented on the issue: https://github.com/apache/spark/pull/17312 Sure, that would be cool :) Thanks again you can help review this. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark issue #17306: [SPARK-19979][MLLIB] Allow multiple pipelines when tunin...

2017-03-20 Thread leifker
Github user leifker commented on the issue: https://github.com/apache/spark/pull/17306 Interesting, let me think about this a bit. I think that there is probably a better api around this approach for sure. --- If your project is set up for it, you can reply to this email and have

[GitHub] spark issue #17352: [SPARK-20021][PySpark] Miss backslash in python code

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17352 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74923/ Test PASSed. ---

[GitHub] spark issue #17313: [SPARK-19974][Block Manager] in-memory LRU for partition...

2017-03-20 Thread jianran
Github user jianran commented on the issue: https://github.com/apache/spark/pull/17313 do you agree remove the comment is more appreciable? can i only remove the comment? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well.

[GitHub] spark issue #17352: [SPARK-20021][PySpark] Miss backslash in python code

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17352 **[Test build #74923 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74923/testReport)** for PR 17352 at commit

[GitHub] spark issue #17352: [SPARK-20021][PySpark] Miss backslash in python code

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17352 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17352: [SPARK-20021][PySpark] Miss backslash in python code

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17352 **[Test build #74923 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74923/testReport)** for PR 17352 at commit

[GitHub] spark issue #16605: [SPARK-18884][SQL] Throw an exception in compile time if...

2017-03-20 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/16605 @cloud-fan ping --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or

[GitHub] spark issue #17014: [SPARK-18608][ML] Fix double-caching in ML algorithms

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17014 **[Test build #74925 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74925/testReport)** for PR 17014 at commit

[GitHub] spark issue #17136: [SPARK-19783][SQL] Treat longer lengths of tokens as mal...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17136 **[Test build #74924 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74924/testReport)** for PR 17136 at commit

[GitHub] spark issue #17334: [SPARK-19998][Block Manager]BlockRDD block not found Exc...

2017-03-20 Thread jianran
Github user jianran commented on the issue: https://github.com/apache/spark/pull/17334 @jerryshao what's title do you think is better? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #17353: [SPARK-17080][SQL][FOLLOWUP] Improve documentation and n...

2017-03-20 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/17353 debug log SGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or

[GitHub] spark issue #17312: [SPARK-19973] Display num of executors for the stage.

2017-03-20 Thread jerryshao
Github user jerryshao commented on the issue: https://github.com/apache/spark/pull/17312 IMO "(2 executors)" should be enough :). --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17191: [SPARK-14471][SQL] Aliases in SELECT could be used in GR...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17191 **[Test build #74922 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74922/testReport)** for PR 17191 at commit

[GitHub] spark issue #17014: [SPARK-18608][ML] Fix double-caching in ML algorithms

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17014 **[Test build #74921 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74921/testReport)** for PR 17014 at commit

[GitHub] spark issue #14617: [SPARK-17019][Core] Expose on-heap and off-heap memory u...

2017-03-20 Thread CodingCat
Github user CodingCat commented on the issue: https://github.com/apache/spark/pull/14617 I agree with the checkpoint based solution , thanks for asking --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does

[GitHub] spark pull request #16626: [SPARK-19261][SQL] Alter add columns for Hive ser...

2017-03-20 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/16626#discussion_r107057509 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLSuite.scala --- @@ -2178,4 +2178,138 @@ abstract class DDLSuite extends

[GitHub] spark issue #16209: [WIP][SPARK-10849][SQL] Adds option to the JDBC data sou...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/16209 **[Test build #74920 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74920/testReport)** for PR 16209 at commit

[GitHub] spark pull request #16626: [SPARK-19261][SQL] Alter add columns for Hive ser...

2017-03-20 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/16626#discussion_r107057320 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/catalog/SessionCatalog.scala --- @@ -296,6 +311,51 @@ class SessionCatalog( }

[GitHub] spark issue #17347: [SPARK-19980][SQL] Add NULL checks in Bean serializer

2017-03-20 Thread maropu
Github user maropu commented on the issue: https://github.com/apache/spark/pull/17347 @cloud-fan Could you check this? Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17361: [SPARK-20030][SS][WIP]Event-time-based timeout for MapGr...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17361 **[Test build #74919 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74919/testReport)** for PR 17361 at commit

[GitHub] spark issue #17354: [SPARK-20024] [SQL] [test-maven] SessionCatalog reset ne...

2017-03-20 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/17354 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the

[GitHub] spark issue #17315: [SPARK-19949][SQL] unify bad record handling in CSV and ...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17315 **[Test build #74918 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74918/testReport)** for PR 17315 at commit

[GitHub] spark pull request #17315: [SPARK-19949][SQL] unify bad record handling in C...

2017-03-20 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/17315#discussion_r107055987 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/FailureSafeParser.scala --- @@ -0,0 +1,80 @@ +/* + * Licensed to the

[GitHub] spark pull request #16971: [SPARK-19573][SQL] Make NaN/null handling consist...

2017-03-20 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/16971 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #17362: [SPARK-20033][SQL] support hive permanent function

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17362 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74912/ Test PASSed. ---

[GitHub] spark issue #17362: [SPARK-20033][SQL] support hive permanent function

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17362 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17362: [SPARK-20033][SQL] support hive permanent function

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17362 **[Test build #74912 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74912/testReport)** for PR 17362 at commit

[GitHub] spark issue #16971: [SPARK-19573][SQL] Make NaN/null handling consistent in ...

2017-03-20 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/16971 Since it is close to code freeze, I am first merging this PR. If any more comments, we can resolve them in the follow-up PR. --- If your project is set up for it, you can reply to this email

[GitHub] spark pull request #17315: [SPARK-19949][SQL] unify bad record handling in C...

2017-03-20 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/17315#discussion_r107055187 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/FailureSafeParser.scala --- @@ -0,0 +1,80 @@ +/* + * Licensed to the

[GitHub] spark issue #17219: [SPARK-19876][SS][WIP] OneTime Trigger Executor

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17219 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74914/ Test FAILed. ---

[GitHub] spark issue #17219: [SPARK-19876][SS][WIP] OneTime Trigger Executor

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17219 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17219: [SPARK-19876][SS][WIP] OneTime Trigger Executor

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17219 **[Test build #74914 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74914/testReport)** for PR 17219 at commit

[GitHub] spark pull request #16626: [SPARK-19261][SQL] Alter add columns for Hive ser...

2017-03-20 Thread xwu0226
Github user xwu0226 commented on a diff in the pull request: https://github.com/apache/spark/pull/16626#discussion_r107053921 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLSuite.scala --- @@ -2178,4 +2178,138 @@ abstract class DDLSuite extends

[GitHub] spark issue #17276: [SPARK-19937] Collect metrics of block sizes when shuffl...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17276 **[Test build #74917 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74917/testReport)** for PR 17276 at commit

[GitHub] spark pull request #17315: [SPARK-19949][SQL] unify bad record handling in C...

2017-03-20 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/17315#discussion_r107053686 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/util/FailureSafeParser.scala --- @@ -0,0 +1,80 @@ +/* + * Licensed to the

[GitHub] spark issue #17295: [SPARK-19556][core] Do not encrypt block manager data in...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17295 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17295: [SPARK-19556][core] Do not encrypt block manager data in...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17295 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74911/ Test FAILed. ---

[GitHub] spark issue #17295: [SPARK-19556][core] Do not encrypt block manager data in...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17295 **[Test build #74911 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74911/testReport)** for PR 17295 at commit

[GitHub] spark issue #17267: [SPARK-19926][PYSPARK] Make pyspark exception more reada...

2017-03-20 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/17267 I'll take a look at reviewing this later on this week @uncleGen. Two minor thing that we can do in the meantime is make the JIRA description a bit clearer as to what the proposed change is, the

[GitHub] spark issue #17352: [SPARK-20021][PySpark] Miss backslash in python code

2017-03-20 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/17352 This looks reasonable, however there is a conflict, can you update this to the latest master? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] spark issue #17312: [SPARK-19973] Display num of executors for the stage.

2017-03-20 Thread jinxing64
Github user jinxing64 commented on the issue: https://github.com/apache/spark/pull/17312 @rxin @jerryshao I uploaded another screenshot and give a short description there. Now it is (2 executors supplied). --- If your project is set up for it, you can reply to this email and

[GitHub] spark issue #17312: [SPARK-19973] Display num of executors for the stage.

2017-03-20 Thread jinxing64
Github user jinxing64 commented on the issue: https://github.com/apache/spark/pull/17312 ![screenshot2](https://cloud.githubusercontent.com/assets/4058918/24127926/5e0e7294-0e13-11e7-8af0-434b05e2815a.png) --- If your project is set up for it, you can reply to this email

[GitHub] spark pull request #16626: [SPARK-19261][SQL] Alter add columns for Hive ser...

2017-03-20 Thread xwu0226
Github user xwu0226 commented on a diff in the pull request: https://github.com/apache/spark/pull/16626#discussion_r107051682 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/HiveDDLSuite.scala --- @@ -1860,4 +1861,115 @@ class HiveDDLSuite }

[GitHub] spark issue #17312: [SPARK-19973] Display num of executors for the stage.

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17312 **[Test build #74916 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74916/testReport)** for PR 17312 at commit

[GitHub] spark issue #17361: [SPARK-20030][SS][WIP]Event-time-based timeout for MapGr...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17361 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17361: [SPARK-20030][SS][WIP]Event-time-based timeout for MapGr...

2017-03-20 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/17361 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/74910/ Test PASSed. ---

[GitHub] spark issue #17361: [SPARK-20030][SS][WIP]Event-time-based timeout for MapGr...

2017-03-20 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/17361 **[Test build #74910 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/74910/testReport)** for PR 17361 at commit

[GitHub] spark pull request #16626: [SPARK-19261][SQL] Alter add columns for Hive ser...

2017-03-20 Thread xwu0226
Github user xwu0226 commented on a diff in the pull request: https://github.com/apache/spark/pull/16626#discussion_r107051308 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/command/DDLSuite.scala --- @@ -2178,4 +2178,138 @@ abstract class DDLSuite extends

[GitHub] spark pull request #16626: [SPARK-19261][SQL] Alter add columns for Hive ser...

2017-03-20 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/16626#discussion_r107050210 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/HiveDDLSuite.scala --- @@ -1860,4 +1861,115 @@ class HiveDDLSuite }

<    1   2   3   4   5   6   7   >