[GitHub] spark issue #14324: [SPARK-16664][SQL] Fix persist call on Data frames with ...

2016-07-22 Thread lw-lin
Github user lw-lin commented on the issue: https://github.com/apache/spark/pull/14324 @breakdawn it'd be great to do more tests when you open a request. As I'm investigate into this too, I found that my same fix works for 201 cols but fails for 8118 cols. The exact limit is 8117.

[GitHub] spark issue #14324: [SPARK-16664][SQL] Fix persist call on Data frames with ...

2016-07-22 Thread breakdawn
Github user breakdawn commented on the issue: https://github.com/apache/spark/pull/14324 Yes, working on that --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #13756: [SPARK-16041][SQL] Disallow Duplicate Columns in partiti...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/13756 **[Test build #62746 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62746/consoleFull)** for PR 13756 at commit

[GitHub] spark issue #14324: [SPARK-16664][SQL] Fix persist call on Data frames with ...

2016-07-22 Thread rxin
Github user rxin commented on the issue: https://github.com/apache/spark/pull/14324 Can you add a test case? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #13756: [SPARK-16041][SQL] Disallow Duplicate Columns in partiti...

2016-07-22 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/13756 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #14296: [SPARK-16639][SQL] The query with having condition that ...

2016-07-22 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14296 @cloud-fan any more comments? Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14322: [SPARK-16689] [SQL] FileSourceStrategy: Pruning Partitio...

2016-07-22 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/14322 cc @marmbrus @cloud-fan @liancheng After history checking, most of codes are done by you. Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear

[GitHub] spark issue #14324: [SPARK-16664][SQL] Fix persist call on Data frames with ...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14324 Can one of the admins verify this patch? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request #14324: [SPARK-16664][SQL] Fix persist call on Data frame...

2016-07-22 Thread breakdawn
GitHub user breakdawn opened a pull request: https://github.com/apache/spark/pull/14324 [SPARK-16664][SQL] Fix persist call on Data frames with more than 200… ## What changes were proposed in this pull request? f12f11e578169b47e3f8b18b299948c0670ba585 introduced this bug,

[GitHub] spark issue #14323: [SPARK-16675][SQL] Avoid per-record type dispatch in JDB...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14323 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14323: [SPARK-16675][SQL] Avoid per-record type dispatch in JDB...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14323 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62745/ Test PASSed. ---

[GitHub] spark issue #14323: [SPARK-16675][SQL] Avoid per-record type dispatch in JDB...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14323 **[Test build #62745 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62745/consoleFull)** for PR 14323 at commit

[GitHub] spark issue #14270: [SPARK-5847][CORE] Allow for configuring MetricsSystem's...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14270 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14270: [SPARK-5847][CORE] Allow for configuring MetricsSystem's...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14270 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62744/ Test PASSed. ---

[GitHub] spark issue #14270: [SPARK-5847][CORE] Allow for configuring MetricsSystem's...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14270 **[Test build #62744 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62744/consoleFull)** for PR 14270 at commit

[GitHub] spark issue #14323: [SPARK-16675][SQL] Avoid per-record type dispatch in JDB...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14323 **[Test build #62745 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62745/consoleFull)** for PR 14323 at commit

[GitHub] spark pull request #14323: [SPARK-16675][SQL] Avoid per-record type dispatch...

2016-07-22 Thread HyukjinKwon
GitHub user HyukjinKwon opened a pull request: https://github.com/apache/spark/pull/14323 [SPARK-16675][SQL] Avoid per-record type dispatch in JDBC when writing ## What changes were proposed in this pull request? Currently, `JdbcUtils.savePartition` is doing type-based

[GitHub] spark issue #14322: [SPARK-16689] [SQL] FileSourceStrategy: Pruning Partitio...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14322 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62743/ Test PASSed. ---

[GitHub] spark issue #14322: [SPARK-16689] [SQL] FileSourceStrategy: Pruning Partitio...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14322 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14322: [SPARK-16689] [SQL] FileSourceStrategy: Pruning Partitio...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14322 **[Test build #62743 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62743/consoleFull)** for PR 14322 at commit

[GitHub] spark pull request #14164: [SPARK-16629] Allow comparisons between UDTs and ...

2016-07-22 Thread cloud-fan
Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/14164#discussion_r71965649 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala --- @@ -110,6 +110,28 @@ class DataFrameSuite extends QueryTest with

[GitHub] spark pull request #14296: [SPARK-16639][SQL] The query with having conditio...

2016-07-22 Thread viirya
Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/14296#discussion_r71965620 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -1207,6 +1207,12 @@ class Analyzer(

[GitHub] spark pull request #14322: [SPARK-16689] [SQL] FileSourceStrategy: Pruning P...

2016-07-22 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/14322#discussion_r71965622 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/FileSourceStrategy.scala --- @@ -135,9 +135,17 @@ private[sql] object

[GitHub] spark issue #13761: [SPARK-12197] [SparkCore] Kryo & Avro - Support Schema R...

2016-07-22 Thread RotemShaul
Github user RotemShaul commented on the issue: https://github.com/apache/spark/pull/13761 Indeed it is, but then you lose the already implemented GenericAvroSerializer abilities which come out of the box with Spark. (Caching / Registering of static schemas ) As Spark

[GitHub] spark issue #14259: [SPARK-16622][SQL] Fix NullPointerException when the ret...

2016-07-22 Thread viirya
Github user viirya commented on the issue: https://github.com/apache/spark/pull/14259 Thanks for reviewing this. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #14259: [SPARK-16622][SQL] Fix NullPointerException when the ret...

2016-07-22 Thread cloud-fan
Github user cloud-fan commented on the issue: https://github.com/apache/spark/pull/14259 thanks, merging to master! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark pull request #14259: [SPARK-16622][SQL] Fix NullPointerException when ...

2016-07-22 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/14259 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #13761: [SPARK-12197] [SparkCore] Kryo & Avro - Support Schema R...

2016-07-22 Thread rxin
Github user rxin commented on the issue: https://github.com/apache/spark/pull/13761 @RotemShaul is this something doable by implementing a custom serializer outside Spark? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as

[GitHub] spark issue #14270: [SPARK-5847][CORE] Allow for configuring MetricsSystem's...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14270 **[Test build #62744 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62744/consoleFull)** for PR 14270 at commit

[GitHub] spark issue #14270: [SPARK-5847][CORE] Allow for configuring MetricsSystem's...

2016-07-22 Thread markgrover
Github user markgrover commented on the issue: https://github.com/apache/spark/pull/14270 Ok, I have pushed changes to use the expansion capabilities brought in by SPARK-16272. Overall, I think it was a very good call to use that, so thanks for the suggestions! Would appreciate a

[GitHub] spark issue #14216: [SPARK-16561][MLLib] fix multivarOnlineSummary min/max b...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14216 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14216: [SPARK-16561][MLLib] fix multivarOnlineSummary min/max b...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14216 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62742/ Test PASSed. ---

[GitHub] spark issue #14216: [SPARK-16561][MLLib] fix multivarOnlineSummary min/max b...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14216 **[Test build #62742 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62742/consoleFull)** for PR 14216 at commit

[GitHub] spark issue #14322: [SPARK-16689] [SQL] FileSourceStrategy: Pruning Partitio...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14322 **[Test build #62743 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62743/consoleFull)** for PR 14322 at commit

[GitHub] spark issue #14322: [SPARK-16689] [SQL] FileSourceStrategy: Pruning Partitio...

2016-07-22 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/14322 **After the PR changes**, the whole-stage codegen output is like: ```JAVA == Subtree 1 / 1 == *Scan json [value#37L] Format: JSON, InputPaths:

[GitHub] spark issue #14322: [SPARK-16689] [SQL] FileSourceStrategy: Pruning Partitio...

2016-07-22 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/14322 **Before the PR changes**, the whole-stage codegen output is like: ```JAVA == Subtree 1 / 1 == *Project [value#37L] +- *Scan json [value#37L,p1#39,p2#40,p3#41] Format: JSON,

[GitHub] spark pull request #14322: [SPARK-16689] [SQL] FileSourceStrategy: Pruning P...

2016-07-22 Thread gatorsmile
GitHub user gatorsmile opened a pull request: https://github.com/apache/spark/pull/14322 [SPARK-16689] [SQL] FileSourceStrategy: Pruning Partition Columns When No Partition Column Exist in Project ### What changes were proposed in this pull request? For partitioned file

[GitHub] spark issue #14216: [SPARK-16561][MLLib] fix multivarOnlineSummary min/max b...

2016-07-22 Thread WeichenXu123
Github user WeichenXu123 commented on the issue: https://github.com/apache/spark/pull/14216 @srowen Oh, I miss your comment about loop brace, now it added, thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark issue #14216: [SPARK-16561][MLLib] fix multivarOnlineSummary min/max b...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14216 **[Test build #62742 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62742/consoleFull)** for PR 14216 at commit

[GitHub] spark issue #14086: [SPARK-16463][SQL] Support `truncate` option in Overwrit...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14086 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62741/ Test PASSed. ---

[GitHub] spark issue #14086: [SPARK-16463][SQL] Support `truncate` option in Overwrit...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14086 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14086: [SPARK-16463][SQL] Support `truncate` option in Overwrit...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14086 **[Test build #62741 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62741/consoleFull)** for PR 14086 at commit

[GitHub] spark issue #14304: [SPARK-16668][TEST] Test parquet reader for row groups c...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14304 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62740/ Test PASSed. ---

[GitHub] spark issue #14304: [SPARK-16668][TEST] Test parquet reader for row groups c...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14304 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14304: [SPARK-16668][TEST] Test parquet reader for row groups c...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14304 **[Test build #62740 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62740/consoleFull)** for PR 14304 at commit

[GitHub] spark issue #14321: [SPARK-8971][ML] Add stratified sampling to ML CrossVali...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14321 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14321: [SPARK-8971][ML] Add stratified sampling to ML CrossVali...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14321 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62738/ Test PASSed. ---

[GitHub] spark issue #14321: [SPARK-8971][ML] Add stratified sampling to ML CrossVali...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14321 **[Test build #62738 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62738/consoleFull)** for PR 14321 at commit

[GitHub] spark issue #14319: [SPARK-16635] [WEBUI] [SQL] [WIP] Provide Session suppor...

2016-07-22 Thread nblintao
Github user nblintao commented on the issue: https://github.com/apache/spark/pull/14319 Thanks, @ajbozarth. Yes, I think the configuration alone is not enough for a new tab. @yhuai and I actually plan to do more on this tab. As mentioned in

[GitHub] spark issue #14295: [SPARK-16648][SQL] Overrides TreeNode.withNewChildren in...

2016-07-22 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/14295 @liancheng Can you also change `First`? I think that one is also broken for this case. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as

[GitHub] spark issue #14086: [SPARK-16463][SQL] Support `truncate` option in Overwrit...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14086 **[Test build #62741 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62741/consoleFull)** for PR 14086 at commit

[GitHub] spark issue #14304: [SPARK-16668][TEST] Test parquet reader for row groups c...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14304 **[Test build #62740 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62740/consoleFull)** for PR 14304 at commit

[GitHub] spark issue #14086: [SPARK-16463][SQL] Support `truncate` option in Overwrit...

2016-07-22 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/14086 Rebased. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or

[GitHub] spark issue #14304: [SPARK-16668][TEST] Test parquet reader for row groups c...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14304 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62739/ Test FAILed. ---

[GitHub] spark issue #14304: [SPARK-16668][TEST] Test parquet reader for row groups c...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14304 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14304: [SPARK-16668][TEST] Test parquet reader for row groups c...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14304 **[Test build #62739 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62739/consoleFull)** for PR 14304 at commit

[GitHub] spark issue #14304: [SPARK-16668][TEST] Test parquet reader for row groups c...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14304 **[Test build #62739 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62739/consoleFull)** for PR 14304 at commit

[GitHub] spark pull request #14304: [SPARK-16668][TEST] Test parquet reader for row g...

2016-07-22 Thread sameeragarwal
Github user sameeragarwal commented on a diff in the pull request: https://github.com/apache/spark/pull/14304#discussion_r71954942 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetEncodingSuite.scala --- @@ -78,4 +78,30 @@ class

[GitHub] spark pull request #14304: [SPARK-16668][TEST] Test parquet reader for row g...

2016-07-22 Thread sameeragarwal
Github user sameeragarwal commented on a diff in the pull request: https://github.com/apache/spark/pull/14304#discussion_r71954540 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetEncodingSuite.scala --- @@ -78,4 +78,30 @@ class

[GitHub] spark issue #14248: [SPARK-16589][PYTHON] Chained cartesian produces incorre...

2016-07-22 Thread zero323
Github user zero323 commented on the issue: https://github.com/apache/spark/pull/14248 @holdenk Can we move this discussion to JIRA? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #14079: [SPARK-8425][CORE] New Blacklist Mechanism

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14079 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14079: [SPARK-8425][CORE] New Blacklist Mechanism

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14079 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62737/ Test PASSed. ---

[GitHub] spark issue #14079: [SPARK-8425][CORE] New Blacklist Mechanism

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14079 **[Test build #62737 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62737/consoleFull)** for PR 14079 at commit

[GitHub] spark issue #14321: [SPARK-8971][ML] Add stratified sampling to ML CrossVali...

2016-07-22 Thread sethah
Github user sethah commented on the issue: https://github.com/apache/spark/pull/14321 cc @MLnick @hhbyyh @mengxr I believe there is still interest in stratified sampling methods. Could you provide feedback/review on this patch? Thanks! --- If your project is set up for it, you can

[GitHub] spark issue #14321: [SPARK-8971][ML] Add stratified sampling to ML CrossVali...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14321 **[Test build #62738 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62738/consoleFull)** for PR 14321 at commit

[GitHub] spark pull request #14321: [SPARK-8971][ML] Add stratified sampling to ML Cr...

2016-07-22 Thread sethah
GitHub user sethah opened a pull request: https://github.com/apache/spark/pull/14321 [SPARK-8971][ML] Add stratified sampling to ML CrossValidator and TrainValidationSplit ## What changes were proposed in this pull request? This patch adds the ability to do stratified

[GitHub] spark issue #14319: [SPARK-16635][WEBUI][SQL] Provide Session support in the...

2016-07-22 Thread ajbozarth
Github user ajbozarth commented on the issue: https://github.com/apache/spark/pull/14319 I've read through your code and didn't catch any issues, I also checked it out and it looks good. I think this is a nice feature to add, my only qualm is it add yet another tab to the Web UI. If

[GitHub] spark issue #9766: [SPARK-11775][PYSPARK][SQL] Allow PySpark to register Jav...

2016-07-22 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/9766 I was looking at some similar stuff as part of https://github.com/apache/spark/pull/13571 and I was thinking that (to match the Scala API) it would be good to return the UDF object as well so people

[GitHub] spark issue #14233: [SPARK-16490] [Examples] added a python example for chis...

2016-07-22 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/14233 Thanks for taking this on! More documentation is always an improvement - looking at the Scala & Java examples it seems like they are included in ./docs/mllib-feature-extraction.md - it would

[GitHub] spark issue #13932: [SPARK-15354] [CORE] [WIP] Topology aware block replicat...

2016-07-22 Thread shubhamchopra
Github user shubhamchopra commented on the issue: https://github.com/apache/spark/pull/13932 Based on feedback from @rxin, added a Basic Strategy that replicates HDFS behavior as a simpler alternative to the constraint solver. I also ran some performance tests on the constraint

[GitHub] spark issue #14269: [SPARK-15703] [Scheduler][Core][WebUI] Make ListenerBus ...

2016-07-22 Thread dhruve
Github user dhruve commented on the issue: https://github.com/apache/spark/pull/14269 The set of failures from [Test build #62733](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62733/consoleFull) are unrelated. --- If your project is set up for it, you can

[GitHub] spark pull request #14314: [SPARK-16678] [SPARK-16677] [SQL] Fix two View-re...

2016-07-22 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/14314#discussion_r71939503 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLViewSuite.scala --- @@ -55,6 +54,76 @@ class SQLViewSuite extends QueryTest

[GitHub] spark issue #14309: [SPARK-11977][SQL] Support accessing a column contains "...

2016-07-22 Thread shivaram
Github user shivaram commented on the issue: https://github.com/apache/spark/pull/14309 I'm not sure I am good reviewer for this as I dont fully understand the consequences inside SQL for this change. cc @liancheng @rxin --- If your project is set up for it, you can reply to this

[GitHub] spark pull request #14314: [SPARK-16678] [SPARK-16677] [SQL] Fix two View-re...

2016-07-22 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/14314#discussion_r71939188 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLViewSuite.scala --- @@ -55,6 +54,76 @@ class SQLViewSuite extends QueryTest

[GitHub] spark pull request #14314: [SPARK-16678] [SPARK-16677] [SQL] Fix two View-re...

2016-07-22 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/14314#discussion_r71938799 --- Diff: sql/hive/src/test/scala/org/apache/spark/sql/hive/execution/SQLViewSuite.scala --- @@ -55,6 +54,76 @@ class SQLViewSuite extends QueryTest

[GitHub] spark pull request #14240: [SPARK-16594] [SQL] Remove Physical Plan Differen...

2016-07-22 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/14240#discussion_r71938022 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/sources/PrunedScanSuite.scala --- @@ -114,16 +114,15 @@ class PrunedScanSuite extends

[GitHub] spark pull request #14240: [SPARK-16594] [SQL] Remove Physical Plan Differen...

2016-07-22 Thread gatorsmile
Github user gatorsmile closed the pull request at: https://github.com/apache/spark/pull/14240 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature

[GitHub] spark issue #14240: [SPARK-16594] [SQL] Remove Physical Plan Differences whe...

2016-07-22 Thread gatorsmile
Github user gatorsmile commented on the issue: https://github.com/apache/spark/pull/14240 Since File Scan is completely different from Data Source Table Scan, Hive Table Scan and In-memory Table Scan, it does not make sense to make all of them consistent. Close it now. --- If your

[GitHub] spark issue #2135: [SPARK-3229] spark.shuffle.safetyFraction and spark.stora...

2016-07-22 Thread hastimal
Github user hastimal commented on the issue: https://github.com/apache/spark/pull/2135 @andrewor14 Thank you for these info. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14079: [SPARK-8425][CORE] New Blacklist Mechanism

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14079 **[Test build #62737 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62737/consoleFull)** for PR 14079 at commit

[GitHub] spark issue #14079: [SPARK-8425][CORE] New Blacklist Mechanism

2016-07-22 Thread squito
Github user squito commented on the issue: https://github.com/apache/spark/pull/14079 Jenkins, retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #14275: [SPARK-16637] Unified containerizer

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14275 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14275: [SPARK-16637] Unified containerizer

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14275 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62735/ Test PASSed. ---

[GitHub] spark issue #14275: [SPARK-16637] Unified containerizer

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14275 **[Test build #62735 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62735/consoleFull)** for PR 14275 at commit

[GitHub] spark issue #14288: [SPARK-16651][PYSPARK][DOC] Make `withColumnRenamed/drop...

2016-07-22 Thread dongjoon-hyun
Github user dongjoon-hyun commented on the issue: https://github.com/apache/spark/pull/14288 Thank you for review and merging, @srowen and @rxin . --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark issue #14079: [SPARK-8425][CORE] New Blacklist Mechanism

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14079 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14079: [SPARK-8425][CORE] New Blacklist Mechanism

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14079 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62734/ Test FAILed. ---

[GitHub] spark issue #14079: [SPARK-8425][CORE] New Blacklist Mechanism

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14079 **[Test build #62734 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62734/consoleFull)** for PR 14079 at commit

[GitHub] spark issue #14269: [SPARK-15703] [Scheduler][Core][WebUI] Make ListenerBus ...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14269 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14269: [SPARK-15703] [Scheduler][Core][WebUI] Make ListenerBus ...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14269 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62732/ Test PASSed. ---

[GitHub] spark issue #14269: [SPARK-15703] [Scheduler][Core][WebUI] Make ListenerBus ...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14269 **[Test build #62732 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62732/consoleFull)** for PR 14269 at commit

[GitHub] spark issue #14320: [SPARK-16416] [Core] force eager creation of loggers to ...

2016-07-22 Thread srowen
Github user srowen commented on the issue: https://github.com/apache/spark/pull/14320 This doesn't seem to be the change discussed in the JIRA --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #14269: [SPARK-15703] [Scheduler][Core][WebUI] Make ListenerBus ...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14269 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #14269: [SPARK-15703] [Scheduler][Core][WebUI] Make ListenerBus ...

2016-07-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/14269 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/62733/ Test FAILed. ---

[GitHub] spark issue #14174: [SPARK-16524][SQL] Add RowBatch and RowBasedHashMapGener...

2016-07-22 Thread sameeragarwal
Github user sameeragarwal commented on the issue: https://github.com/apache/spark/pull/14174 LGTM. cc @ericl --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #14269: [SPARK-15703] [Scheduler][Core][WebUI] Make ListenerBus ...

2016-07-22 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/14269 **[Test build #62733 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/62733/consoleFull)** for PR 14269 at commit

[GitHub] spark pull request #14292: [SPARK-14131][SQL[STREAMING] Improved fix for avo...

2016-07-22 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/14292#discussion_r71924307 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/SQLTestUtils.scala --- @@ -247,6 +248,46 @@ private[sql] trait SQLTestUtils } }

[GitHub] spark pull request #14174: [SPARK-16524][SQL] Add RowBatch and RowBasedHashM...

2016-07-22 Thread sameeragarwal
Github user sameeragarwal commented on a diff in the pull request: https://github.com/apache/spark/pull/14174#discussion_r71924045 --- Diff: sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/RowBasedKeyValueBatch.java --- @@ -0,0 +1,182 @@ +/* + *

[GitHub] spark pull request #14292: [SPARK-14131][SQL[STREAMING] Improved fix for avo...

2016-07-22 Thread tdas
Github user tdas commented on a diff in the pull request: https://github.com/apache/spark/pull/14292#discussion_r71923960 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/test/SQLTestUtils.scala --- @@ -247,6 +248,46 @@ private[sql] trait SQLTestUtils } }

[GitHub] spark pull request #14174: [SPARK-16524][SQL] Add RowBatch and RowBasedHashM...

2016-07-22 Thread sameeragarwal
Github user sameeragarwal commented on a diff in the pull request: https://github.com/apache/spark/pull/14174#discussion_r71923918 --- Diff: sql/catalyst/src/main/java/org/apache/spark/sql/catalyst/expressions/RowBasedKeyValueBatch.java --- @@ -0,0 +1,182 @@ +/* + *

  1   2   3   >