[GitHub] spark issue #15651: [SPARK-17972][SQL] Add Dataset.checkpoint() to truncate ...

2016-10-27 Thread liancheng
Github user liancheng commented on the issue: https://github.com/apache/spark/pull/15651 @viirya `Dataset.localCheckpoint()` also makes sense. Would like to add it as a follow-up though. Thanks for the suggestion! --- If your project is set up for it, you can reply to this email and

[GitHub] spark issue #15663: [SPARK-18143][SQL]Ignore Structured Streaming event logs...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15663 **[Test build #67680 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67680/consoleFull)** for PR 15663 at commit

[GitHub] spark issue #10162: [SPARK-11250] [SQL] Generate different alias for columns...

2016-10-27 Thread NarineK
Github user NarineK commented on the issue: https://github.com/apache/spark/pull/10162 In pandas it has 2 arguments: lsuffix='', rsuffix='', respectively for left and right sides. And it appends the suffixes to all column names regardless if they are in join condition or not .

[GitHub] spark issue #15651: [SPARK-17972][SQL] Add Dataset.checkpoint() to truncate ...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15651 **[Test build #67679 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67679/consoleFull)** for PR 15651 at commit

[GitHub] spark issue #15663: [SPARK-18143][SQL]Ignore Structured Streaming event logs...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15663 **[Test build #67678 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67678/consoleFull)** for PR 15663 at commit

[GitHub] spark issue #15276: [SPARK-17703][SQL] Add unnamed version of addReferenceOb...

2016-10-27 Thread rxin
Github user rxin commented on the issue: https://github.com/apache/spark/pull/15276 Sure sounds good. On Thursday, October 27, 2016, Davies Liu wrote: > @rxin Hit the same issue here today (10k errMsg > in

[GitHub] spark issue #15662: [SPARK-18141][SQL] Fix to quote column names in the pred...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15662 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67669/ Test PASSed. ---

[GitHub] spark issue #15662: [SPARK-18141][SQL] Fix to quote column names in the pred...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15662 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15662: [SPARK-18141][SQL] Fix to quote column names in the pred...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15662 **[Test build #67669 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67669/consoleFull)** for PR 15662 at commit

[GitHub] spark issue #15647: [SPARK-18088][ML] Various ChiSqSelector cleanups

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15647 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67672/ Test PASSed. ---

[GitHub] spark issue #15647: [SPARK-18088][ML] Various ChiSqSelector cleanups

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15647 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #15633: [SPARK-18087] [SQL] Optimize insert to not requir...

2016-10-27 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/15633#discussion_r85445969 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/WriteOutput.scala --- @@ -375,6 +378,7 @@ object WriteOutput extends Logging {

[GitHub] spark issue #15647: [SPARK-18088][ML] Various ChiSqSelector cleanups

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15647 **[Test build #67672 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67672/consoleFull)** for PR 15647 at commit

[GitHub] spark pull request #15633: [SPARK-18087] [SQL] Optimize insert to not requir...

2016-10-27 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/15633#discussion_r85445643 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/WriteOutput.scala --- @@ -386,13 +390,18 @@ object WriteOutput extends Logging

[GitHub] spark issue #15471: [SPARK-17919] Make timeout to RBackend configurable in S...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15471 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #10162: [SPARK-11250] [SQL] Generate different alias for columns...

2016-10-27 Thread NarineK
Github user NarineK commented on the issue: https://github.com/apache/spark/pull/10162 Thank you for following up on this, @marmbrus ! I looked into two places: R and Pandas DataFrames. In R it seems that they give new names to columns(columns which aren't in merge/join

[GitHub] spark issue #15471: [SPARK-17919] Make timeout to RBackend configurable in S...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15471 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67667/ Test PASSed. ---

[GitHub] spark issue #15471: [SPARK-17919] Make timeout to RBackend configurable in S...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15471 **[Test build #67667 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67667/consoleFull)** for PR 15471 at commit

[GitHub] spark issue #15665: [SPARK-18146] [SQL] Avoid using Union to chain together ...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15665 **[Test build #67677 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67677/consoleFull)** for PR 15665 at commit

[GitHub] spark pull request #15665: [SPARK-18146] [SQL] Avoid using Union to chain to...

2016-10-27 Thread ericl
GitHub user ericl opened a pull request: https://github.com/apache/spark/pull/15665 [SPARK-18146] [SQL] Avoid using Union to chain together create table and repair partition commands ## What changes were proposed in this pull request? The behavior of union is not well

[GitHub] spark issue #15634: [SPARK-18103] [SQL] Rename *FileCatalog to *FileProvider

2016-10-27 Thread ericl
Github user ericl commented on the issue: https://github.com/apache/spark/pull/15634 Concretely, I propose renaming ``` TableFileCatalog => MetastoreFileIndex ListingFileCatalog => InMemoryFileIndex MetadataLogFileCatalog => MetadataLogFileIndex

[GitHub] spark pull request #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/15148#discussion_r85444756 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/RandomProjection.scala --- @@ -0,0 +1,215 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/15148#discussion_r85444781 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/MinHash.scala --- @@ -0,0 +1,192 @@ +/* + * Licensed to the Apache Software Foundation (ASF)

[GitHub] spark issue #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15148 **[Test build #67676 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67676/consoleFull)** for PR 15148 at commit

[GitHub] spark issue #9313: [SPARK-10658][SPARK-11421][PYSPARK][CORE] Provide add jar...

2016-10-27 Thread mariusvniekerk
Github user mariusvniekerk commented on the issue: https://github.com/apache/spark/pull/9313 So since py4j now uses the context classloader, we can remove the python pieces about loading a class by name. @holdenk If you want I can revisit this PR. This case

[GitHub] spark issue #15663: [SPARK-18143][SQL]Ignore Structured Streaming event logs...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15663 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67666/ Test PASSed. ---

[GitHub] spark issue #15663: [SPARK-18143][SQL]Ignore Structured Streaming event logs...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15663 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15663: [SPARK-18143][SQL]Ignore Structured Streaming event logs...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15663 **[Test build #67666 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67666/consoleFull)** for PR 15663 at commit

[GitHub] spark issue #15659: [WIP][SPARK-1267][SPARK-18129] Allow PySpark to be pip i...

2016-10-27 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/15659 (Yes, it seems the particular test is really flicky) --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not

[GitHub] spark issue #15659: [WIP][SPARK-1267][SPARK-18129] Allow PySpark to be pip i...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15659 **[Test build #67675 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67675/consoleFull)** for PR 15659 at commit

[GitHub] spark pull request #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/15148#discussion_r85443153 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/RandomProjection.scala --- @@ -0,0 +1,215 @@ +/* + * Licensed to the Apache Software

[GitHub] spark issue #15659: [WIP][SPARK-1267][SPARK-18129] Allow PySpark to be pip i...

2016-10-27 Thread holdenk
Github user holdenk commented on the issue: https://github.com/apache/spark/pull/15659 Scala SQL failure seems possibly unrelated, jenkins retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark issue #15513: [SPARK-17963][SQL][Documentation] Add examples (extend) ...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15513 **[Test build #67674 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67674/consoleFull)** for PR 15513 at commit

[GitHub] spark issue #15513: [SPARK-17963][SQL][Documentation] Add examples (extend) ...

2016-10-27 Thread HyukjinKwon
Github user HyukjinKwon commented on the issue: https://github.com/apache/spark/pull/15513 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #15657: [DO NOT MERGE] Test partition

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15657 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67670/ Test FAILed. ---

[GitHub] spark issue #15657: [DO NOT MERGE] Test partition

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15657 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15657: [DO NOT MERGE] Test partition

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15657 **[Test build #67670 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67670/consoleFull)** for PR 15657 at commit

[GitHub] spark issue #15664: [SPARK-18123][SQL] Use db column names instead of RDD co...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15664 **[Test build #67673 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67673/consoleFull)** for PR 15664 at commit

[GitHub] spark issue #15354: [SPARK-17764][SQL] Add `to_json` supporting to convert n...

2016-10-27 Thread felixcheung
Github user felixcheung commented on the issue: https://github.com/apache/spark/pull/15354 looks good, should we clarify output JSON is in JSON Lines format? http://jsonlines.org/ --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] spark pull request #15664: [SPARK-18123][SQL] Use db column names instead of...

2016-10-27 Thread dongjoon-hyun
GitHub user dongjoon-hyun opened a pull request: https://github.com/apache/spark/pull/15664 [SPARK-18123][SQL] Use db column names instead of RDD column ones during JDBC Writing ## What changes were proposed in this pull request? Apache Spark supports the following cases

[GitHub] spark issue #15663: [SPARK-18143][SQL]Ignore Structured Streaming event logs...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15663 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67664/ Test PASSed. ---

[GitHub] spark issue #15634: [SPARK-18103] [SQL] Rename *FileCatalog to *FileProvider

2016-10-27 Thread ericl
Github user ericl commented on the issue: https://github.com/apache/spark/pull/15634 How about `FileIndex`, since this class is responsible for both listing and filtering functionality? --- If your project is set up for it, you can reply to this email and have your reply appear on

[GitHub] spark issue #15663: [SPARK-18143][SQL]Ignore Structured Streaming event logs...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15663 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15663: [SPARK-18143][SQL]Ignore Structured Streaming event logs...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15663 **[Test build #67664 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67664/consoleFull)** for PR 15663 at commit

[GitHub] spark issue #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15148 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67668/ Test PASSed. ---

[GitHub] spark issue #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15148 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15647: [SPARK-18088][ML] Various ChiSqSelector cleanups

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15647 **[Test build #67672 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67672/consoleFull)** for PR 15647 at commit

[GitHub] spark issue #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15148 **[Test build #67668 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67668/consoleFull)** for PR 15148 at commit

[GitHub] spark issue #15647: [SPARK-18088][ML] Various ChiSqSelector cleanups

2016-10-27 Thread jkbradley
Github user jkbradley commented on the issue: https://github.com/apache/spark/pull/15647 Updated---thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so,

[GitHub] spark issue #15663: [SPARK-18143][SQL]Ignore Structured Streaming event logs...

2016-10-27 Thread tdas
Github user tdas commented on the issue: https://github.com/apache/spark/pull/15663 one comment, but overall looks good. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #15633: [SPARK-18087] [SQL] Optimize insert to not require REPAI...

2016-10-27 Thread ericl
Github user ericl commented on the issue: https://github.com/apache/spark/pull/15633 cc @cloud-fan @davies --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark issue #15633: [SPARK-18087] [SQL] [WIP] Optimize insert to not require...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15633 **[Test build #67671 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67671/consoleFull)** for PR 15633 at commit

[GitHub] spark issue #15657: [DO NOT MERGE] Test partition

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15657 **[Test build #67670 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67670/consoleFull)** for PR 15657 at commit

[GitHub] spark pull request #15515: [SPARK-17970][SQL] store partition spec in metast...

2016-10-27 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/15515 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is

[GitHub] spark issue #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15148 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67665/ Test PASSed. ---

[GitHub] spark issue #15657: [DO NOT MERGE] Test partition

2016-10-27 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/15657 test this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or

[GitHub] spark issue #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15148 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15148 **[Test build #67665 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67665/consoleFull)** for PR 15148 at commit

[GitHub] spark issue #15515: [SPARK-17970][SQL] store partition spec in metastore for...

2016-10-27 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/15515 Cool. I am merging this pr to unblock other tasks. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark issue #15662: [SPARK-18141][SQL] Fix to quote column names in the pred...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15662 **[Test build #67669 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67669/consoleFull)** for PR 15662 at commit

[GitHub] spark pull request #15515: [SPARK-17970][SQL] store partition spec in metast...

2016-10-27 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/15515#discussion_r85424491 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeColumnCommand.scala --- @@ -50,7 +50,8 @@ case class AnalyzeColumnCommand(

[GitHub] spark pull request #15515: [SPARK-17970][SQL] store partition spec in metast...

2016-10-27 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/15515#discussion_r85429395 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala --- @@ -531,6 +529,11 @@ case class AlterTableRecoverPartitionsCommand(

[GitHub] spark pull request #15515: [SPARK-17970][SQL] store partition spec in metast...

2016-10-27 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/15515#discussion_r85429255 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/createDataSourceTables.scala --- @@ -232,6 +238,15 @@ case class

[GitHub] spark pull request #15515: [SPARK-17970][SQL] store partition spec in metast...

2016-10-27 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/15515#discussion_r85424270 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala --- @@ -387,7 +388,15 @@ final class DataFrameWriter[T] private[sql](ds:

[GitHub] spark pull request #15515: [SPARK-17970][SQL] store partition spec in metast...

2016-10-27 Thread ericl
Github user ericl commented on a diff in the pull request: https://github.com/apache/spark/pull/15515#discussion_r85429519 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/DataSourceStrategy.scala --- @@ -188,6 +188,15 @@ case class

[GitHub] spark issue #15651: [SPARK-17972][SQL] Add Dataset.checkpoint() to truncate ...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15651 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15662: [SPARK-18141][SQL] Fix to quote column names in the pred...

2016-10-27 Thread sureshthalamati
Github user sureshthalamati commented on the issue: https://github.com/apache/spark/pull/15662 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark issue #15651: [SPARK-17972][SQL] Add Dataset.checkpoint() to truncate ...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15651 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67663/ Test FAILed. ---

[GitHub] spark issue #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15148 **[Test build #67668 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67668/consoleFull)** for PR 15148 at commit

[GitHub] spark issue #15651: [SPARK-17972][SQL] Add Dataset.checkpoint() to truncate ...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15651 **[Test build #67663 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67663/consoleFull)** for PR 15651 at commit

[GitHub] spark issue #15662: [SPARK-18141][SQL] Fix to quote column names in the pred...

2016-10-27 Thread sureshthalamati
Github user sureshthalamati commented on the issue: https://github.com/apache/spark/pull/15662 Test failed is org.apache.spark.sql.streaming.StreamingQuerySuite, unrelated to this change. Might have been fixed in commit

[GitHub] spark issue #15538: [SPARK-17993][SQL] Fix Parquet log output redirection

2016-10-27 Thread ericl
Github user ericl commented on the issue: https://github.com/apache/spark/pull/15538 Jenkins retest this please On Thu, Oct 27, 2016, 12:34 PM Michael Allman wrote: Looks like the test failed for reasons unrelated to this PR. Can someone

[GitHub] spark issue #15636: [SPARK-18109][ML] Add instrumentation to GMM

2016-10-27 Thread jkbradley
Github user jkbradley commented on the issue: https://github.com/apache/spark/pull/15636 Other than that, this looks fine. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #15636: [SPARK-18109][ML] Add instrumentation to GMM

2016-10-27 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/15636#discussion_r85426938 --- Diff: mllib/src/main/scala/org/apache/spark/ml/clustering/GaussianMixture.scala --- @@ -337,6 +340,8 @@ class GaussianMixture @Since("2.0.0") (

[GitHub] spark pull request #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/15148#discussion_r85424257 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/RandomProjection.scala --- @@ -0,0 +1,215 @@ +/* + * Licensed to the Apache Software

[GitHub] spark issue #15471: [SPARK-17919] Make timeout to RBackend configurable in S...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15471 **[Test build #67667 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67667/consoleFull)** for PR 15471 at commit

[GitHub] spark issue #15471: [SPARK-17919] Make timeout to RBackend configurable in S...

2016-10-27 Thread falaki
Github user falaki commented on the issue: https://github.com/apache/spark/pull/15471 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so,

[GitHub] spark pull request #15515: [SPARK-17970][SQL] store partition spec in metast...

2016-10-27 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/15515#discussion_r85420521 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/AnalyzeColumnCommand.scala --- @@ -50,7 +50,8 @@ case class AnalyzeColumnCommand(

[GitHub] spark pull request #15515: [SPARK-17970][SQL] store partition spec in metast...

2016-10-27 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/15515#discussion_r85415502 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/DataFrameWriter.scala --- @@ -387,7 +388,15 @@ final class DataFrameWriter[T] private[sql](ds:

[GitHub] spark issue #15515: [SPARK-17970][SQL] store partition spec in metastore for...

2016-10-27 Thread yhuai
Github user yhuai commented on the issue: https://github.com/apache/spark/pull/15515 Looks good. I left a few questions. Let me know if you want to address them in follow-up prs. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub

[GitHub] spark pull request #15515: [SPARK-17970][SQL] store partition spec in metast...

2016-10-27 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/15515#discussion_r85421683 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/ddl.scala --- @@ -531,6 +529,11 @@ case class AlterTableRecoverPartitionsCommand(

[GitHub] spark pull request #15515: [SPARK-17970][SQL] store partition spec in metast...

2016-10-27 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/15515#discussion_r85421410 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/command/createDataSourceTables.scala --- @@ -232,6 +238,15 @@ case class

[GitHub] spark issue #15471: [SPARK-17919] Make timeout to RBackend configurable in S...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15471 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67661/ Test FAILed. ---

[GitHub] spark pull request #15651: [SPARK-17972][SQL] Add Dataset.checkpoint() to tr...

2016-10-27 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/15651#discussion_r85421484 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/ExistingRDD.scala --- @@ -130,17 +130,23 @@ case class ExternalRDDScanExec[T]( /**

[GitHub] spark issue #15471: [SPARK-17919] Make timeout to RBackend configurable in S...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15471 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15471: [SPARK-17919] Make timeout to RBackend configurable in S...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15471 **[Test build #67661 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67661/consoleFull)** for PR 15471 at commit

[GitHub] spark issue #15662: [SPARK-18141][SQL] Fix to quote column names in the pred...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15662 Merged build finished. Test FAILed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15662: [SPARK-18141][SQL] Fix to quote column names in the pred...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15662 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67662/ Test FAILed. ---

[GitHub] spark issue #15662: [SPARK-18141][SQL] Fix to quote column names in the pred...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15662 **[Test build #67662 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67662/consoleFull)** for PR 15662 at commit

[GitHub] spark issue #15663: [SPARK-18143][SQL]Ignore Structured Streaming event logs...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15663 **[Test build #67666 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67666/consoleFull)** for PR 15663 at commit

[GitHub] spark pull request #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/15148#discussion_r85418443 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/RandomProjection.scala --- @@ -0,0 +1,215 @@ +/* + * Licensed to the Apache Software

[GitHub] spark pull request #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/15148#discussion_r85417786 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/MinHash.scala --- @@ -0,0 +1,186 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] spark pull request #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/15148#discussion_r85417570 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/MinHash.scala --- @@ -0,0 +1,186 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] spark pull request #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread jkbradley
Github user jkbradley commented on a diff in the pull request: https://github.com/apache/spark/pull/15148#discussion_r85415500 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/MinHash.scala --- @@ -0,0 +1,186 @@ +/* + * Licensed to the Apache Software Foundation

[GitHub] spark issue #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15148 **[Test build #67665 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67665/consoleFull)** for PR 15148 at commit

[GitHub] spark pull request #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/15148#discussion_r85418671 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/MinHash.scala --- @@ -0,0 +1,186 @@ +/* + * Licensed to the Apache Software Foundation (ASF)

[GitHub] spark pull request #15148: [SPARK-5992][ML] Locality Sensitive Hashing

2016-10-27 Thread Yunni
Github user Yunni commented on a diff in the pull request: https://github.com/apache/spark/pull/15148#discussion_r85417885 --- Diff: mllib/src/main/scala/org/apache/spark/ml/feature/LSH.scala --- @@ -0,0 +1,339 @@ +/* + * Licensed to the Apache Software Foundation (ASF)

[GitHub] spark issue #15661: [SPARK-16963][SQL]Fix test "StreamExecution metadata gar...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15661 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #15661: [SPARK-16963][SQL]Fix test "StreamExecution metadata gar...

2016-10-27 Thread AmplabJenkins
Github user AmplabJenkins commented on the issue: https://github.com/apache/spark/pull/15661 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/67659/ Test PASSed. ---

[GitHub] spark issue #15661: [SPARK-16963][SQL]Fix test "StreamExecution metadata gar...

2016-10-27 Thread SparkQA
Github user SparkQA commented on the issue: https://github.com/apache/spark/pull/15661 **[Test build #67659 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/67659/consoleFull)** for PR 15661 at commit

<    1   2   3   4   5   >