[GitHub] spark pull request: [SPARK-7069][SQL] Rename NativeType -> AtomicT...

2015-04-22 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/5650#issuecomment-95465861 Going to re-open a PR. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this fe

[GitHub] spark pull request: [SPARK-7069][SQL] Rename NativeType -> AtomicT...

2015-04-22 Thread rxin
Github user rxin closed the pull request at: https://github.com/apache/spark/pull/5650 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enable

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95465033 [Test build #30812 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30812/consoleFull) for PR 5643 at commit [`41c20d5`](https://gith

[GitHub] spark pull request: [MLlib] Add support for BooleanType to VectorA...

2015-04-22 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/5648 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95465053 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-7068][SQL] Remove PrimitiveType

2015-04-22 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/5646 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [HOTFIX][SQL] Fix broken cached test

2015-04-22 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/5640 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5612#issuecomment-95464897 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5612#issuecomment-95464884 [Test build #30813 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30813/consoleFull) for PR 5612 at commit [`3d7f589`](https://gith

[GitHub] spark pull request: [MLlib] Add support for BooleanType to VectorA...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5648#issuecomment-95464723 [Test build #30811 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30811/consoleFull) for PR 5648 at commit [`1bf3d40`](https://gith

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread adrian-wang
Github user adrian-wang commented on a diff in the pull request: https://github.com/apache/spark/pull/5643#discussion_r28940019 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala --- @@ -298,6 +298,13 @@ class SQLQuerySuite extends QueryTest with BeforeAndA

[GitHub] spark pull request: [SPARK-7068][SQL] Remove PrimitiveType

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5646#issuecomment-95464699 [Test build #30810 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30810/consoleFull) for PR 5646 at commit [`01b673d`](https://gith

[GitHub] spark pull request: [MLlib] Add support for BooleanType to VectorA...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5648#issuecomment-95464728 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-7069][SQL] Rename NativeType -> AtomicT...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5650#issuecomment-95464700 [Test build #30816 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30816/consoleFull) for PR 5650 at commit [`97d2e1d`](https://githu

[GitHub] spark pull request: [SPARK-7068][SQL] Remove PrimitiveType

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5646#issuecomment-95464709 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-7069][SQL] Rename NativeType -> AtomicT...

2015-04-22 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/5650#issuecomment-95464577 This also contains https://github.com/apache/spark/pull/5646 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well.

[GitHub] spark pull request: [SPARK-7069][SQL] Rename NativeType -> AtomicT...

2015-04-22 Thread rxin
GitHub user rxin opened a pull request: https://github.com/apache/spark/pull/5650 [SPARK-7069][SQL] Rename NativeType -> AtomicType You can merge this pull request into a Git repository by running: $ git pull https://github.com/rxin/spark atomic-type Alternatively you can rev

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread scwf
Github user scwf closed the pull request at: https://github.com/apache/spark/pull/5612 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enable

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread scwf
Github user scwf commented on a diff in the pull request: https://github.com/apache/spark/pull/5643#discussion_r28939761 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala --- @@ -298,6 +298,13 @@ class SQLQuerySuite extends QueryTest with BeforeAndAfterAll

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread scwf
Github user scwf commented on the pull request: https://github.com/apache/spark/pull/5612#issuecomment-95463452 i am closing this. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request: SPARK-7063 when lz4 compression is used, it ca...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5641#issuecomment-95462973 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95462958 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: SPARK-7063 when lz4 compression is used, it ca...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5641#issuecomment-95462960 [Test build #30806 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30806/consoleFull) for PR 5641 at commit [`c33827f`](https://gith

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95462951 [Test build #30808 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30808/consoleFull) for PR 5643 at commit [`19201e0`](https://gith

[GitHub] spark pull request: [SPARK-5342][YARN] Allow long running Spark ap...

2015-04-22 Thread harishreedharan
Github user harishreedharan commented on the pull request: https://github.com/apache/spark/pull/4688#issuecomment-95462834 I have done testing by making sure the app runs longer than the expiration time, but not killing the executors. I will test that scenario too and report back. In

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread scwf
Github user scwf commented on the pull request: https://github.com/apache/spark/pull/5612#issuecomment-95459557 Get it. interesting case! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: [SPARK-6862][Streaming][WebUI] Add BatchPage t...

2015-04-22 Thread zsxwing
Github user zsxwing commented on the pull request: https://github.com/apache/spark/pull/5473#issuecomment-95457175 @tdas, I have a question about the failure case: If a Spark job throws an exception, it will propagate to `ssc.awaitTermination()`. In such case, I suppose the user will

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread adrian-wang
Github user adrian-wang commented on the pull request: https://github.com/apache/spark/pull/5612#issuecomment-95456832 select a1 from t1 left semi join t3 on t1.a1 = t3.a3 and t1.a1>t3.b3 - 100; --- If your project is set up for it, you can reply to this email and have your reply appe

[GitHub] spark pull request: [MLlib] UnaryTransformer nullability should no...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5644#issuecomment-95449763 [Test build #30807 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30807/consoleFull) for PR 5644 at commit [`a727e5b`](https://gith

[GitHub] spark pull request: [MLlib] UnaryTransformer nullability should no...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5644#issuecomment-95449803 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-5342][YARN] Allow long running Spark ap...

2015-04-22 Thread harishreedharan
Github user harishreedharan commented on a diff in the pull request: https://github.com/apache/spark/pull/4688#discussion_r28938263 --- Diff: yarn/src/main/scala/org/apache/spark/deploy/yarn/AMDelegationTokenRenewer.scala --- @@ -0,0 +1,203 @@ +/* + * Licensed to the Apach

[GitHub] spark pull request: [SPARK-5342][YARN] Allow long running Spark ap...

2015-04-22 Thread harishreedharan
Github user harishreedharan commented on a diff in the pull request: https://github.com/apache/spark/pull/4688#discussion_r28938209 --- Diff: core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala --- @@ -121,8 +125,7 @@ class CoarseGrainedSched

[GitHub] spark pull request: [SPARK-5342][YARN] Allow long running Spark ap...

2015-04-22 Thread harishreedharan
Github user harishreedharan commented on a diff in the pull request: https://github.com/apache/spark/pull/4688#discussion_r28937949 --- Diff: yarn/src/main/scala/org/apache/spark/deploy/yarn/AMDelegationTokenRenewer.scala --- @@ -0,0 +1,203 @@ +/* + * Licensed to the Apach

[GitHub] spark pull request: [SPARK-5342][YARN] Allow long running Spark ap...

2015-04-22 Thread harishreedharan
Github user harishreedharan commented on a diff in the pull request: https://github.com/apache/spark/pull/4688#discussion_r28937965 --- Diff: yarn/src/main/scala/org/apache/spark/deploy/yarn/AMDelegationTokenRenewer.scala --- @@ -0,0 +1,203 @@ +/* + * Licensed to the Apach

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95446869 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95446846 [Test build #30805 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30805/consoleFull) for PR 5643 at commit [`cf435db`](https://gith

[GitHub] spark pull request: [SPARK-7055][SQL]Use correct ClassLoader for J...

2015-04-22 Thread vinodkc
Github user vinodkc commented on the pull request: https://github.com/apache/spark/pull/5633#issuecomment-95446962 @marmbrus, Have you tried MySQL and Postgres integration tests for this PR ? https://github.com/apache/spark/pull/5543 --- If your project is set up for it, you

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread chenghao-intel
Github user chenghao-intel commented on the pull request: https://github.com/apache/spark/pull/5612#issuecomment-95446474 Assume we have data t1(2), t3((2,2), (2,1)) `select a1 from t1 left semi join t3 on t1.a1 = t3.a3 and t1.a1>=t3.b3` will only outputs 2. However, if we

[GitHub] spark pull request: [SPARK-7051] [SQL] Configuration for parquet d...

2015-04-22 Thread chenghao-intel
Github user chenghao-intel commented on a diff in the pull request: https://github.com/apache/spark/pull/5630#discussion_r28937606 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/SQLConf.scala --- @@ -36,6 +36,8 @@ private[spark] object SQLConf { val PARQUET_INT96_AS_T

[GitHub] spark pull request: [SPARK-7068][SQL] Remove PrimitiveType

2015-04-22 Thread liancheng
Github user liancheng commented on the pull request: https://github.com/apache/spark/pull/5646#issuecomment-95442354 LGTM pending Jenkins. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: [SPARK-7070][MLLIB] LDA.setBeta should call se...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5649#issuecomment-95442316 [Test build #30815 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30815/consoleFull) for PR 5649 at commit [`f06048f`](https://githu

[GitHub] spark pull request: [SPARK-7051] [SQL] Configuration for parquet d...

2015-04-22 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/5630#discussion_r28937324 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/SQLConf.scala --- @@ -36,6 +36,8 @@ private[spark] object SQLConf { val PARQUET_INT96_AS_TIMESTAMP

[GitHub] spark pull request: [MLlib] Add support for BooleanType to VectorA...

2015-04-22 Thread mengxr
Github user mengxr commented on the pull request: https://github.com/apache/spark/pull/5648#issuecomment-95442116 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and

[GitHub] spark pull request: [SPARK-7056] Make the Write Ahead Log pluggabl...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5645#issuecomment-95442062 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-7056] Make the Write Ahead Log pluggabl...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5645#issuecomment-95442059 [Test build #30814 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30814/consoleFull) for PR 5645 at commit [`837c4f5`](https://gith

[GitHub] spark pull request: [SPARK-7056] Make the Write Ahead Log pluggabl...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5645#issuecomment-95441936 [Test build #30814 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30814/consoleFull) for PR 5645 at commit [`837c4f5`](https://githu

[GitHub] spark pull request: [SPARK-7070][MLLIB] LDA.setBeta should call se...

2015-04-22 Thread mengxr
GitHub user mengxr opened a pull request: https://github.com/apache/spark/pull/5649 [SPARK-7070][MLLIB] LDA.setBeta should call setTopicConcentration. @jkbradley You can merge this pull request into a Git repository by running: $ git pull https://github.com/mengxr/spark SPARK-7

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread scwf
Github user scwf commented on the pull request: https://github.com/apache/spark/pull/5612#issuecomment-95441755 in this case `select a1 from t1 left semi join t3 on t1.a1 = t3.a3` will not go into hash join, it should go with left semi join since there is no additional condition ---

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread adrian-wang
Github user adrian-wang commented on the pull request: https://github.com/apache/spark/pull/5612#issuecomment-95440920 This is still not right... table t1 (a1 int) table t3 (a3 int, b3 int) t1: {1,2,3} t3: {(2,1),(2,2),(2,2)} select a1 from t1 left semi join t3

[GitHub] spark pull request: [SPARK-7068][SQL] Remove PrimitiveType

2015-04-22 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/5646#discussion_r28937160 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/parquet/ParquetTypes.scala --- @@ -48,8 +48,10 @@ private[parquet] case class ParquetTypeInfo(

[GitHub] spark pull request: [SPARK-7051] [SQL] Configuration for parquet d...

2015-04-22 Thread chenghao-intel
Github user chenghao-intel commented on a diff in the pull request: https://github.com/apache/spark/pull/5630#discussion_r28937128 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/SQLConf.scala --- @@ -36,6 +36,8 @@ private[spark] object SQLConf { val PARQUET_INT96_AS_T

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread scwf
Github user scwf commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95439709 Yes, i understand that when no additional condition it go the old way. i mean when there are additional conditions, your broadcasting hashmap may be much bigger since you al

[GitHub] spark pull request: [SPARK-7068][SQL] Remove PrimitiveType

2015-04-22 Thread liancheng
Github user liancheng commented on a diff in the pull request: https://github.com/apache/spark/pull/5646#discussion_r28937072 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/parquet/ParquetTypes.scala --- @@ -48,8 +48,10 @@ private[parquet] case class ParquetTypeInfo(

[GitHub] spark pull request: [SPARK-6752][Streaming] Allow StreamingContext...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5428#issuecomment-95439240 [Test build #696 has started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/696/consoleFull) for PR 5428 at commit [`94db63c`](https://github

[GitHub] spark pull request: [MLlib] Add support for BooleanType to VectorA...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5648#issuecomment-95438748 [Test build #30811 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30811/consoleFull) for PR 5648 at commit [`1bf3d40`](https://githu

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95438751 [Test build #30812 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30812/consoleFull) for PR 5643 at commit [`41c20d5`](https://githu

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5612#issuecomment-95438125 [Test build #30813 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30813/consoleFull) for PR 5612 at commit [`3d7f589`](https://githu

[GitHub] spark pull request: [SPARK-7068][SQL] Remove PrimitiveType

2015-04-22 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/5646#discussion_r28937000 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/types/dataTypes.scala --- @@ -190,13 +205,11 @@ object DataType { equalsIgnoreNullability

[GitHub] spark pull request: [SPARK-7068][SQL] Remove PrimitiveType

2015-04-22 Thread rxin
Github user rxin commented on a diff in the pull request: https://github.com/apache/spark/pull/5646#discussion_r28936992 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/types/dataTypes.scala --- @@ -41,6 +41,21 @@ import org.apache.spark.util.Utils object DataType {

[GitHub] spark pull request: [SPARK-6612] [MLLib] [PySpark] Python KMeans p...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5647#issuecomment-95435307 [Test build #30809 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30809/consoleFull) for PR 5647 at commit [`9903837`](https://githu

[GitHub] spark pull request: [SPARK-7068][SQL] Remove PrimitiveType

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5646#issuecomment-95435210 [Test build #30810 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30810/consoleFull) for PR 5646 at commit [`01b673d`](https://githu

[GitHub] spark pull request: [MLlib] Add support for BooleanType to VectorA...

2015-04-22 Thread rxin
GitHub user rxin opened a pull request: https://github.com/apache/spark/pull/5648 [MLlib] Add support for BooleanType to VectorAssembler. You can merge this pull request into a Git repository by running: $ git pull https://github.com/rxin/spark vectorAssembler-boolean Alterna

[GitHub] spark pull request: [SPARK-6612] [MLLib] [PySpark] Python KMeans p...

2015-04-22 Thread FlytxtRnD
GitHub user FlytxtRnD opened a pull request: https://github.com/apache/spark/pull/5647 [SPARK-6612] [MLLib] [PySpark] Python KMeans parity The following items are added to Python kmeans: kmeans - setEpsilon, setInitializationSteps KMeansModel - computeCost, k You can me

[GitHub] spark pull request: [SPARK-6612] [MLLib] [PySpark] Python KMeans p...

2015-04-22 Thread FlytxtRnD
Github user FlytxtRnD closed the pull request at: https://github.com/apache/spark/pull/5391 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is e

[GitHub] spark pull request: [SPARK-7068][SQL] Remove PrimitiveType

2015-04-22 Thread rxin
GitHub user rxin opened a pull request: https://github.com/apache/spark/pull/5646 [SPARK-7068][SQL] Remove PrimitiveType You can merge this pull request into a Git repository by running: $ git pull https://github.com/rxin/spark remove-primitive-type Alternatively you can revi

[GitHub] spark pull request: [SPARK-6067][SQL][WIP]delete the file with the...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/4980#issuecomment-95433297 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-6067][SQL][WIP]delete the file with the...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4980#issuecomment-95433275 [Test build #30804 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30804/consoleFull) for PR 4980 at commit [`4d5f65c`](https://gith

[GitHub] spark pull request: [SPARK-7051] [SQL] Configuration for parquet d...

2015-04-22 Thread yhuai
Github user yhuai commented on a diff in the pull request: https://github.com/apache/spark/pull/5630#discussion_r28936699 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/SQLConf.scala --- @@ -36,6 +36,8 @@ private[spark] object SQLConf { val PARQUET_INT96_AS_TIMESTAMP

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95433022 [Test build #30808 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30808/consoleFull) for PR 5643 at commit [`19201e0`](https://githu

[GitHub] spark pull request: [HOTFIX][SQL] Ignore flaky CachedTableSuite te...

2015-04-22 Thread tdas
Github user tdas commented on the pull request: https://github.com/apache/spark/pull/5639#issuecomment-95432802 @marmbrus Mind if I merge this? This is blocking a number of PRs. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread adrian-wang
Github user adrian-wang commented on a diff in the pull request: https://github.com/apache/spark/pull/5643#discussion_r28936597 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/joins/BroadcastLeftSemiJoinHash.scala --- @@ -32,15 +32,22 @@ case class BroadcastLeftSe

[GitHub] spark pull request: [SPARK-7066][MLlib] VectorAssembler should use...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5642#issuecomment-95432554 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-7066][MLlib] VectorAssembler should use...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5642#issuecomment-95432551 [Test build #30799 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30799/consoleFull) for PR 5642 at commit [`e23af5b`](https://gith

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95430732 [Test build #30800 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30800/consoleFull) for PR 5643 at commit [`a99f492`](https://gith

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95430736 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-7046] Remove InputMetrics from BlockRes...

2015-04-22 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/5627 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [SPARK-7066][MLlib] VectorAssembler should use...

2015-04-22 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/5642 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread scwf
Github user scwf commented on a diff in the pull request: https://github.com/apache/spark/pull/5643#discussion_r28936107 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/joins/BroadcastLeftSemiJoinHash.scala --- @@ -32,15 +32,22 @@ case class BroadcastLeftSemiJoinH

[GitHub] spark pull request: [MLlib] UnaryTransformer nullability should no...

2015-04-22 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/5644 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is enab

[GitHub] spark pull request: [SPARK-7055][SQL]Use correct ClassLoader for J...

2015-04-22 Thread tmyklebu
Github user tmyklebu commented on the pull request: https://github.com/apache/spark/pull/5633#issuecomment-95428960 Do the MySQL and Postgres integration tests both pass with this change? It's not a security issue per se. The trouble is that JDBC's security stuff "fails close

[GitHub] spark pull request: [MLlib] UnaryTransformer nullability should no...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5644#issuecomment-95427275 [Test build #30807 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30807/consoleFull) for PR 5644 at commit [`a727e5b`](https://githu

[GitHub] spark pull request: [MLlib] UnaryTransformer nullability should no...

2015-04-22 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/5644#issuecomment-95426549 Jenkins, retest this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have th

[GitHub] spark pull request: [SPARK-3468][WebUI] Timeline-View feature

2015-04-22 Thread pwendell
Github user pwendell commented on the pull request: https://github.com/apache/spark/pull/2342#issuecomment-95423610 If you have time to do an iteration in the next day or two, it would be helpful. We can continue to go back and fourth ideally over the next week to have it ready

[GitHub] spark pull request: SPARK-7063 when lz4 compression is used, it ca...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5641#issuecomment-95423349 [Test build #30806 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30806/consoleFull) for PR 5641 at commit [`c33827f`](https://githu

[GitHub] spark pull request: [MLlib] UnaryTransformer nullability should no...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5644#issuecomment-95423243 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [MLlib] UnaryTransformer nullability should no...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5644#issuecomment-95423236 [Test build #30801 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30801/consoleFull) for PR 5644 at commit [`a727e5b`](https://gith

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread adrian-wang
Github user adrian-wang commented on the pull request: https://github.com/apache/spark/pull/5612#issuecomment-95423015 table t1 (a1 int) table t2 (a2 int) t1: {1,2,3} t2: {2,2,2} select a1 from t1 left semi join t2 on t1.a1 = t2.a2; should output {2}

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95422651 [Test build #30805 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30805/consoleFull) for PR 5643 at commit [`cf435db`](https://githu

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread adrian-wang
Github user adrian-wang commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95422614 @scwf Of course we can go the old way when there's no additional conditions. --- If your project is set up for it, you can reply to this email and have your reply app

[GitHub] spark pull request: SPARK-7063 when lz4 compression is used, it ca...

2015-04-22 Thread pwendell
Github user pwendell commented on the pull request: https://github.com/apache/spark/pull/5641#issuecomment-95422599 Jenkins, test this please. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: [SPARK-7055][SQL]Use correct ClassLoader for J...

2015-04-22 Thread pwendell
Github user pwendell commented on the pull request: https://github.com/apache/spark/pull/5633#issuecomment-95422532 This is how we usually do class loading, but IIRC, there is an issue with certain JDBC drivers where they need to be loaded from the primordial classloader or else there

[GitHub] spark pull request: [SPARK-6418] Add simple per-stage visualizatio...

2015-04-22 Thread punya
Github user punya commented on the pull request: https://github.com/apache/spark/pull/5547#issuecomment-95420852 Also, if you're looking for a reasonable path to panning/zooming, you might want to take a look at http://plottablejs.org/. --- If your project is set up for it, you can r

[GitHub] spark pull request: [SPARK-7026] [SQL] fix left semi join with equ...

2015-04-22 Thread scwf
Github user scwf commented on the pull request: https://github.com/apache/spark/pull/5643#issuecomment-95416494 I am not sure it is suitable to broadcast a hashmap contains key and related rows, this maybe much bigger than the old hashset, may cause OOM issue. --- If your project is

[GitHub] spark pull request: [SPARK-7026][SQL] Fix bugs when there are non ...

2015-04-22 Thread scwf
Github user scwf commented on the pull request: https://github.com/apache/spark/pull/5612#issuecomment-95415298 >>> If you do shuffleHashJoin and then filter the result, you may get wrong results for the join if the right table contains rows with same key. can you give a e

[GitHub] spark pull request: [SPARK-6067][SQL][WIP]delete the file with the...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/4980#issuecomment-95413864 [Test build #30804 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30804/consoleFull) for PR 4980 at commit [`4d5f65c`](https://githu

[GitHub] spark pull request: [SPARK-6067][SQL][WIP]delete the file with the...

2015-04-22 Thread baishuo
Github user baishuo commented on the pull request: https://github.com/apache/spark/pull/4980#issuecomment-95413614 thank you @srowen --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feat

[GitHub] spark pull request: [SPARK-7009] repackaging spark assembly jar wi...

2015-04-22 Thread zhzhan
Github user zhzhan commented on the pull request: https://github.com/apache/spark/pull/5637#issuecomment-95413475 Retest it please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request: [SPARK-7056] Make the Write Ahead Log pluggabl...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5645#issuecomment-95413174 [Test build #30803 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30803/consoleFull) for PR 5645 at commit [`09bc6fe`](https://gith

[GitHub] spark pull request: [SPARK-7056] Make the Write Ahead Log pluggabl...

2015-04-22 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/5645#issuecomment-95413177 Test FAILed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30

[GitHub] spark pull request: [SPARK-7056] Make the Write Ahead Log pluggabl...

2015-04-22 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/5645#issuecomment-95413168 [Test build #30803 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30803/consoleFull) for PR 5645 at commit [`09bc6fe`](https://githu

  1   2   3   4   5   6   >