Github user rxin commented on the pull request:
https://github.com/apache/spark/pull/5650#issuecomment-95465861
Going to re-open a PR.
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this fe
Github user rxin closed the pull request at:
https://github.com/apache/spark/pull/5650
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95465033
[Test build #30812 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30812/consoleFull)
for PR 5643 at commit
[`41c20d5`](https://gith
Github user asfgit closed the pull request at:
https://github.com/apache/spark/pull/5648
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95465053
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user asfgit closed the pull request at:
https://github.com/apache/spark/pull/5646
Github user asfgit closed the pull request at:
https://github.com/apache/spark/pull/5640
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5612#issuecomment-95464897
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5612#issuecomment-95464884
[Test build #30813 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30813/consoleFull)
for PR 5612 at commit
[`3d7f589`](https://gith
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5648#issuecomment-95464723
[Test build #30811 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30811/consoleFull)
for PR 5648 at commit
[`1bf3d40`](https://gith
Github user adrian-wang commented on a diff in the pull request:
https://github.com/apache/spark/pull/5643#discussion_r28940019
--- Diff: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala
---
@@ -298,6 +298,13 @@ class SQLQuerySuite extends QueryTest with
BeforeAndA
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5646#issuecomment-95464699
[Test build #30810 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30810/consoleFull)
for PR 5646 at commit
[`01b673d`](https://gith
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5648#issuecomment-95464728
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5650#issuecomment-95464700
[Test build #30816 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30816/consoleFull)
for PR 5650 at commit
[`97d2e1d`](https://githu
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5646#issuecomment-95464709
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user rxin commented on the pull request:
https://github.com/apache/spark/pull/5650#issuecomment-95464577
This also contains https://github.com/apache/spark/pull/5646
GitHub user rxin opened a pull request:
https://github.com/apache/spark/pull/5650
[SPARK-7069][SQL] Rename NativeType -> AtomicType
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/rxin/spark atomic-type
Alternatively you can rev
Github user scwf closed the pull request at:
https://github.com/apache/spark/pull/5612
Github user scwf commented on a diff in the pull request:
https://github.com/apache/spark/pull/5643#discussion_r28939761
--- Diff: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala
---
@@ -298,6 +298,13 @@ class SQLQuerySuite extends QueryTest with
BeforeAndAfterAll
Github user scwf commented on the pull request:
https://github.com/apache/spark/pull/5612#issuecomment-95463452
I am closing this.
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5641#issuecomment-95462973
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95462958
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5641#issuecomment-95462960
[Test build #30806 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30806/consoleFull)
for PR 5641 at commit
[`c33827f`](https://gith
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95462951
[Test build #30808 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30808/consoleFull)
for PR 5643 at commit
[`19201e0`](https://gith
Github user harishreedharan commented on the pull request:
https://github.com/apache/spark/pull/4688#issuecomment-95462834
I have done testing by making sure the app runs longer than the expiration
time, but not killing the executors. I will test that scenario too and report
back. In
Github user scwf commented on the pull request:
https://github.com/apache/spark/pull/5612#issuecomment-95459557
Got it. Interesting case!
Github user zsxwing commented on the pull request:
https://github.com/apache/spark/pull/5473#issuecomment-95457175
@tdas, I have a question about the failure case: If a Spark job throws an
exception, it will propagate to `ssc.awaitTermination()`. In such case, I
suppose the user will
Github user adrian-wang commented on the pull request:
https://github.com/apache/spark/pull/5612#issuecomment-95456832
select a1 from t1 left semi join t3 on t1.a1 = t3.a3 and t1.a1>t3.b3 - 100;
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5644#issuecomment-95449763
[Test build #30807 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30807/consoleFull)
for PR 5644 at commit
[`a727e5b`](https://gith
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5644#issuecomment-95449803
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user harishreedharan commented on a diff in the pull request:
https://github.com/apache/spark/pull/4688#discussion_r28938263
--- Diff:
yarn/src/main/scala/org/apache/spark/deploy/yarn/AMDelegationTokenRenewer.scala
---
@@ -0,0 +1,203 @@
+/*
+ * Licensed to the Apach
Github user harishreedharan commented on a diff in the pull request:
https://github.com/apache/spark/pull/4688#discussion_r28938209
--- Diff:
core/src/main/scala/org/apache/spark/scheduler/cluster/CoarseGrainedSchedulerBackend.scala
---
@@ -121,8 +125,7 @@ class CoarseGrainedSched
Github user harishreedharan commented on a diff in the pull request:
https://github.com/apache/spark/pull/4688#discussion_r28937949
--- Diff:
yarn/src/main/scala/org/apache/spark/deploy/yarn/AMDelegationTokenRenewer.scala
---
@@ -0,0 +1,203 @@
+/*
+ * Licensed to the Apach
Github user harishreedharan commented on a diff in the pull request:
https://github.com/apache/spark/pull/4688#discussion_r28937965
--- Diff:
yarn/src/main/scala/org/apache/spark/deploy/yarn/AMDelegationTokenRenewer.scala
---
@@ -0,0 +1,203 @@
+/*
+ * Licensed to the Apach
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95446869
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95446846
[Test build #30805 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30805/consoleFull)
for PR 5643 at commit
[`cf435db`](https://gith
Github user vinodkc commented on the pull request:
https://github.com/apache/spark/pull/5633#issuecomment-95446962
@marmbrus,
Have you tried the MySQL and Postgres integration tests for this PR?
https://github.com/apache/spark/pull/5543
Github user chenghao-intel commented on the pull request:
https://github.com/apache/spark/pull/5612#issuecomment-95446474
Assume we have data t1(2), t3((2,2), (2,1))
`select a1 from t1 left semi join t3 on t1.a1 = t3.a3 and t1.a1>=t3.b3`
will only output 2.
However, if we
Github user chenghao-intel commented on a diff in the pull request:
https://github.com/apache/spark/pull/5630#discussion_r28937606
--- Diff: sql/core/src/main/scala/org/apache/spark/sql/SQLConf.scala ---
@@ -36,6 +36,8 @@ private[spark] object SQLConf {
val PARQUET_INT96_AS_T
Github user liancheng commented on the pull request:
https://github.com/apache/spark/pull/5646#issuecomment-95442354
LGTM pending Jenkins.
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5649#issuecomment-95442316
[Test build #30815 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30815/consoleFull)
for PR 5649 at commit
[`f06048f`](https://githu
Github user yhuai commented on a diff in the pull request:
https://github.com/apache/spark/pull/5630#discussion_r28937324
--- Diff: sql/core/src/main/scala/org/apache/spark/sql/SQLConf.scala ---
@@ -36,6 +36,8 @@ private[spark] object SQLConf {
val PARQUET_INT96_AS_TIMESTAMP
Github user mengxr commented on the pull request:
https://github.com/apache/spark/pull/5648#issuecomment-95442116
LGTM
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5645#issuecomment-95442062
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5645#issuecomment-95442059
[Test build #30814 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30814/consoleFull)
for PR 5645 at commit
[`837c4f5`](https://gith
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5645#issuecomment-95441936
[Test build #30814 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30814/consoleFull)
for PR 5645 at commit
[`837c4f5`](https://githu
GitHub user mengxr opened a pull request:
https://github.com/apache/spark/pull/5649
[SPARK-7070][MLLIB] LDA.setBeta should call setTopicConcentration.
@jkbradley
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/mengxr/spark SPARK-7
Github user scwf commented on the pull request:
https://github.com/apache/spark/pull/5612#issuecomment-95441755
In this case `select a1 from t1 left semi join t3 on t1.a1 = t3.a3` will
not go into hash join; it should go with left semi join since there is no
additional condition.
Github user adrian-wang commented on the pull request:
https://github.com/apache/spark/pull/5612#issuecomment-95440920
This is still not right...
table t1 (a1 int)
table t3 (a3 int, b3 int)
t1: {1,2,3}
t3: {(2,1),(2,2),(2,2)}
select a1 from t1 left semi join t3
Github user liancheng commented on a diff in the pull request:
https://github.com/apache/spark/pull/5646#discussion_r28937160
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/parquet/ParquetTypes.scala ---
@@ -48,8 +48,10 @@ private[parquet] case class ParquetTypeInfo(
Github user chenghao-intel commented on a diff in the pull request:
https://github.com/apache/spark/pull/5630#discussion_r28937128
--- Diff: sql/core/src/main/scala/org/apache/spark/sql/SQLConf.scala ---
@@ -36,6 +36,8 @@ private[spark] object SQLConf {
val PARQUET_INT96_AS_T
Github user scwf commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95439709
Yes, I understand that when there is no additional condition it goes the old
way. I mean that when there are additional conditions, your broadcast hashmap
may be much bigger since you al
Github user liancheng commented on a diff in the pull request:
https://github.com/apache/spark/pull/5646#discussion_r28937072
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/parquet/ParquetTypes.scala ---
@@ -48,8 +48,10 @@ private[parquet] case class ParquetTypeInfo(
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5428#issuecomment-95439240
[Test build #696 has
started](https://amplab.cs.berkeley.edu/jenkins/job/NewSparkPullRequestBuilder/696/consoleFull)
for PR 5428 at commit
[`94db63c`](https://github
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5648#issuecomment-95438748
[Test build #30811 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30811/consoleFull)
for PR 5648 at commit
[`1bf3d40`](https://githu
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95438751
[Test build #30812 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30812/consoleFull)
for PR 5643 at commit
[`41c20d5`](https://githu
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5612#issuecomment-95438125
[Test build #30813 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30813/consoleFull)
for PR 5612 at commit
[`3d7f589`](https://githu
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/5646#discussion_r28937000
--- Diff:
sql/catalyst/src/main/scala/org/apache/spark/sql/types/dataTypes.scala ---
@@ -190,13 +205,11 @@ object DataType {
equalsIgnoreNullability
Github user rxin commented on a diff in the pull request:
https://github.com/apache/spark/pull/5646#discussion_r28936992
--- Diff:
sql/catalyst/src/main/scala/org/apache/spark/sql/types/dataTypes.scala ---
@@ -41,6 +41,21 @@ import org.apache.spark.util.Utils
object DataType {
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5647#issuecomment-95435307
[Test build #30809 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30809/consoleFull)
for PR 5647 at commit
[`9903837`](https://githu
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5646#issuecomment-95435210
[Test build #30810 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30810/consoleFull)
for PR 5646 at commit
[`01b673d`](https://githu
GitHub user rxin opened a pull request:
https://github.com/apache/spark/pull/5648
[MLlib] Add support for BooleanType to VectorAssembler.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/rxin/spark vectorAssembler-boolean
Alterna
GitHub user FlytxtRnD opened a pull request:
https://github.com/apache/spark/pull/5647
[SPARK-6612] [MLLib] [PySpark] Python KMeans parity
The following items are added to Python kmeans:
kmeans - setEpsilon, setInitializationSteps
KMeansModel - computeCost, k
You can me
Github user FlytxtRnD closed the pull request at:
https://github.com/apache/spark/pull/5391
GitHub user rxin opened a pull request:
https://github.com/apache/spark/pull/5646
[SPARK-7068][SQL] Remove PrimitiveType
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/rxin/spark remove-primitive-type
Alternatively you can revi
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/4980#issuecomment-95433297
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/4980#issuecomment-95433275
[Test build #30804 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30804/consoleFull)
for PR 4980 at commit
[`4d5f65c`](https://gith
Github user yhuai commented on a diff in the pull request:
https://github.com/apache/spark/pull/5630#discussion_r28936699
--- Diff: sql/core/src/main/scala/org/apache/spark/sql/SQLConf.scala ---
@@ -36,6 +36,8 @@ private[spark] object SQLConf {
val PARQUET_INT96_AS_TIMESTAMP
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95433022
[Test build #30808 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30808/consoleFull)
for PR 5643 at commit
[`19201e0`](https://githu
Github user tdas commented on the pull request:
https://github.com/apache/spark/pull/5639#issuecomment-95432802
@marmbrus Mind if I merge this? This is blocking a number of PRs.
Github user adrian-wang commented on a diff in the pull request:
https://github.com/apache/spark/pull/5643#discussion_r28936597
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/joins/BroadcastLeftSemiJoinHash.scala
---
@@ -32,15 +32,22 @@ case class BroadcastLeftSe
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5642#issuecomment-95432554
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5642#issuecomment-95432551
[Test build #30799 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30799/consoleFull)
for PR 5642 at commit
[`e23af5b`](https://gith
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95430732
[Test build #30800 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30800/consoleFull)
for PR 5643 at commit
[`a99f492`](https://gith
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95430736
Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user asfgit closed the pull request at:
https://github.com/apache/spark/pull/5627
Github user asfgit closed the pull request at:
https://github.com/apache/spark/pull/5642
Github user scwf commented on a diff in the pull request:
https://github.com/apache/spark/pull/5643#discussion_r28936107
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/joins/BroadcastLeftSemiJoinHash.scala
---
@@ -32,15 +32,22 @@ case class BroadcastLeftSemiJoinH
Github user asfgit closed the pull request at:
https://github.com/apache/spark/pull/5644
Github user tmyklebu commented on the pull request:
https://github.com/apache/spark/pull/5633#issuecomment-95428960
Do the MySQL and Postgres integration tests both pass with this change?
It's not a security issue per se. The trouble is that JDBC's security
stuff "fails close
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5644#issuecomment-95427275
[Test build #30807 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30807/consoleFull)
for PR 5644 at commit
[`a727e5b`](https://githu
Github user rxin commented on the pull request:
https://github.com/apache/spark/pull/5644#issuecomment-95426549
Jenkins, retest this please.
Github user pwendell commented on the pull request:
https://github.com/apache/spark/pull/2342#issuecomment-95423610
If you have time to do an iteration in the next day or two, it would be
helpful. We can continue to go back and forth ideally over the next week
to have it ready
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5641#issuecomment-95423349
[Test build #30806 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30806/consoleFull)
for PR 5641 at commit
[`c33827f`](https://githu
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5644#issuecomment-95423243
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5644#issuecomment-95423236
[Test build #30801 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30801/consoleFull)
for PR 5644 at commit
[`a727e5b`](https://gith
Github user adrian-wang commented on the pull request:
https://github.com/apache/spark/pull/5612#issuecomment-95423015
table t1 (a1 int)
table t2 (a2 int)
t1: {1,2,3}
t2: {2,2,2}
select a1 from t1 left semi join t2 on t1.a1 = t2.a2;
should output {2}
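A minimal sketch of the semantics described in this message, written in plain Python rather than Spark SQL: a left semi join emits each left row at most once if any right row matches, however many duplicates the right side holds. The function name `left_semi_join` is hypothetical, and this is not Spark's implementation.

```python
def left_semi_join(left, right):
    """Keep each left value that has at least one equal value on the right."""
    right_keys = set(right)  # duplicates on the right collapse into one key
    return [a for a in left if a in right_keys]


t1 = [1, 2, 3]
t2 = [2, 2, 2]
print(left_semi_join(t1, t2))  # [2]

# An inner join, by contrast, emits one row per matching pair.
inner = [a for a in t1 for b in t2 if a == b]
print(inner)  # [2, 2, 2]
```

The contrast with the inner join is why joining first and de-duplicating or filtering afterwards needs care when the right table repeats a key.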
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95422651
[Test build #30805 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30805/consoleFull)
for PR 5643 at commit
[`cf435db`](https://githu
Github user adrian-wang commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95422614
@scwf Of course we can go the old way when there are no additional conditions.
Github user pwendell commented on the pull request:
https://github.com/apache/spark/pull/5641#issuecomment-95422599
Jenkins, test this please.
Github user pwendell commented on the pull request:
https://github.com/apache/spark/pull/5633#issuecomment-95422532
This is how we usually do class loading, but IIRC, there is an issue with
certain JDBC drivers where they need to be loaded from the primordial
classloader or else there
Github user punya commented on the pull request:
https://github.com/apache/spark/pull/5547#issuecomment-95420852
Also, if you're looking for a reasonable path to panning/zooming, you might
want to take a look at http://plottablejs.org/.
Github user scwf commented on the pull request:
https://github.com/apache/spark/pull/5643#issuecomment-95416494
I am not sure it is suitable to broadcast a hashmap that contains keys and
their related rows; this may be much bigger than the old hashset and may cause
OOM issues.
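To make the size concern concrete, here is a rough Python sketch, assuming a hypothetical build side of `(a3, b3)` rows: a key-only set stores each distinct key once, while a map from key to rows retains every row so that an additional condition can be evaluated against it. The data and the names `key_set`, `rows_by_key` and `semi_join_with_condition` are illustrative and not taken from the PR.

```python
from collections import defaultdict

# Hypothetical build-side rows (a3, b3); many rows share the key 2.
t3 = [(2, b3) for b3 in range(1000)]

# Key-only structure: enough when the join has no additional condition.
key_set = {a3 for a3, _ in t3}

# Key -> rows structure: needed to check a condition such as a1 >= b3.
rows_by_key = defaultdict(list)
for a3, b3 in t3:
    rows_by_key[a3].append((a3, b3))

print(len(key_set))                               # 1
print(sum(len(v) for v in rows_by_key.values()))  # 1000


def semi_join_with_condition(left, rows_by_key, cond):
    """Emit a left value once if any right row with the same key satisfies cond."""
    return [a1 for a1 in left
            if any(cond(a1, row) for row in rows_by_key.get(a1, []))]


print(semi_join_with_condition([1, 2, 3], rows_by_key,
                               lambda a1, row: a1 >= row[1]))  # [2]
```

With heavily duplicated keys the map grows with the number of rows rather than the number of distinct keys, which is the trade-off raised here.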
Github user scwf commented on the pull request:
https://github.com/apache/spark/pull/5612#issuecomment-95415298
>>>
If you do shuffleHashJoin and then filter the result, you may get wrong
results for the join if the right table contains rows with same key.
can you give a e
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/4980#issuecomment-95413864
[Test build #30804 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30804/consoleFull)
for PR 4980 at commit
[`4d5f65c`](https://githu
Github user baishuo commented on the pull request:
https://github.com/apache/spark/pull/4980#issuecomment-95413614
thank you @srowen
Github user zhzhan commented on the pull request:
https://github.com/apache/spark/pull/5637#issuecomment-95413475
Retest it please
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5645#issuecomment-95413174
[Test build #30803 has
finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30803/consoleFull)
for PR 5645 at commit
[`09bc6fe`](https://gith
Github user AmplabJenkins commented on the pull request:
https://github.com/apache/spark/pull/5645#issuecomment-95413177
Test FAILed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/30
Github user SparkQA commented on the pull request:
https://github.com/apache/spark/pull/5645#issuecomment-95413168
[Test build #30803 has
started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/30803/consoleFull)
for PR 5645 at commit
[`09bc6fe`](https://githu