[GitHub] spark pull request #17433: [SPARK-20100][SQL] Refactor SessionState initiali...

2017-03-27 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17433#discussion_r108112572 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/internal/SessionState.scala --- @@ -179,88 +132,295 @@ private[sql] class SessionState

[GitHub] spark pull request #17432: [SPARK-20086][SQL] CollapseWindow should not coll...

2017-03-26 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17432#discussion_r108074254 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala --- @@ -597,12 +597,14 @@ object CollapseRepartition

[GitHub] spark issue #17432: [SPARK-20086][SQL] CollapseWindow should not collapse de...

2017-03-26 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17432 The project will get eliminated by the optimizer. From the JIRA ticket: ```scala import org.apache.spark.sql.expressions.Window val win_spec_max = Window.partitionBy("x"

[GitHub] spark issue #17432: [SPARK-20086][SQL] CollapseWindow should not collapse de...

2017-03-26 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17432 Thanks for the review. Merging this to master/2.1. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #17164: [SPARK-16844][SQL] Support codegen for sort-based aggrea...

2017-03-26 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17164 @maropu I do think this is useful. However we really need to refactor the planner, if we want to get the most value from this. --- If your project is set up for it, you can reply to this email

[GitHub] spark issue #17432: [SPARK-20086][SQL] CollapseWindow should not collapse de...

2017-03-26 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17432 cc @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #17433: [SPARK-20100][SQL] Refactor SessionState initialization

2017-03-26 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17433 cc @kunalkhamar --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #17433: [SPARK-20100][SQL] Refactor SessionState initialization

2017-03-26 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17433 cc @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #17433: [SPARK-20100][SQL] Refactor SessionState initiali...

2017-03-26 Thread hvanhovell
GitHub user hvanhovell opened a pull request: https://github.com/apache/spark/pull/17433 [SPARK-20100][SQL] Refactor SessionState initialization ## What changes were proposed in this pull request? The current SessionState initialization code path is quite complex. A part

[GitHub] spark issue #17432: [SPARK-20086][SQL] CollapseWindow should not collapse de...

2017-03-26 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17432 cc @dongjoon-hyun --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark pull request #17432: [SPARK-20086][SQL] CollapseWindow should not coll...

2017-03-26 Thread hvanhovell
GitHub user hvanhovell opened a pull request: https://github.com/apache/spark/pull/17432 [SPARK-20086][SQL] CollapseWindow should not collapse dependent adjacent windows ## What changes were proposed in this pull request? The `CollapseWindow` is currently to aggressive when

[GitHub] spark issue #17378: [SPARK-20046][SQL] Facilitate loop optimizations in a JI...

2017-03-26 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17378 LGTM - merging to master thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17397: [SPARK-20070][SQL] Redact DataSourceScanExec treeString

2017-03-24 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17397 @ueshin here you go: https://github.com/apache/spark/pull/17420 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark pull request #17420: [SPARK-20070][SQL] Fix 2.10 build

2017-03-24 Thread hvanhovell
GitHub user hvanhovell opened a pull request: https://github.com/apache/spark/pull/17420 [SPARK-20070][SQL] Fix 2.10 build ## What changes were proposed in this pull request? Commit https://github.com/apache/spark/commit/91fa80fe8a2480d64c430bd10f97b3d44c007bcc broke the build

[GitHub] spark issue #17397: [SPARK-20070][SQL] Redact DataSourceScanExec treeString

2017-03-23 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17397 cc @liufengdb --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #17397: [SPARK-20070][SQL] Redact DataSourceScanExec tree...

2017-03-23 Thread hvanhovell
GitHub user hvanhovell opened a pull request: https://github.com/apache/spark/pull/17397 [SPARK-20070][SQL] Redact DataSourceScanExec treeString ## What changes were proposed in this pull request? The explain output of `DataSourceScanExec` can contain sensitive information

[GitHub] spark issue #17376: [SPARK-19994] [HOTFIX] [BRANCH-2.0] Change InnerLike to ...

2017-03-21 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17376 Merging to branch-2.0. Thanks! Can you close? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #17376: [SPARK-19994] [HOTFIX] [BRANCH-2.0] Change InnerLike to ...

2017-03-21 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17376 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #17350: [SPARK-20017][SQL] change the nullability of function 'S...

2017-03-21 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17350 @wzhfy why wouldn't we change the nullability? This is obviously a bug in the current master: https://github.com/apache/spark/blob/master/sql/catalyst/src/main/scala/org/apache/spark/sql

[GitHub] spark issue #17292: [SPARK-19946][testing]DebugFilesystem.assertNoOpenStream...

2017-03-16 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17292 LGTM - merging to master. Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17292: [SPARK-19946][testing]DebugFilesystem.assertNoOpenStream...

2017-03-16 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17292 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark pull request #17244: [SPARK-19889][SQL] Make TaskContext callbacks thr...

2017-03-16 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17244#discussion_r106363391 --- Diff: core/src/main/scala/org/apache/spark/TaskContextImpl.scala --- @@ -52,62 +63,79 @@ private[spark] class TaskContextImpl( @volatile

[GitHub] spark issue #16909: [SPARK-13450] Introduce ExternalAppendOnlyUnsafeRowArray...

2017-03-15 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/16909 LGTM - merging to master. Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark issue #17301: [SPARK-19944][SQL] Move SQLConf from sql/core to sql/cat...

2017-03-15 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17301 Merging this one. @rxin could you close? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #16909: [SPARK-13450] Introduce ExternalAppendOnlyUnsafeRowArray...

2017-03-15 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/16909 @tejasapatil there is a small merge conflict. Can you update this? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project

[GitHub] spark issue #17244: [SPARK-19889][SQL] Make TaskContext callbacks thread saf...

2017-03-15 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17244 Thanks for the reviews! Merging to master. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #17289: [SPARK-19948] Document that saveAsTable uses catalog as ...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17289 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #17292: DebugFilesystem.assertNoOpenStreams should report the op...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17292 @bogdanrdc can you add the ticket to the title? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #17294: [SPARK-18966][SQL] NOT IN subquery with correlated expre...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17294 @nsyca that makes sense. Lets not backport for now. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #17294: [SPARK-18966][SQL] NOT IN subquery with correlated expre...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17294 @nsyca should we backport this to 2.1/2.0? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #17294: [SPARK-18966][SQL] NOT IN subquery with correlated expre...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17294 Merging to master. Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #17294: [SPARK-18966][SQL] NOT IN subquery with correlated expre...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17294 LGTM - pending jenkins. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #17278: [SPARK-19933][SQL] Do not change output of a subquery

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17278 Merging to master. Thanks for the review! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #17264: [SPARK-19923][SQL] Remove unnecessary type conversions p...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17264 LGTM merging to master. Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark issue #17289: [SPARK-19948] Document that saveAsTable uses catalog as ...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17289 ok to test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #17288: [SPARK-18874][SQL] Fix 2.10 build after moving the subqu...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17288 Ok merging this to master. Thanks for the approval. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #17289: [SPARK-19948] Document that saveAsTable uses catalog as ...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17289 cc @cloud-fan --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #17289: [SPARK-19948] Document that saveAsTable uses catalog as ...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17289 ok to test --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark issue #17245: [SPARK-19850][SQL] Allow the use of aliases in SQL funct...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17245 Merging to master. Thanks! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #17285: [SPARK-19944][SQL] Move SQLConf from sql/core to ...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17285#discussion_r105878506 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/SimpleCatalystConf.scala --- @@ -0,0 +1,48 @@ +/* + * Licensed

[GitHub] spark pull request #17285: [SPARK-19944][SQL] Move SQLConf from sql/core to ...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17285#discussion_r105877761 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -251,7 +247,7 @@ object SQLConf

[GitHub] spark pull request #17285: [SPARK-19944][SQL] Move SQLConf from sql/core to ...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17285#discussion_r105874978 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/SimpleCatalystConf.scala --- @@ -0,0 +1,48 @@ +/* + * Licensed

[GitHub] spark pull request #17288: [SPARK-18874][SQL] Fix 2.10 build after moving th...

2017-03-14 Thread hvanhovell
GitHub user hvanhovell opened a pull request: https://github.com/apache/spark/pull/17288 [SPARK-18874][SQL] Fix 2.10 build after moving the subquery rules to optimization ## What changes were proposed in this pull request? Commit https://github.com/apache/spark/commit

[GitHub] spark pull request #17278: [SPARK-19933][SQL] Do not change output of a subq...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17278#discussion_r105862033 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala --- @@ -140,7 +140,8 @@ abstract class Optimizer

[GitHub] spark issue #16954: [SPARK-18874][SQL] First phase: Deferring the correlated...

2017-03-14 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/16954 LGTM. Merging to master. Thanks for the hard work, and your patience! --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark issue #17244: [SPARK-19889][SQL] Make TaskContext callbacks thread saf...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17244 Ok, had a small discussion offline. It seems weird that we have different calling policies for failure and completion listeners. I am going to change the invocation of completion listeners

[GitHub] spark pull request #17244: [SPARK-19889][SQL] Make TaskContext callbacks thr...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17244#discussion_r105788774 --- Diff: core/src/main/scala/org/apache/spark/TaskContext.scala --- @@ -126,14 +134,14 @@ abstract class TaskContext extends Serializable

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105780981 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/subquery.scala --- @@ -40,19 +43,189 @@ abstract class PlanExpression[T

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105780845 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/subquery.scala --- @@ -40,19 +43,189 @@ abstract class PlanExpression[T

[GitHub] spark pull request #17278: [SPARK-19933][SQL] Do not change output of a subq...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17278#discussion_r105773239 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala --- @@ -140,7 +140,8 @@ abstract class Optimizer

[GitHub] spark pull request #17278: [SPARK-19933][SQL] Do not change output of a subq...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17278#discussion_r105770937 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala --- @@ -140,7 +140,8 @@ abstract class Optimizer

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105766906 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala --- @@ -365,17 +368,73 @@ object TypeCoercion

[GitHub] spark pull request #17278: [SPARK-19933][SQL] Do not change output of a subq...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17278#discussion_r105764212 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala --- @@ -140,7 +140,8 @@ abstract class Optimizer

[GitHub] spark issue #16954: [SPARK-18874][SQL] First phase: Deferring the correlated...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/16954 @dilipbiswal can you update. I left a few minor comments, but it is good to go anyway (pending the update). --- If your project is set up for it, you can reply to this email and have your reply

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105724836 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/subquery.scala --- @@ -40,19 +43,189 @@ abstract class PlanExpression[T

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105723410 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -1204,80 +1250,44 @@ class Analyzer

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105719841 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/subquery.scala --- @@ -40,19 +43,189 @@ abstract class PlanExpression[T

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105718880 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/subquery.scala --- @@ -40,19 +43,189 @@ abstract class PlanExpression[T

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105717565 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/subquery.scala --- @@ -40,19 +43,189 @@ abstract class PlanExpression[T

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105717078 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/predicates.scala --- @@ -123,19 +123,36 @@ case class Not(child

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105716983 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/predicates.scala --- @@ -123,19 +123,36 @@ case class Not(child

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105715265 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala --- @@ -365,17 +368,73 @@ object TypeCoercion

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105714984 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala --- @@ -365,17 +385,66 @@ object TypeCoercion

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105716052 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala --- @@ -365,17 +368,73 @@ object TypeCoercion

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105714546 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala --- @@ -365,17 +368,73 @@ object TypeCoercion

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105712270 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala --- @@ -109,6 +109,26 @@ object TypeCoercion

[GitHub] spark pull request #16954: [SPARK-18874][SQL] First phase: Deferring the cor...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/16954#discussion_r105711886 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/TypeCoercion.scala --- @@ -109,6 +109,26 @@ object TypeCoercion

[GitHub] spark pull request #17241: [SPARK-19877][SQL] Restrict the nested level of a...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17241#discussion_r105709959 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -555,6 +555,17 @@ object SQLConf { .booleanConf

[GitHub] spark pull request #17241: [SPARK-19877][SQL] Restrict the nested level of a...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17241#discussion_r105709767 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala --- @@ -555,6 +555,17 @@ object SQLConf { .booleanConf

[GitHub] spark pull request #17241: [SPARK-19877][SQL] Restrict the nested level of a...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17241#discussion_r105709364 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -595,6 +594,11 @@ class Analyzer( case

[GitHub] spark pull request #17241: [SPARK-19877][SQL] Restrict the nested level of a...

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17241#discussion_r105709239 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala --- @@ -595,6 +594,11 @@ class Analyzer( case

[GitHub] spark issue #17278: [SPARK-19933][SQL] Do not change output of a subquery

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17278 cc @wzhfy --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so

[GitHub] spark pull request #17278: [SPARK-19933][SQL] Do not change output of a subq...

2017-03-13 Thread hvanhovell
GitHub user hvanhovell opened a pull request: https://github.com/apache/spark/pull/17278 [SPARK-19933][SQL] Do not change output of a subquery ## What changes were proposed in this pull request? The `RemoveRedundantAlias` rule can change the output attributes (the expression

[GitHub] spark issue #17278: [SPARK-19933][SQL] Do not change output of a subquery

2017-03-13 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17278 cc @cloud-fan @ueshin --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #17244: [SPARK-19889][SQL] Make TaskContext callbacks thr...

2017-03-12 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17244#discussion_r105559913 --- Diff: core/src/main/scala/org/apache/spark/TaskContextImpl.scala --- @@ -57,57 +68,75 @@ private[spark] class TaskContextImpl( // Whether

[GitHub] spark pull request #17244: [SPARK-19889][SQL] Make TaskContext callbacks thr...

2017-03-12 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17244#discussion_r105559906 --- Diff: core/src/main/scala/org/apache/spark/TaskContextImpl.scala --- @@ -57,57 +68,75 @@ private[spark] class TaskContextImpl( // Whether

[GitHub] spark pull request #17244: [SPARK-19889][SQL] Make TaskContext callbacks thr...

2017-03-12 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17244#discussion_r105559898 --- Diff: core/src/main/scala/org/apache/spark/TaskContextImpl.scala --- @@ -57,57 +68,75 @@ private[spark] class TaskContextImpl( // Whether

[GitHub] spark pull request #17244: [SPARK-19889][SQL] Make TaskContext callbacks thr...

2017-03-12 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17244#discussion_r105559900 --- Diff: core/src/main/scala/org/apache/spark/TaskContextImpl.scala --- @@ -57,57 +68,75 @@ private[spark] class TaskContextImpl( // Whether

[GitHub] spark issue #17264: [SPARK-19923][SQL] Remove unnecessary type conversions p...

2017-03-12 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17264 @maropu this looks like a nice improvement. It is an idea to check all uses of `wrap`, and see if we can replace them by `wrapperFor`? --- If your project is set up for it, you can reply

[GitHub] spark pull request #17244: [SPARK-19889][SQL] Make TaskContext callbacks thr...

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17244#discussion_r105491214 --- Diff: core/src/main/scala/org/apache/spark/TaskContextImpl.scala --- @@ -57,57 +68,75 @@ private[spark] class TaskContextImpl( // Whether

[GitHub] spark issue #17241: [SPARK-19877][SQL] Restrict the nested level of a view

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17241 Yeah you have a point there. cc @rxin wdyt? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark issue #17122: [SPARK-19786][SQL] Facilitate loop optimizations in a JI...

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17122 LGTM merging to master --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled

[GitHub] spark pull request #17245: [SPARK-19850][SQL] Allow the use of aliases in SQ...

2017-03-10 Thread hvanhovell
GitHub user hvanhovell opened a pull request: https://github.com/apache/spark/pull/17245 [SPARK-19850][SQL] Allow the use of aliases in SQL function calls ## What changes were proposed in this pull request? We currently cannot use aliases in SQL function calls

[GitHub] spark issue #17244: [SPARK-19889][SQL] Make TaskContext callbacks thread saf...

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17244 cc @rxin @sameeragarwal @zsxwing --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature

[GitHub] spark pull request #17244: [SPARK-19889][SQL] Make TaskContext callbacks thr...

2017-03-10 Thread hvanhovell
GitHub user hvanhovell opened a pull request: https://github.com/apache/spark/pull/17244 [SPARK-19889][SQL] Make TaskContext callbacks thread safe ## What changes were proposed in this pull request? It is sometimes useful to use multiple threads in a task to parallelize tasks

[GitHub] spark pull request #17236: [SPARK-19893][SQL] Cannot run intersect/except/di...

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17236#discussion_r105383505 --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DataFrameSuite.scala --- @@ -1703,4 +1703,17 @@ class DataFrameSuite extends QueryTest

[GitHub] spark issue #17241: [SPARK-19877][SQL] Restrict the nested level of a view

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17241 @jiangxb1987 I am not entirely sure about the usefulness of this (I thought it might be good to enforce some restriction when we wrote the design doc). I don't think it matters much what we do

[GitHub] spark pull request #17240: [SPARK-17080][SQL][followup] 1.Postpone column pr...

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17240#discussion_r105371649 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/CostBasedJoinReorder.scala --- @@ -122,46 +123,59 @@ case class

[GitHub] spark issue #17236: [SPARK-19893][SQL] Cannot run intersect/except/distinct ...

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17236 LGTM pending jenkins --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes

[GitHub] spark pull request #17240: [SPARK-17080][SQL][followup] simplify algorithm

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17240#discussion_r105368556 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/CostBasedJoinReorder.scala --- @@ -122,46 +123,59 @@ case class

[GitHub] spark pull request #17240: [SPARK-17080][SQL][followup] simplify algorithm

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17240#discussion_r105368297 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/CostBasedJoinReorder.scala --- @@ -87,8 +88,8 @@ case class

[GitHub] spark pull request #17240: [SPARK-17080][SQL][followup] simplify algorithm

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17240#discussion_r105368212 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/CostBasedJoinReorder.scala --- @@ -75,8 +75,9 @@ case class

[GitHub] spark pull request #17240: [SPARK-17080][SQL][followup] simplify algorithm

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17240#discussion_r105368096 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/CostBasedJoinReorder.scala --- @@ -36,27 +36,27 @@ case class

[GitHub] spark issue #17236: [SPARK-19893][SQL] Cannot run intersect/except with map ...

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17236 @cloud-fan can we also check `Distinct`? We rewrite this into an `Aggregate` in the optimizer, which has the same problem with `MapType`. --- If your project is set up for it, you can reply

[GitHub] spark pull request #17236: [SPARK-19893][SQL] Cannot run intersect/except wi...

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17236#discussion_r105352149 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CheckAnalysis.scala --- @@ -319,7 +322,7 @@ trait CheckAnalysis extends

[GitHub] spark pull request #17236: [SPARK-19893][SQL] Cannot run intersect/except wi...

2017-03-10 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17236#discussion_r105351724 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/CheckAnalysis.scala --- @@ -319,7 +322,7 @@ trait CheckAnalysis extends

[GitHub] spark pull request #17214: [MINOR][SQL] The analyzer rules are fired twice f...

2017-03-08 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17214#discussion_r105047064 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/QueryExecution.scala --- @@ -46,9 +46,13 @@ class QueryExecution(val sparkSession

[GitHub] spark issue #17138: [SPARK-17080] [SQL] join reorder

2017-03-08 Thread hvanhovell
Github user hvanhovell commented on the issue: https://github.com/apache/spark/pull/17138 Spark can process bushy joins. This implementation of `DPSize` should be able to create bushy joins (or I have missed something). --- If your project is set up for it, you can reply

[GitHub] spark pull request #17214: [MINOR][SQL] The analyzer rules are fired twice f...

2017-03-08 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/17214#discussion_r105037075 --- Diff: sql/core/src/main/scala/org/apache/spark/sql/execution/QueryExecution.scala --- @@ -46,9 +46,13 @@ class QueryExecution(val sparkSession

[GitHub] spark pull request #15363: [SPARK-17791][SQL] Join reordering using star sch...

2017-03-08 Thread hvanhovell
Github user hvanhovell commented on a diff in the pull request: https://github.com/apache/spark/pull/15363#discussion_r105030111 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/joins.scala --- @@ -20,19 +20,340 @@ package

<    5   6   7   8   9   10   11   12   13   14   >