[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-19 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-197724073 @gatorsmile @rxin @cloud-fan Since this PR does not solve the problem as expected, also introduce other problems (can't prune columns for Filter(Join(xx)), I have revert

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-19 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-197725609 @davies Sorry for that. Could you review another PR: https://github.com/apache/spark/pull/11745 ? That is built on this PR to resolve all the issues. Thanks

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-19 Thread davies
Github user davies commented on a diff in the pull request: https://github.com/apache/spark/pull/11682#discussion_r56459598 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala --- @@ -410,6 +401,24 @@ object ColumnPruning extends Rule[Logi

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-19 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/11682#discussion_r56459726 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala --- @@ -410,6 +401,24 @@ object ColumnPruning extends Rule[

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-19 Thread gatorsmile
Github user gatorsmile commented on a diff in the pull request: https://github.com/apache/spark/pull/11682#discussion_r56459774 --- Diff: sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/optimizer/Optimizer.scala --- @@ -410,6 +401,24 @@ object ColumnPruning extends Rule[

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-15 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-197004091 After writing more test cases, I found a couple of issues. In addition, we need another split of the rule `ColumnPruning`. Have a new rule `PushProjectThroughPredica

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-15 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-196939816 @davies I might understand your points. We still prefer `PushPredicateThroughProject`. Let me submit another PR to address that issue and at you. Let me ex

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-15 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-196922074 @gatorsmile This latest changes does not address the problem in the PR description, ColumnPruning and PushPredicateThroughProject still conflict with each other, right?

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-15 Thread gatorsmile
Github user gatorsmile commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-196839488 @davies In the case Filter(Project()), the second rule `EliminateOperators` removes the useless Project if it does not prune any column. Could you explain a

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-15 Thread davies
Github user davies commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-196732337 @gatorsmile Have you missed the special rule for Filter(Project()) ? --- If your project is set up for it, you can reply to this email and have your reply appear on GitH

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-15 Thread asfgit
Github user asfgit closed the pull request at: https://github.com/apache/spark/pull/11682 --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enabled and wishes so, or if the feature is ena

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-15 Thread rxin
Github user rxin commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-196701013 Merging in master. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feat

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-196628698 Test PASSed. Refer to this link for build results (access rights to CI server needed): https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-14 Thread AmplabJenkins
Github user AmplabJenkins commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-196628692 Merged build finished. Test PASSed. --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your projec

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-14 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-196627725 **[Test build #53144 has finished](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/53144/consoleFull)** for PR 11682 at commit [`bc4685a`](https://g

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-14 Thread cloud-fan
Github user cloud-fan commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-196617839 LGTM --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this feature enable

[GitHub] spark pull request: [SPARK-13840] [SQL] Split Optimizer Rule Colum...

2016-03-14 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/11682#issuecomment-196594548 **[Test build #53144 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/53144/consoleFull)** for PR 11682 at commit [`bc4685a`](https://gi