[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16064972#comment-16064972
]
Barry Becker commented on SPARK-20226:
--
Calling cache() on the dataframe after the addColumn
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960868#comment-15960868
]
Barry Becker commented on SPARK-20226:
--
Only 11 columns. I did not want to wait for 10 or 20 minutes
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960835#comment-15960835
]
Liang-Chi Hsieh commented on SPARK-20226:
-
How many columns are added in the above runs? I didn't see
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960806#comment-15960806
]
Barry Becker commented on SPARK-20226:
--
OK, I set the flag using
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15960001#comment-15960001
]
Liang-Chi Hsieh commented on SPARK-20226:
-
{{spark.sql.constraintPropagation.enabled}} is a SQL
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15959134#comment-15959134
]
Barry Becker commented on SPARK-20226:
--
Yes. We are running through spark job-server, and local.conf
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15959072#comment-15959072
]
Liang-Chi Hsieh commented on SPARK-20226:
-
I am not sure what the job-server local.conf is. Does
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15959024#comment-15959024
]
Barry Becker commented on SPARK-20226:
--
I set spark.sql.constraintPropagation.enabled to false in
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15958512#comment-15958512
]
Sean Owen commented on SPARK-20226:
---
Yeah, it does look like it's slowing down not purely because of
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15958259#comment-15958259
]
Liang-Chi Hsieh commented on SPARK-20226:
-
[~barrybecker4] Can you try to disable this config
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15957732#comment-15957732
]
Barry Becker commented on SPARK-20226:
--
I did some profiling using the sampler in JVisualVM and took
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15957496#comment-15957496
]
Sean Owen commented on SPARK-20226:
---
It could just look that way because caching means evaluating, and
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15957489#comment-15957489
]
Barry Becker commented on SPARK-20226:
--
I thought the problem was in the cacheTable call because
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15957468#comment-15957468
]
Sean Owen commented on SPARK-20226:
---
OK, this doesn't sound like it's anything to do with SQLContext or
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15957457#comment-15957457
]
Barry Becker commented on SPARK-20226:
--
It seems like it has to do with the interaction between the
[
https://issues.apache.org/jira/browse/SPARK-20226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15957296#comment-15957296
]
Barry Becker commented on SPARK-20226:
--
We noticed that this is reproducible just by adding a new
16 matches