[jira] [Commented] (SPARK-21851) Spark 2.0 data corruption with cache and 200 columns

2017-08-28 Thread Anton Suchaneck (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16144394#comment-16144394 ] Anton Suchaneck commented on SPARK-21851: - Not quite production, but still for re

[jira] [Commented] (SPARK-21851) Spark 2.0 data corruption with cache and 200 columns

2017-08-28 Thread Anton Suchaneck (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16144376#comment-16144376 ] Anton Suchaneck commented on SPARK-21851: - I wish upgrading was that easy when yo

[jira] [Commented] (SPARK-21851) Spark 2.0 data corruption with cache and 200 columns

2017-08-28 Thread Anton Suchaneck (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16144108#comment-16144108 ] Anton Suchaneck commented on SPARK-21851: - I tried on a VM with Spark 2.1 and did

[jira] [Updated] (SPARK-21851) Spark 2.0 data corruption with cache and 200 columns

2017-08-28 Thread Anton Suchaneck (JIRA)
[ https://issues.apache.org/jira/browse/SPARK-21851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anton Suchaneck updated SPARK-21851: Description: Doing a join and cache can corrupt data as shown here: {code} import pyspark.

[jira] [Created] (SPARK-21851) Spark 2.0 data corruption with cache and 200 columns

2017-08-28 Thread Anton Suchaneck (JIRA)
Anton Suchaneck created SPARK-21851: --- Summary: Spark 2.0 data corruption with cache and 200 columns Key: SPARK-21851 URL: https://issues.apache.org/jira/browse/SPARK-21851 Project: Spark Is