[
https://issues.apache.org/jira/browse/SPARK-21851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16144394#comment-16144394
]
Anton Suchaneck commented on SPARK-21851:
-
Not quite production, but still for re
[
https://issues.apache.org/jira/browse/SPARK-21851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16144376#comment-16144376
]
Anton Suchaneck commented on SPARK-21851:
-
I wish upgrading was that easy when yo
[
https://issues.apache.org/jira/browse/SPARK-21851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16144108#comment-16144108
]
Anton Suchaneck commented on SPARK-21851:
-
I tried on a VM with Spark 2.1 and did
[
https://issues.apache.org/jira/browse/SPARK-21851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anton Suchaneck updated SPARK-21851:
Description:
Doing a join and cache can corrupt data as shown here:
{code}
import pyspark.
Anton Suchaneck created SPARK-21851:
---
Summary: Spark 2.0 data corruption with cache and 200 columns
Key: SPARK-21851
URL: https://issues.apache.org/jira/browse/SPARK-21851
Project: Spark
Is