Kousuke Saruta created SPARK-28548:
--------------------------------------

             Summary: explain() shows wrong result for persisted DataFrames after some operations
                 Key: SPARK-28548
                 URL: https://issues.apache.org/jira/browse/SPARK-28548
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 3.0.0
            Reporter: Kousuke Saruta
            Assignee: Kousuke Saruta


If certain operations are run against a Dataset before it is persisted, 
Dataset.explain shows the wrong result afterwards.
One of those operations is explain() itself.
Here is an example.

{code}
val df = spark.range(10)
df.explain
df.persist
df.explain
{code}

The expected result looks like this.
{code}
== Physical Plan ==
*(1) ColumnarToRow
+- InMemoryTableScan [id#7L]
      +- InMemoryRelation [id#7L], StorageLevel(disk, memory, deserialized, 1 replicas)
            +- *(1) Range (0, 10, step=1, splits=12)
{code}

But the second explain actually printed this.
{code}
== Physical Plan ==
*(1) Range (0, 10, step=1, splits=12)
{code}
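
To tell whether the Dataset is really persisted when explain does not show an InMemoryRelation, one option is to ask Dataset.storageLevel, which looks the Dataset up in the cache manager. The following is only a sketch of such a check, not part of the report: the object name, app name, and local master are assumptions made for illustration.

{code}
import org.apache.spark.sql.SparkSession
import org.apache.spark.storage.StorageLevel

// Hypothetical standalone wrapper around the reproduction above.
object ExplainAfterPersistCheck {
  def main(args: Array[String]): Unit = {
    // Assumption: a local session; in spark-shell `spark` already exists.
    val spark = SparkSession.builder()
      .master("local[*]")
      .appName("SPARK-28548-check")
      .getOrCreate()

    val df = spark.range(10)
    df.explain()   // the operation that precedes persist in the report
    df.persist()

    // storageLevel is looked up via the cache manager, so it does not
    // depend on the plan text that explain prints.
    println(s"storage level: ${df.storageLevel}")
    println(s"persisted: ${df.storageLevel != StorageLevel.NONE}")

    df.explain()   // compare this output with the storage level above

    df.unpersist()
    spark.stop()
  }
}
{code}

If the storage level is something other than NONE while the second explain still prints only the Range node, the data is registered as cached and only the explain output is stale.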



