Livia Zhu created SPARK-53625:
---------------------------------

             Summary: Propagate metadata columns through projections to address 
ApplyCharTypePadding incompatibility in streaming
                 Key: SPARK-53625
                 URL: https://issues.apache.org/jira/browse/SPARK-53625
             Project: Spark
          Issue Type: Improvement
          Components: Structured Streaming
    Affects Versions: 4.1.0
            Reporter: Livia Zhu


Currently, streams where the output DF access the `_metadata` column with a 
char type in the schema and ApplyCharTypePadding rule applied will encounter an 
AssertionError in streaming MicrobatchExecution. This is because the 
ApplyCharTypePadding rule hides the metadata column with a Projection and this 
code 
([https://github.com/apache/spark/blob/87a71fabb097e1543a935fae8167bc47a29a127e/sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/runtime/MicroBatchExecution.scala#L803)]
 does not propagate the metadata column through the projection.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to