[ https://issues.apache.org/jira/browse/SPARK-44464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jungtaek Lim resolved SPARK-44464. ---------------------------------- Fix Version/s: 3.5.0 Assignee: Siying Dong Resolution: Fixed Issue resolved via [https://github.com/apache/spark/pull/42046] > Fix applyInPandasWithStatePythonRunner to output rows that have Null as first > column value > ------------------------------------------------------------------------------------------ > > Key: SPARK-44464 > URL: https://issues.apache.org/jira/browse/SPARK-44464 > Project: Spark > Issue Type: Bug > Components: Structured Streaming > Affects Versions: 3.3.3 > Reporter: Siying Dong > Assignee: Siying Dong > Priority: Major > Fix For: 3.5.0 > > > The current implementation of {{ApplyInPandasWithStatePythonRunner}} cannot > deal with outputs where the first column of the row is {{{}null{}}}, as it > cannot distinguish the case where the column is null, or the field is filled > as the number of data records are smaller than state records. It causes > incorrect results for the former case. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org