HeartSaVioR commented on issue #23634: [SPARK-26154][SS] Streaming left/right 
outer join should not return outer nulls for already matched rows
URL: https://github.com/apache/spark/pull/23634#issuecomment-458915512
 
 
   I just realized current approach is not compatible with old state, and it is 
non-trivial to address the issue. I would end up with applying either one to 
support backward compatibility:
   
   1) Store state data with empty map for all of previous batches on `matched` 
state store. It still requires leaving old logic as fail-back mechanism to 
handle outer join correctly with old state. (8e1a2f4 actually does this, but I 
forgot I should fill up state checkpoints for matched state store.)
   
   2) Versioning state (old state = version 1 / new state = version 2), and 
separate logic against state version. This means we give up fixing bug with 
state version 1.
   
   As we have been addressing state compatibility issue via 2), I would try to 
tackle 2) first.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to