Jungtaek Lim created SPARK-38522:
------------------------------------

             Summary: Strengthen the contract on iterator method in StateStore
                 Key: SPARK-38522
                 URL: https://issues.apache.org/jira/browse/SPARK-38522
             Project: Spark
          Issue Type: Improvement
          Components: Structured Streaming
    Affects Versions: 3.3.0
            Reporter: Jungtaek Lim


The root cause of SPARK-38320 was that the logic initialized the iterator 
first, and performed some updates against state store, and iterated through 
iterator expecting that all updates in between should be visible in iterator.

That is not guaranteed in RocksDB state store, and the contract of Java 
ConcurrentHashMap which is used in HDFSBackedStateStore does not also guarantee 
it.

It would be clearer if we update the contract to draw a line on behavioral 
guarantee to callers so that callers don't get such expectation.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to