Burak Yavuz created SPARK-21370:
-----------------------------------

             Summary: Clarify In-Memory State Store purpose (read-only, 
read-write) with an additional state
                 Key: SPARK-21370
                 URL: https://issues.apache.org/jira/browse/SPARK-21370
             Project: Spark
          Issue Type: Improvement
          Components: Structured Streaming
    Affects Versions: 2.1.1
            Reporter: Burak Yavuz
            Assignee: Burak Yavuz


Currently the HDFSBackedStateStore sets it's state as UPDATING as it is 
initialized.

For every trigger, we create two state stores, one used during "Restore" and 
one during "Save". The "Restore" StateStore is read-only. This state store gets 
"aborted" after a task is completed, which results in a file being created and 
immediately deleted.

This can be avoided if there is an INITIALIZED state and abort deletes files 
only when there is an update to the state store using "put" or "remove".



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to