Yuchen Liu created SPARK-48589:
----------------------------------

             Summary: Add option snapshotStartBatchId and snapshotPartitionId 
to state data source
                 Key: SPARK-48589
                 URL: https://issues.apache.org/jira/browse/SPARK-48589
             Project: Spark
          Issue Type: New Feature
          Components: Structured Streaming
    Affects Versions: 4.0.0
            Reporter: Yuchen Liu


Define two new options, _snapshotStartBatchId_ and _snapshotPartitionId_, for 
the existing state reader. Both of them should be provided at the same time.
 # When there is no snapshot file at that batch (note there is an off-by-one 
issue between version and batch Id), throw an exception.
 # Otherwise, the reader should continue to rebuild the state by reading delta 
files only, and ignore all snapshot files afterwards.
 # Note that if a batchId option is already specified. That batchId is the 
ending batchId, we should then end at that batchId.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to