Yun Tang created FLINK-25479:
--------------------------------

             Summary: Changlog materialization with incremental checkpoint 
cannot work well in local tests
                 Key: FLINK-25479
                 URL: https://issues.apache.org/jira/browse/FLINK-25479
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Checkpointing, Runtime / State Backends, Tests
            Reporter: Yun Tang
             Fix For: 1.15.0


Currently, changelog materialization would call RocksDB state backend's 
snapshot method to generate {{IncrementalRemoteKeyedStateHandle}} as 
ChangelogStateBackendHandleImpl's materialized artifacts. And before next 
materialization, it will always report the same 
{{IncrementalRemoteKeyedStateHandle}} as before.

For local tests, TM would report the {{IncrementalRemoteKeyedStateHandle}} to 
JM via local {{LocalRpcInvocation}}. However, as {{LocalRpcInvocation}} would 
not de/serialize message, which leads once we register the 
{{IncrementalRemoteKeyedStateHandle}} on JM side, it will also add a 
{{sharedStateRegistry}} to the one located on TM side. For the 2nd checkpoint, 
TM would reported same {{IncrementalRemoteKeyedStateHandle}} with  
{{sharedStateRegistry}} to JM. And it will then throw exception as it already 
contains a {{sharedStateRegistry}}:

IncrementalRemoteKeyedStateHandle
{code:java}
public void registerSharedStates(SharedStateRegistry stateRegistry, long 
checkpointID) {
       Preconditions.checkState(
                sharedStateRegistry != stateRegistry,
                "The state handle has already registered its shared states to 
the given registry.");

}
{code}

This bug would go in distribution environment as 
{{IncrementalRemoteKeyedStateHandle}} would be serialized and 
{{sharedStateRegistry}} is tagged as {{transient}}.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to