Stephan Ewen created FLINK-16443:
------------------------------------

             Summary: Fix wrong fix for user-code CheckpointExceptions
                 Key: FLINK-16443
                 URL: https://issues.apache.org/jira/browse/FLINK-16443
             Project: Flink
          Issue Type: Bug
          Components: Runtime / Checkpointing
            Reporter: Stephan Ewen
             Fix For: 1.11.0


The problem of having exceptions that are only in the user code classloader was 
fixed by proactively serializing them inside the {{CheckpointException}}. That 
means all consumers of  {{CheckpointException}} now need to be aware of that 
and unwrap the serializable exception.

I believe the right way to fix this would have been to use a 
SerializedException in the {{DeclineCheckpoint}} message instead, which would 
have localized the change to the actual problem: RPC transport.

I would suggest to revert https://github.com/apache/flink/pull/9742 and instead 
apply the above described change.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to