Re: [PR] KAFKA-15905 Restarts of MirrorCheckpointTask should not permanently i… [kafka]

via GitHub Fri, 10 May 2024 09:42:22 -0700


gharris1727 commented on PR #15910:
URL: https://github.com/apache/kafka/pull/15910#issuecomment-2104915418


   @edoardocomar In general connectors do have to add a configuration like this 
eventually, because users have different tolerances for errors. Some users want 
the errors to cause the connector to become FAILED, so that they can see the 
exception in the REST API and retry it explicitly. Other users want the 
connector to retry internally infinitely, and not fail for any reason.
   
   MM2 has a _lot_ of operations that can fail, and virtually none of them 
cause the connector to fail. The reason for this is that MM2 has dedicated 
mode, where there isn't a REST API to surface errors or perform external 
retries, so external retries are very expensive. It is definitely something 
that could be fixed eventually with like a "strict mode"? configuration or 
similar. We've also considered ways to address this from the framework side, 
with retry policies and automatic restarts, but none of that has been fully 
designed or implemented yet.
   
   I think we should not block this fix on solving that more general problem. 
If there is a permissions error loading the checkpoints, MM2 should log that, 
and then degrade gracefully to the current behavior. We can have a KIP that 
adds "strict mode" make this failure surface, to make this new permission 
required.
   
   In practical terms, without a configuration and with the graceful 
degradation implementation, we can get this into 3.8.
   If you're interested in the configuration, that will delay this feature 
until 4.0. I'm fine with either, but I think the current behavior has caused 
such considerable friction in the community that we should prefer a 3.8 release.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: jira-unsubscr...@kafka.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Re: [PR] KAFKA-15905 Restarts of MirrorCheckpointTask should not permanently i… [kafka]

Reply via email to