[ 
https://issues.apache.org/jira/browse/RATIS-2329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xianming Lei updated RATIS-2329:
--------------------------------
    Description: In our production Celeborn cluster, network problems cause a 
TimeOutIOException for a period of time when the leader appends entries to the 
follower. After the network is restored, the leader gets stuck when it resumes 
appending entries to the follower, so the log cannot be replicated.  (was: In 
our production environment, due to 
network problems, a TimeOutIOException will occur for a period of time when the 
leader appends entries to the follower. When the network is restored, the 
leader will be stuck when continuing to append entries to the follower, causing 
the log to be unable to be replicated.)

> NettyRpcProxy should support handling netty.channel exceptions.
> ---------------------------------------------------------------
>
>                 Key: RATIS-2329
>                 URL: https://issues.apache.org/jira/browse/RATIS-2329
>             Project: Ratis
>          Issue Type: Bug
>          Components: Netty
>    Affects Versions: 2.5.1
>            Reporter: Xianming Lei
>            Priority: Major
>
> In our production Celeborn cluster, network problems cause a 
> TimeOutIOException for a period of time when the leader appends entries to 
> the follower. After the network is restored, the leader gets stuck when it 
> resumes appending entries to the follower, so the log cannot be replicated.



