AFAIR, your manual kill won't count towards the max-attempt counter in hadoop's logic. Please see this post for more details: http://johnjianfang.blogspot.com/2015/04/the-number-of-maximum-attempts-of-yarn.html .
On Sun, Jun 2, 2019 at 9:48 AM 新平和礼 <k.shinhira.1...@gmail.com> wrote: > Hi all, > > > I'm Flink newbie, and trying to understand Flink cluster’s recovery > feature using Flink 1.7.2 and YARN 2.8. > > To confirm HA cluster’s behavior, I created Flink YARN session cluster > and stopped JobManager repeatedly using kill command after job deployment. > > In that test, I set “yarn.application-attempts” to 5, but Flink cluster > was recovered more than 5 times. > > > Does anyone know what “yarn.application-attempts” mean, and when Flink > cluster’s attempts time will be incremented ? > > > I asked same question at stackoverflow, but I still don’t get it. > > > > https://stackoverflow.com/questions/56225088/why-is-flink-ha-cluster-on-yarn-recovered-more-than-the-maximum-number-of-attemp > > > > Best, > -- > Kazunori Shinhira > Mail : k.shinhira.1...@gmail.com >