hi,jiliang1993:

??????????????yarn??yarn.resourcemanager.am.max-attempts????????????
????yarn.application-attempt-failures-validity-interval??????????????????attempts????????????????????????10??????????????????????10????????????????????attempts??????1????10??????????????attempts??????????????????attempts????????min(yarn????????yarn.resourcemanager.am.max-attempts,flink????????yarn.application-attempts)????yarn??????????????????????????????
????????????????????????????
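
The two rules that are still legible in this thread (the `min(...)` of `yarn.resourcemanager.am.max-attempts` and `yarn.application-attempts`, and the 10 s `yarn.application-attempt-failures-validity-interval`) can be sketched roughly as below. This is only a simplified model of the counting behaviour, not YARN's or Flink's actual code; the function names and the kill timestamps are hypothetical and chosen for illustration.

```python
def effective_max_attempts(rm_am_max_attempts: int, flink_application_attempts: int) -> int:
    """Attempts YARN will actually allow: the cluster-wide cap wins if it is lower."""
    return min(rm_am_max_attempts, flink_application_attempts)


def failure_that_ends_app(failure_times_ms, max_attempts, validity_interval_ms):
    """Return the index of the failure after which no new attempt is started,
    or None if every failure is followed by a restart.

    Simplified model (an assumption, not YARN source code): a failure only
    counts against max_attempts while it is younger than validity_interval_ms;
    an interval <= 0 means failures never expire.
    """
    for i, now in enumerate(failure_times_ms):
        counted = [
            t for t in failure_times_ms[: i + 1]
            if validity_interval_ms <= 0 or now - t < validity_interval_ms
        ]
        if len(counted) >= max_attempts:
            return i
    return None


if __name__ == "__main__":
    # Hypothetical values: cluster cap 2, Flink asks for 4.
    print(effective_max_attempts(2, 4))
    # Hypothetical kills one minute apart with a 10 s window: never gives up.
    print(failure_that_ends_app([0, 60_000, 120_000, 180_000], 2, 10_000))
    # Two kills within 10 s: the second one ends the application.
    print(failure_that_ends_app([0, 5_000], 2, 10_000))
    # Window disabled: every failure counts, so the second kill is final.
    print(failure_that_ends_app([0, 60_000], 2, -1))
```

Under this model, killing the JobManager by hand at intervals longer than the validity window never exhausts the attempts, which would be consistent with the repeated restarts in the `jps` output quoted further down.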


Best,
MuChen.


------------------ Original Message ------------------
From: "jiliang1993" <jiliang1...@gmail.com>;
Sent: Wednesday, July 1, 2020, 10:56 PM
To: "MuChen" <9329...@qq.com>;

Subject: Re: flink????yarn??HA??????????????HA??????????????????????state



????????????????ha????????yarn??attempt ??????????





------------------ Original Message ------------------
From: "MuChen" <9329...@qq.com>
Sent: July 1, 2020, 22:48
To: jiliang1993 <jiliang1...@gmail.com>
Subject: Re: flink????yarn??HA??????????????HA??????????????????????state



hi????????
??????????????????

Best,
MuChen.

------------------ Original Message ------------------
From: "????" <sdlcwangson...@gmail.com>;
Sent: Wednesday, July 1, 2020, ????8:17
To: "user-zh" <user-zh@flink.apache.org>;
Subject: Re: flink????yarn??HA??????????????HA??????????????????????state

hi, muchen

1. yarn.application-attempts ??????????????????????????????yarn.application-attempt-failures-validity-interval????????????????????????????interval????????????????????????flink job??????????????????????interval??????????????????????????????yarn.application-attempts: 2??yarn.application-attempt-failures-validity-interval = 10000??????????10s??????????10s?? flink job ????????2??????????????????
2. ??????????checkpoint??????????????????????state??

????????????????????????????????????

MuChen <9329...@qq.com> wrote on Wed, Jul 1, 2020 at ????7:50:

> hi??all??
>
> ??????????????https://blog.csdn.net/cndotaci/article/details/106870413
> ??????????????flink????yarn??????????????????????????????????????2??????????????????????6????????????????????yarn??????
>
> ????????????
>
> 1. ????????????????????????????????????
>
> 2. ????HA????????????????????????????????????????state??
>
> flink??????1.10.0
>
> flink-conf.yaml??????
> $ grep -v ^# flink-conf.yaml |grep -v ^$
> jobmanager.rpc.address: localhost
> jobmanager.rpc.port: 6123
> jobmanager.heap.size: 1024m
> taskmanager.memory.process.size: 1568m
> taskmanager.numberOfTaskSlots: 1
> parallelism.default: 1
> high-availability: zookeeper
> high-availability.storageDir: hdfs:///flink/ha/
> high-availability.zookeeper.quorum: uhadoop-op3raf-master1,uhadoop-op3raf-master2,uhadoop-op3raf-core1
> state.checkpoints.dir: hdfs:///flink/checkpoint
> state.savepoints.dir: hdfs:///flink/flink-savepoints
> state.checkpoints.num-retained:60
> state.backend.incremental: true
> jobmanager.execution.failover-strategy: region
> jobmanager.archive.fs.dir: hdfs:///flink/flink-jobs/
> historyserver.web.port: 8082
> historyserver.archive.fs.dir: hdfs:///flink/flink-jobs/
> historyserver.archive.fs.refresh-interval: 10000
> # HA????????
> yarn.application-attempts: 2
> ssh??jm??????????kill????????????????
> [root@uhadoop-op3raf-task48 ~]# jps
> 34785 YarnTaskExecutorRunner
> 16853 YarnTaskExecutorRunner
> 17527 PrestoServer
> 33289 YarnTaskExecutorRunner
> 18026 YarnJobClusterEntrypoint
> 20283 Jps
> 39599 NodeManager
> [root@uhadoop-op3raf-task48 ~]# kill -9 18026
> [root@uhadoop-op3raf-task48 ~]# jps
> 34785 YarnTaskExecutorRunner
> 16853 -- process information unavailable
> 17527 PrestoServer
> 21383 Jps
> 33289 YarnTaskExecutorRunner
> 20412 YarnJobClusterEntrypoint
> 39599 NodeManager
> [root@uhadoop-op3raf-task48 ~]# kill -9 20412
> [root@uhadoop-op3raf-task48 ~]# jps
> 34785 YarnTaskExecutorRunner
> 21926 YarnJobClusterEntrypoint
> 23207 Jps
> 17527 PrestoServer
> 33289 YarnTaskExecutorRunner
> 39599 NodeManager
> [root@uhadoop-op3raf-task48 ~]# kill -9 21926
> [root@uhadoop-op3raf-task48 ~]# jps
> 34785 YarnTaskExecutorRunner
> 23318 YarnJobClusterEntrypoint
> 26279 Jps
> 17527 PrestoServer
> 33289 YarnTaskExecutorRunner
> 39599 NodeManager
> [root@uhadoop-op3raf-task48 ~]# kill -9 23318
