[ https://issues.apache.org/jira/browse/YARN-6531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15984406#comment-15984406 ]
Naganarasimha G R commented on YARN-6531:
-----------------------------------------

[~bibinchundatt],
I agree that a bad app would be riskier in a public-cluster kind of setup. The catch is that the limit belongs to a pluggable state store, so we need a cleaner way to do the check before we send an event to the state store.

> Check appStateData size before saving to Zookeeper
> --------------------------------------------------
>
>                 Key: YARN-6531
>                 URL: https://issues.apache.org/jira/browse/YARN-6531
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Bibin A Chundatt
>            Assignee: Bibin A Chundatt
>            Priority: Critical
>
> An application with a large application submission context can cause the
> store to ZooKeeper to fail because of the znode size limit. When the znode
> limit is exceeded, the exception thrown is
> {{org.apache.zookeeper.KeeperException$ConnectionLossException}}.
> ZkStateStore retries the configured number of times and then throws
> ConnectionLossException once that limit is reached.
> This can cause the ResourceManager to switch from active to standby, and
> other submitted applications are then not saved to ZK.
> Solution: validate the {{ApplicationStateData}} size before saving and
> reject the application, so that the ResourceManager is not impacted.
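
To make the "check before we send an event to the state store" idea concrete, here is a minimal sketch of how a pluggable store might advertise its own size limit so that the caller can reject an oversized record up front. Every name in it (StateStoreSizeGuard, SizeLimitedStore, maxRecordBytes, ZkLikeStore, UnboundedStore, fits) is hypothetical and is not part of the YARN code base. The value 0xfffff is ZooKeeper's default jute.maxbuffer; a real check would probably leave headroom for the znode path and ACL bytes, which travel in the same request.

```java
// Illustrative sketch only: none of these types exist in Hadoop YARN.
final class StateStoreSizeGuard {

    /** Hypothetical hook that lets each pluggable store report its own limit. */
    interface SizeLimitedStore {
        /** Maximum serialized record size in bytes, or -1 if there is no limit. */
        int maxRecordBytes();
    }

    /** Stand-in for a ZooKeeper-backed store (assumed default jute.maxbuffer). */
    static final class ZkLikeStore implements SizeLimitedStore {
        @Override
        public int maxRecordBytes() {
            return 0xfffff; // just under 1 MB
        }
    }

    /** Stand-in for a store with no per-record limit. */
    static final class UnboundedStore implements SizeLimitedStore {
        @Override
        public int maxRecordBytes() {
            return -1;
        }
    }

    /** Returns true if the serialized state can be handed to the given store. */
    static boolean fits(SizeLimitedStore store, byte[] serializedState) {
        int limit = store.maxRecordBytes();
        return limit < 0 || serializedState.length <= limit;
    }

    public static void main(String[] args) {
        byte[] small = new byte[4 * 1024];        // typical submission context
        byte[] huge = new byte[2 * 1024 * 1024];  // oversized submission context

        SizeLimitedStore zk = new ZkLikeStore();
        System.out.println("small fits in ZK-like store: " + fits(zk, small));
        System.out.println("huge fits in ZK-like store:  " + fits(zk, huge));
        System.out.println("huge fits in unbounded store: "
            + fits(new UnboundedStore(), huge));
    }
}
```

Keeping the limit behind the store interface means the submission path never needs to know which store is configured; it only asks whether the record fits and rejects the application if it does not.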