[ https://issues.apache.org/jira/browse/YARN-9809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17138894#comment-17138894 ]
Eric Yang commented on YARN-9809: --------------------------------- [~ebadger] Sorry, my statement was not clear. If the script name is incorrect, resulting exit code is non-zero, or the execution exit code is non-zero. In those cases, health check will report as healthy. I think those conditions must be considered as unhealthy, in the event that check script does not have proper prerequisites. The errors can be caught. Is this something that we can fix to make this more user friendly? > NMs should supply a health status when registering with RM > ---------------------------------------------------------- > > Key: YARN-9809 > URL: https://issues.apache.org/jira/browse/YARN-9809 > Project: Hadoop YARN > Issue Type: Bug > Reporter: Eric Badger > Assignee: Eric Badger > Priority: Major > Attachments: YARN-9809.001.patch, YARN-9809.002.patch, > YARN-9809.003.patch, YARN-9809.004.patch > > > Currently if the NM registers with the RM and it is unhealthy, it can be > scheduled many containers before the first heartbeat. After the first > heartbeat, the RM will mark the NM as unhealthy and kill all of the > containers. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org