[
https://issues.apache.org/jira/browse/YARN-8995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16681105#comment-16681105
]
zhuqi edited comment on YARN-8995 at 11/9/18 9:06 AM:
------------------------------------------------------
Hi [~cheersyang]
Thanks for your reply. I think that, in addition to the queue size, we can also
add an eventMetrics class to monitor the health of all event dispatchers in the
cluster.
was (Author: zhuqi):
Hi [~cheersyang]
Thanks for your reply, i think not only the queue size, we can also add a
eventMetrics class to monitor the health of cluster's all event dispachers.
> Log the event type of the too big AsyncDispatcher event queue size, and add
> the information to the metrics.
> ------------------------------------------------------------------------------------------------------------
>
> Key: YARN-8995
> URL: https://issues.apache.org/jira/browse/YARN-8995
> Project: Hadoop YARN
> Issue Type: Improvement
> Components: metrics, nodemanager, resourcemanager
> Affects Versions: 3.1.0
> Reporter: zhuqi
> Assignee: zhuqi
> Priority: Major
>
> In our growing cluster, unexpected situations can cause some event queues to
> back up and degrade cluster performance, such as the bug in
> https://issues.apache.org/jira/browse/YARN-5262 . I think it is necessary to
> log the event type when an event queue grows too large and to add that
> information to the metrics. The queue-size threshold should be a configurable
> parameter.
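
A minimal sketch of the idea described above, for illustration only. It counts pending events per type, and once the queue size exceeds a configurable threshold it logs the most frequent pending type. The class EventQueueMonitor, its method names, and the event type strings in the usage note are all hypothetical; none of this is taken from the YARN code base or from a patch on this issue, and it does not use the real AsyncDispatcher or metrics APIs.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

/** Hypothetical helper, not part of Hadoop YARN. */
class EventQueueMonitor<T> {
    // Queue-size threshold; assumed to come from a configurable parameter.
    private final int threshold;
    // Number of pending (enqueued but not yet handled) events per event type.
    private final Map<T, Long> pendingByType = new HashMap<>();
    private long pending;

    EventQueueMonitor(int threshold) {
        this.threshold = threshold;
    }

    /** Record an enqueued event; returns true if the queue is over the threshold. */
    synchronized boolean onEnqueue(T type) {
        pendingByType.merge(type, 1L, Long::sum);
        pending++;
        if (pending > threshold) {
            System.err.println("Event queue size " + pending
                + " exceeds threshold " + threshold
                + "; most frequent pending event type: "
                + dominantType().orElse(null));
            return true;
        }
        return false;
    }

    /** Record that an event of the given type was taken off the queue. */
    synchronized void onDequeue(T type) {
        Long count = pendingByType.get(type);
        if (count == null) {
            return; // type was never recorded; nothing to undo
        }
        if (count == 1L) {
            pendingByType.remove(type);
        } else {
            pendingByType.put(type, count - 1);
        }
        pending--;
    }

    /** Current number of pending events. */
    synchronized long size() {
        return pending;
    }

    /** Event type with the most pending events, if any are pending. */
    synchronized Optional<T> dominantType() {
        return pendingByType.entrySet().stream()
            .max(Map.Entry.comparingByValue())
            .map(Map.Entry::getKey);
    }

    /** Copy of the per-type counts, e.g. for publishing as metrics. */
    synchronized Map<T, Long> snapshot() {
        return new HashMap<>(pendingByType);
    }
}
```

With a threshold of 2, enqueuing "NODE_UPDATE", "APP_ADDED", and "NODE_UPDATE" would make the third call return true and report "NODE_UPDATE" as the dominant type. A real implementation would probably rate-limit the log line rather than print on every enqueue above the threshold.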
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)