[ 
https://issues.apache.org/jira/browse/YARN-10720?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Qi Zhu updated YARN-10720:
--------------------------
    Description: 
Following is proxy server show, {color:#de350b}too many connections from one 
client{color}, this caused the proxy server hang, and the yarn web can't jump 
to web proxy.

!image-2021-03-29-13-42-47-672.png|width=718,height=62!

 

Following is the AM which is abnormal, but proxy server don't know it is 
abnormal already, so the connections can't be closed, we should add time out 
support in proxy server to prevent this.

!image-2021-03-29-13-44-05-579.png|width=657,height=97!

 

After i kill the abnormal AM, the proxy server become healthy. This case 
happened many times in our many production clusters, our clusters are huge, and 
the abnormal AM will be existed in a regular case.

 

cc  [~pbacsko] [~ebadger] [~Jim_Brennan]  [~ztang]  [~epayne] [~gandras]  
[~bteke]

 

  was:
Following is proxy server show, {color:#de350b}too many connections from one 
client{color}, this caused the proxy server hang, and the yarn web can't jump 
to web proxy.

!image-2021-03-29-13-42-47-672.png|width=718,height=62!

 

Following is the AM which is abnormal, but proxy server don't know it is 
abnormal already, so the connections can't be closed, we should add time out 
support in proxy server to prevent this.

!image-2021-03-29-13-44-05-579.png|width=657,height=97!

 

After i kill the abnormal AM, the proxy server become healthy. This case 
happened many times in our many production clusters, our clusters are huge, and 
the abnormal AM will be existed in a regular case.

 


> YARN WebAppProxyServlet should support connection timeout to prevent too many 
> abnormal connections.
> ---------------------------------------------------------------------------------------------------
>
>                 Key: YARN-10720
>                 URL: https://issues.apache.org/jira/browse/YARN-10720
>             Project: Hadoop YARN
>          Issue Type: Bug
>            Reporter: Qi Zhu
>            Assignee: Qi Zhu
>            Priority: Critical
>         Attachments: image-2021-03-29-13-42-47-672.png, 
> image-2021-03-29-13-44-05-579.png
>
>
> Following is proxy server show, {color:#de350b}too many connections from one 
> client{color}, this caused the proxy server hang, and the yarn web can't jump 
> to web proxy.
> !image-2021-03-29-13-42-47-672.png|width=718,height=62!
>  
> Following is the AM which is abnormal, but proxy server don't know it is 
> abnormal already, so the connections can't be closed, we should add time out 
> support in proxy server to prevent this.
> !image-2021-03-29-13-44-05-579.png|width=657,height=97!
>  
> After i kill the abnormal AM, the proxy server become healthy. This case 
> happened many times in our many production clusters, our clusters are huge, 
> and the abnormal AM will be existed in a regular case.
>  
> cc  [~pbacsko] [~ebadger] [~Jim_Brennan]  [~ztang]  [~epayne] [~gandras]  
> [~bteke]
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org

Reply via email to