[jira] [Updated] (STORM-3016) Nimbus gets down when job has large amount of parallelism components

StaticMian (JIRA) Thu, 29 Mar 2018 21:17:11 -0700

     [ 
https://issues.apache.org/jira/browse/STORM-3016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


StaticMian updated STORM-3016:
------------------------------
    Description: 
When a job having large amount of parallelism components( total parallelism 
rises to 5000 for example) been submmited to storm cluster, Nimubs might get 
crashed, the work flow is as below:

1)  Nimbus computting assignment

2) Nimbus sending assignment to zk

{color:#ff0000}3) When assignment mapping info string is too long due to  total 
parallelism of job being too large, sending this info to zk will fail (zNode 
datalength set default is 1M ){color}

{color:#333333}4) Nimbus getting assignment from zk fails , then it gives up 
and crashed, with that happend, the stablity of the cluster will be greatly 
impact {color}

  was:
When a job having large amount of parallelism components( total parallelism 
rises to 5000 for example) been submmited to storm cluster, Nimubs might get 
crashed, the work flow is as below:

1)  Nimbus computting assignment

2) Nimbus sending assignment to zk

{color:#FF0000}3) When assignment mapping info string is too long due to  total 
parallelism of job being too large, sending this info to zk will fail (zk node 
data set default is 1M ){color}

{color:#333333}4) Nimbus getting assignment from zk fails , then it gives up 
and crashed, with that happend, the stablity of the cluster will be 
influnced... {color}


> Nimbus gets down when job has large amount of parallelism components
> --------------------------------------------------------------------
>
>                 Key: STORM-3016
>                 URL: https://issues.apache.org/jira/browse/STORM-3016
>             Project: Apache Storm
>          Issue Type: Improvement
>          Components: storm-core
>    Affects Versions: 2.0.0
>            Reporter: StaticMian
>            Priority: Major
>              Labels: security
>             Fix For: 2.0.0
>
>   Original Estimate: 96h
>  Remaining Estimate: 96h
>
> When a job having large amount of parallelism components( total parallelism 
> rises to 5000 for example) been submmited to storm cluster, Nimubs might get 
> crashed, the work flow is as below:
> 1)  Nimbus computting assignment
> 2) Nimbus sending assignment to zk
> {color:#ff0000}3) When assignment mapping info string is too long due to  
> total parallelism of job being too large, sending this info to zk will fail 
> (zNode datalength set default is 1M ){color}
> {color:#333333}4) Nimbus getting assignment from zk fails , then it gives up 
> and crashed, with that happend, the stablity of the cluster will be greatly 
> impact {color}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Updated] (STORM-3016) Nimbus gets down when job has large amount of parallelism components

Reply via email to