Hello, We have a multiple-node storm cluster running on a Production environment. We have had some issues with a couple of machines, which have been out of service for a few hours.
Because some workers of the deployed topologies were running on the failed machines, cluster's behaviour has been unusual (It has been running but not as it should). Once we recovered the failed nodes, and rebalanced the topologies, the cluster returned to work properly. We would like to know if there is any way to alert nimbus, when a node fall down, in order to rebalance the affected topologies and create new workers in the healthy nodes of the cluster that supply those who were working on the failed ones. This would have helped us so much, because we could have kept consistency in our service in spite of the failed nodes. Any advice? Tahnks in advance! *JULIÁN BERMEJO FERREIRO* *Departamento de Tecnología * *[email protected] <[email protected]>* <http://www.beeva.com/>
