[ https://issues.apache.org/jira/browse/IGNITE-10815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16728796#comment-16728796 ]
ASF GitHub Bot commented on IGNITE-10815:
-----------------------------------------

GitHub user Jokser opened a pull request:

    https://github.com/apache/ignite/pull/5746

    IGNITE-10815 Fixed coordinator failover in case of exchanges merge and non-affinity nodes

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gridgain/apache-ignite ignite-10815

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/ignite/pull/5746.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #5746

----
commit 97d4d22f12f0a24060ea1cd7253758065cf77023
Author: Pavel Kovalenko <jokserfn@...>
Date:   2018-12-25T17:25:57Z

    IGNITE-10815 WIP

    Signed-off-by: Pavel Kovalenko <jokse...@gmail.com>

commit 141f40b8742d12681b3f41f7ee3dbc3ae2702380
Author: Pavel Kovalenko <jokserfn@...>
Date:   2018-12-25T18:51:11Z

    IGNITE-10815 Fix and test.

    Signed-off-by: Pavel Kovalenko <jokse...@gmail.com>

commit 84b3fc09cca1f7133883bd44c64a1466d32d5b53
Author: Pavel Kovalenko <jokserfn@...>
Date:   2018-12-25T18:52:08Z

    IGNITE-10815 Cleanup

    Signed-off-by: Pavel Kovalenko <jokse...@gmail.com>

----

> NullPointerException in InitNewCoordinatorFuture.init() leads to cluster hang
> -----------------------------------------------------------------------------
>
>                 Key: IGNITE-10815
>                 URL: https://issues.apache.org/jira/browse/IGNITE-10815
>             Project: Ignite
>          Issue Type: Bug
>    Affects Versions: 2.4
>            Reporter: Anton Kurbanov
>            Assignee: Pavel Kovalenko
>            Priority: Critical
>             Fix For: 2.8
>
>
> Possible scenario to reproduce:
> 1. Force a few consecutive exchange merges and let them finish.
> 2. Trigger an exchange.
> 3. Shut down the coordinator node before sending/receiving the full partitions message.
>
> Stacktrace:
> {code:java}
> 2018-12-24 15:54:02,664 sys-#48%gg% ERROR org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture - Failed to init new coordinator future: bd74f7ed-6984-4f78-9941-480df673ab77
> java.lang.NullPointerException: null
>     at org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture.events(GridDhtPartitionsExchangeFuture.java:534) ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
>     at org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager$18.applyx(CacheAffinitySharedManager.java:1790) ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
>     at org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager$18.applyx(CacheAffinitySharedManager.java:1738) ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
>     at org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager.forAllRegisteredCacheGroups(CacheAffinitySharedManager.java:1107) ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
>     at org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager.initCoordinatorCaches(CacheAffinitySharedManager.java:1738) ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
>     at org.apache.ignite.internal.processors.cache.distributed.dht.preloader.InitNewCoordinatorFuture.init(InitNewCoordinatorFuture.java:104) ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
>     at org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture$8$1.call(GridDhtPartitionsExchangeFuture.java:3439) [ignite-core-2.4.13.b4.jar:2.4.13.b4]
>     at org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture$8$1.call(GridDhtPartitionsExchangeFuture.java:3435) [ignite-core-2.4.13.b4.jar:2.4.13.b4]
>     at org.apache.ignite.internal.util.IgniteUtils.wrapThreadLoader(IgniteUtils.java:6720) [ignite-core-2.4.13.b4.jar:2.4.13.b4]
>     at org.apache.ignite.internal.processors.closure.GridClosureProcessor$2.body(GridClosureProcessor.java:967) [ignite-core-2.4.13.b4.jar:2.4.13.b4]
>     at org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:110) [ignite-core-2.4.13.b4.jar:2.4.13.b4]
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_171]
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_171]
>     at java.lang.Thread.run(Thread.java:748) [?:1.8.0_171]
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
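
For readers wondering how a logged NullPointerException can turn into a hang, here is a minimal sketch in plain Java. It is not Ignite code and does not describe the actual change in the pull request. ExchangeStub, initSwallowing, initPropagating and timesOut are hypothetical names, and the idea that the hang comes from a future that is never completed is an assumption, not something the report states. The sketch only shows the general pattern: if an init task catches and logs an exception without completing its future, anything waiting on that future blocks, whereas completing the future exceptionally releases the waiters.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class CoordinatorInitSketch {
    /** Hypothetical stand-in for an exchange future whose events context was never set. */
    static final class ExchangeStub {
        private Object eventsCtx; // never assigned in this sketch, so it stays null

        String events() {
            return eventsCtx.toString(); // throws NullPointerException while eventsCtx is null
        }
    }

    /** Logs the failure but never completes the future, so waiters keep blocking. */
    static CompletableFuture<String> initSwallowing(ExchangeStub exch) {
        CompletableFuture<String> fut = new CompletableFuture<>();
        try {
            fut.complete(exch.events());
        } catch (NullPointerException e) {
            System.err.println("Failed to init new coordinator future: " + e);
        }
        return fut;
    }

    /** Completes the future with the failure, so waiters are released with an error. */
    static CompletableFuture<String> initPropagating(ExchangeStub exch) {
        CompletableFuture<String> fut = new CompletableFuture<>();
        try {
            fut.complete(exch.events());
        } catch (NullPointerException e) {
            fut.completeExceptionally(e);
        }
        return fut;
    }

    /** Returns true if the future is still incomplete after a short bounded wait. */
    static boolean timesOut(CompletableFuture<String> fut) throws InterruptedException {
        try {
            fut.get(200, TimeUnit.MILLISECONDS);
            return false;
        } catch (TimeoutException e) {
            return true;  // nobody completed the future: a waiter without a timeout would hang
        } catch (ExecutionException e) {
            return false; // the failure reached the waiter
        }
    }

    public static void main(String[] args) throws Exception {
        ExchangeStub exch = new ExchangeStub();
        System.out.println("swallowing, timed out: " + timesOut(initSwallowing(exch)));
        System.out.println("propagating, timed out: " + timesOut(initPropagating(exch)));
    }
}
```

In this sketch the bounded wait stands in for the rest of the cluster waiting on the new coordinator; with an unbounded wait the first variant would block indefinitely.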