[ 
https://issues.apache.org/jira/browse/IGNITE-10815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16728796#comment-16728796
 ] 

ASF GitHub Bot commented on IGNITE-10815:
-----------------------------------------

GitHub user Jokser opened a pull request:

    https://github.com/apache/ignite/pull/5746

    IGNITE-10815 Fixed coordinator failover in case of exchanges merge and 
non-affinity nodes

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gridgain/apache-ignite ignite-10815

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/ignite/pull/5746.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #5746
    
----
commit 97d4d22f12f0a24060ea1cd7253758065cf77023
Author: Pavel Kovalenko <jokserfn@...>
Date:   2018-12-25T17:25:57Z

    IGNITE-10815 WIP
    
    Signed-off-by: Pavel Kovalenko <jokse...@gmail.com>

commit 141f40b8742d12681b3f41f7ee3dbc3ae2702380
Author: Pavel Kovalenko <jokserfn@...>
Date:   2018-12-25T18:51:11Z

    IGNITE-10815 Fix and test.
    
    Signed-off-by: Pavel Kovalenko <jokse...@gmail.com>

commit 84b3fc09cca1f7133883bd44c64a1466d32d5b53
Author: Pavel Kovalenko <jokserfn@...>
Date:   2018-12-25T18:52:08Z

    IGNITE-10815 Cleanup
    
    Signed-off-by: Pavel Kovalenko <jokse...@gmail.com>

----


> NullPointerException in InitNewCoordinatorFuture.init() leads to cluster hang
> -----------------------------------------------------------------------------
>
>                 Key: IGNITE-10815
>                 URL: https://issues.apache.org/jira/browse/IGNITE-10815
>             Project: Ignite
>          Issue Type: Bug
>    Affects Versions: 2.4
>            Reporter: Anton Kurbanov
>            Assignee: Pavel Kovalenko
>            Priority: Critical
>             Fix For: 2.8
>
>
> Possible scenario to reproduce:
> 1. Force few consecutive exchange merges and finish.
> 2. Trigger exchange.
> 3. Shutdown coordinator node before sending/receiving full partitions message.
>  
> Stacktrace:
> {code:java}
> 2018-12-24 15:54:02,664 sys-#48%gg% ERROR 
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture
>  - Failed to init new coordinator future: bd74f7ed-6984-4f78-9941-480df673ab77
> java.lang.NullPointerException: null
> at 
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture.events(GridDhtPartitionsExchangeFuture.java:534)
>  ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at 
> org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager$18.applyx(CacheAffinitySharedManager.java:1790)
>  ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at 
> org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager$18.applyx(CacheAffinitySharedManager.java:1738)
>  ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at 
> org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager.forAllRegisteredCacheGroups(CacheAffinitySharedManager.java:1107)
>  ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at 
> org.apache.ignite.internal.processors.cache.CacheAffinitySharedManager.initCoordinatorCaches(CacheAffinitySharedManager.java:1738)
>  ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at 
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.InitNewCoordinatorFuture.init(InitNewCoordinatorFuture.java:104)
>  ~[ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at 
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture$8$1.call(GridDhtPartitionsExchangeFuture.java:3439)
>  [ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at 
> org.apache.ignite.internal.processors.cache.distributed.dht.preloader.GridDhtPartitionsExchangeFuture$8$1.call(GridDhtPartitionsExchangeFuture.java:3435)
>  [ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at 
> org.apache.ignite.internal.util.IgniteUtils.wrapThreadLoader(IgniteUtils.java:6720)
>  [ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at 
> org.apache.ignite.internal.processors.closure.GridClosureProcessor$2.body(GridClosureProcessor.java:967)
>  [ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at org.apache.ignite.internal.util.worker.GridWorker.run(GridWorker.java:110) 
> [ignite-core-2.4.13.b4.jar:2.4.13.b4]
> at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>  [?:1.8.0_171]
> at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>  [?:1.8.0_171]
> at java.lang.Thread.run(Thread.java:748) [?:1.8.0_171]
> {code}
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to