[ 
https://issues.apache.org/jira/browse/IGNITE-9913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16803849#comment-16803849
 ] 

Amelchev Nikita commented on IGNITE-9913:
-----------------------------------------

I have implemented lightweight PME (based on the PR by Ivan Rakov) for the 
case when a baseline server node leaves the topology.

I have benchmarked it against master under yardstick load 
(IgniteGetAndPutTxBenchmark, 6 servers, 2 clients with 64 threads each):
master:
 !master_yardstick.png! 
with my changes:
 !9913_yardstick.png! 

PME duration:
master: 1440+-35 ms (servers); 989+-87 ms (clients)
with changes: 117+-10 ms (servers and clients)

The maximum transaction latency also decreased:
master: 1439 ms
with changes: 293 ms

In summary, PME duration decreased by roughly 10 times and the maximum 
transaction latency decreased by almost 5 times.
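
For reference, the ratios behind this summary can be recomputed from the 
mean values reported above. This is only an arithmetic sketch: the class and 
method names are hypothetical, and the +- spreads are ignored.

```java
import java.util.Locale;

// Hypothetical helper, not part of Ignite or yardstick: recomputes the
// improvement ratios from the mean values quoted in this comment.
public class PmeSpeedup {
    /** Returns how many times smaller 'after' is compared to 'before'. */
    public static double ratio(double before, double after) {
        return before / after;
    }

    /** Formats a ratio with one decimal place, independent of the JVM locale. */
    public static String format(double ratio) {
        return String.format(Locale.ROOT, "%.1fx", ratio);
    }

    public static void main(String[] args) {
        // PME duration on servers: 1440 ms on master vs 117 ms with changes.
        System.out.println("PME, servers:    " + format(ratio(1440, 117)));
        // PME duration on clients: 989 ms on master vs 117 ms with changes.
        System.out.println("PME, clients:    " + format(ratio(989, 117)));
        // Max transaction latency: 1439 ms on master vs 293 ms with changes.
        System.out.println("Max tx latency:  " + format(ratio(1439, 293)));
    }
}
```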

TC tests look good (testRebalancingDuringLoad_N can be muted until 
IGNITE-11623 is resolved).


> Prevent data updates blocking in case of backup BLT server node leave
> ---------------------------------------------------------------------
>
>                 Key: IGNITE-9913
>                 URL: https://issues.apache.org/jira/browse/IGNITE-9913
>             Project: Ignite
>          Issue Type: Improvement
>          Components: general
>            Reporter: Ivan Rakov
>            Assignee: Amelchev Nikita
>            Priority: Major
>             Fix For: 2.8
>
>         Attachments: 9913_yardstick.png, master_yardstick.png
>
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> An Ignite cluster performs a distributed partition map exchange (PME) when 
> any server node leaves or joins the topology.
> Distributed PME blocks all updates and may take a long time. If all 
> partitions are assigned according to the baseline topology and a server node 
> leaves, there's no actual need to perform distributed PME: every cluster node 
> is able to recalculate new affinity assignments and partition states locally. 
> If we implement such a lightweight PME and handle mapping and lock requests 
> on the new topology version correctly, updates won't be stopped (except 
> updates of partitions that lost their primary copy).
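
The local recalculation described in the issue can be illustrated with a toy 
model. This is a minimal sketch and not Ignite's implementation: it assumes 
an assignment is a map from partition id to an ordered owner list whose first 
entry is the primary, and the class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeMap;
import java.util.TreeSet;

// Toy model (assumption): partition id -> ordered owners, first owner is primary.
public class LocalAffinitySketch {
    /**
     * Recalculates the assignment locally by dropping the left node from every
     * owner list. Each node can do this on its own because the input is the
     * same on all nodes, so no distributed exchange is needed.
     */
    public static Map<Integer, List<String>> recalculate(
            Map<Integer, List<String>> assignment, String leftNode) {
        Map<Integer, List<String>> res = new TreeMap<>();
        for (Map.Entry<Integer, List<String>> e : assignment.entrySet()) {
            List<String> owners = new ArrayList<>(e.getValue());
            owners.remove(leftNode); // a surviving backup moves up to primary
            res.put(e.getKey(), owners);
        }
        return res;
    }

    /** Returns partitions whose primary was the left node; only these are affected. */
    public static Set<Integer> partitionsThatLostPrimary(
            Map<Integer, List<String>> assignment, String leftNode) {
        Set<Integer> res = new TreeSet<>();
        for (Map.Entry<Integer, List<String>> e : assignment.entrySet()) {
            List<String> owners = e.getValue();
            if (!owners.isEmpty() && owners.get(0).equals(leftNode))
                res.add(e.getKey());
        }
        return res;
    }

    public static void main(String[] args) {
        Map<Integer, List<String>> assignment = new TreeMap<>();
        assignment.put(0, List.of("A", "B"));
        assignment.put(1, List.of("B", "C"));
        assignment.put(2, List.of("C", "A"));

        // Node B leaves: partition 1 loses its primary, partition 0 only a backup.
        System.out.println(recalculate(assignment, "B"));
        System.out.println(partitionsThatLostPrimary(assignment, "B"));
    }
}
```

In this model, updates to partitions 0 and 2 could continue while node B 
leaves, and only partition 1 would have to wait for its new primary.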



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
