[jira] [Commented] (IGNITE-3018) Cache affinity calculation is slow with large nodes number

Taras Ledkov (JIRA) Wed, 11 May 2016 01:48:32 -0700

    [ 
https://issues.apache.org/jira/browse/IGNITE-3018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15279802#comment-15279802
 ]


Taras Ledkov commented on IGNITE-3018:
--------------------------------------

Unfortunately bucket based distribution has a very bad re-balance index.
In the test below the value is the average partitions count that are changed 
after each change of the topology. 

Test 60 nodes. Old: 11.4; New: 19.8;
Test 100 nodes. Old: 7.6; New: 135.7;
Test 200 nodes. Old: 4.0; New: 100.1;
Test 300 nodes. Old: 2.7; New: 75.1;
Test 400 nodes. Old: 1.9; New: 60.5;
Test 500 nodes. Old: 1.9; New: 52.5;
Test 600 nodes. Old: 1.2; New: 46.0;

So, it looks like we have to use simple sort all hash(partition, node) to 
prevent rebalance problem.

> Cache affinity calculation is slow with large nodes number
> ----------------------------------------------------------
>
>                 Key: IGNITE-3018
>                 URL: https://issues.apache.org/jira/browse/IGNITE-3018
>             Project: Ignite
>          Issue Type: Bug
>          Components: cache
>            Reporter: Semen Boikov
>            Assignee: Taras Ledkov
>            Priority: Critical
>             Fix For: 1.7
>
>         Attachments: 003.png, 064.png, 100.png, 128.png, 200.png, 300.png, 
> 400.png, 500.png, 600.png
>
>
> With large number of cache server nodes (> 200)  RendezvousAffinityFunction 
> and FairAffinityFunction work pretty slow .
> For RendezvousAffinityFunction.assignPartitions can take hundredes of 
> milliseconds, for FairAffinityFunction it can take seconds.
> For RendezvousAffinityFunction most time is spent in MD5 hash calculation and 
> nodes list sorting. As optimization we can try to cache {partion, node} MD5 
> hash or try another hash function. Also several minor optimizations are 
> possible (avoid unncecessary allocations, only one thread local 'get', etc).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (IGNITE-3018) Cache affinity calculation is slow with large nodes number

Reply via email to