Re: [infinispan-dev] Denormalizing hashes

Radim Vansa Wed, 11 Dec 2013 00:38:41 -0800

Hi Dan

I am not speaking about changing something for the C++ client, Iunderstand that the client code cannot be changed in order to keep thebackward compatibility.

The current hash-wheel approach is working well, but there are few flawsthat could be fixed keeping the client code untouched. Please, correctme if I am wrong.

1) The denormalization is executed for every client for every topologychange/client join. I don't have any numbers, but calling the hashingalgorithm million times per every such occasion sounds as wastingcomputing power. -> cache the denormalized stuff on server

2) The server is sending numOwners hashIds per segment, one for eachowner. What's the reason for that? I think that only primary ownersshould be inserted there. This would:a) target all PUT requests to primary owner, reducing PUT latency andlowering the general load in cluster

b) reduce the routing information

And yes, ISPN-3530 and ISPN-3701 are pretty serious, but IMO ratherorthogonal to the segment vs. hash wheel approach and its details.


Radim


On 12/11/2013 09:18 AM, Dan Berindei wrote:

Hi Radim

Actually, it's me that wrote the denormalization code :)
It was meant as a stop-gap measure before we upgraded the HotRodprotocol to support the segment-based consistent hash, but thedenormalization worked well enough (or so we thought) that we didn'tget to changing the protocol yet.
That's not a big change in itself, but we also wanted to make theconsistent hash per-cache on the client (it's now per-cache manager),and that's a bit more complicated to do. And it's not like it wouldhave been a good idea to change this before starting the C++ client,the client would still have to support the current style of consistenthash.
On Tue, Dec 10, 2013 at 8:17 PM, Radim Vansa <rva...@redhat.com<mailto:rva...@redhat.com>> wrote:
    Hi Galder,

    as I am trying to debug some problem in C++ client, I was looking into
    the server code. And I am not sure whether I understand the code
    correctly, but it seems to me that the server denormalizes the
    consistent hash for each client anew (after each topology change or
    client joining). Is this true? Looking into trace logs, I can see
    stuff
    like

    18:15:17,339 TRACE [org.infinispan.server.hotrod.Encoders$Encoder12$]
    (HotRodServerWorker-12) Writing hash id 639767 for
    192.168.11.101:11222 <http://192.168.11.101:11222>

     From denormalizeSegmentHashIds() method I see that this means that we
    have executed the hash function 639768 times just to notify one
    client.
    Is my understanding correct?
Yes, this happens every time a client joins and/or every time thecache topology changes.
We could easily cache the result of denormalizeSegmentHashIds, as itonly depends on the number of segments. It's just that I wasn'texpecting it to take so many iterations.
    Also, there is nothing like the concept of primary owner, is this
    right?
The client CH doesn't have a concept of backup owners. But for each(hash id, server) pair that gets sent to the client, it means all thehash codes between the previous hash id and this hash id have thisserver as the primary owner. The server in the next (hash id, server)pair is the first backup, and so on.
For each segment, the server generates numOwners (hash id, server)pairs. That means, for most of the hash codes in the segment, the listof owners on the client will be the same as the list of owners on theserver. But for 0.0002 (leewayFraction) of the hash codes, the clientprimary owner will be indeed one of the server backup owners.
    I thought that every first request in HotRod will go to primary owner,
    so that the PUT does not have to do the first hop and is executed
    directly on the primary. But it seems to me that it goes to any of the
    owners (practically random one, as you are only looking for the
    numOwner
    ids in leeway = on the beginning of the range - then, 99.98% or more
    requests should go to the server with last position in the
    leeway). This
    looks pretty suboptimal for writes, isn't it?
I'm not sure what you mean here, but I'm pretty sure the request goesto the correct server because we have a test for it:ConsistentHashV1IntegrationTest
Cheers
Dan


    Cheers

    Radim

    PS: for every line of code you write in Scala, God kills a kitten

    --
    Radim Vansa <rva...@redhat.com <mailto:rva...@redhat.com>>
    JBoss DataGrid QA

    _______________________________________________
    infinispan-dev mailing list
    infinispan-dev@lists.jboss.org <mailto:infinispan-dev@lists.jboss.org>
    https://lists.jboss.org/mailman/listinfo/infinispan-dev




_______________________________________________
infinispan-dev mailing list
infinispan-dev@lists.jboss.org
https://lists.jboss.org/mailman/listinfo/infinispan-dev



--
Radim Vansa <rva...@redhat.com>
JBoss DataGrid QA

_______________________________________________
infinispan-dev mailing list
infinispan-dev@lists.jboss.org
https://lists.jboss.org/mailman/listinfo/infinispan-dev

Re: [infinispan-dev] Denormalizing hashes

Reply via email to