Re: Question about JDBM key/value replacement

Emmanuel Lécharny Fri, 27 Apr 2012 09:08:48 -0700

Le 4/27/12 5:46 PM, Selcuk AYA a écrit :

On Fri, Apr 27, 2012 at 6:06 AM, Emmanuel Lécharny<[email protected]>  wrote:

Le 4/27/12 2:09 PM, Selcuk AYA a écrit :

Hi,
On Thu, Apr 26, 2012 at 4:52 PM, Emmanuel Lécharny<[email protected]>
  wrote:

Hi guys,

currently, we can replace an existing value when its associated key
exists
in the BTree. What I'd like to do is to replace both key and value in
this
case.

The rational is that the key mau contain some extra information that are
not
used to find the key, like the nbChildren/nbDescendant for a
ParentIdAndrdn
key in the RdnIndex.

Is there an easy way to do that with JDBM, or should we extend the API it
offers ?

as I understand you are trying to store some auxiliary information per
key?

Yep.

  With JDBM we can:
   - provide<K, aux>    tuple as the key and pass the key comparator to
be the comparator for the actual keys.
   - change JDBM code to update key inside when replacing an entry.

however, in general, I do not think we can expect a key/value store to
update the key when replacing an entry. So I am not sure it is a good
idea to do this kind of thing.


The pb is that those auxiliary informations are not part of the compare
being done. Ie, two keys may be seen as equals, but the auxiliary
information might be different. In this very case, I'd like the new version
of the key replaced.
It may be very specific to my need, and we can also consider that the value
might cary those extra informations, too, instead of storing them in the
key.

So for the<ParentIdAndRdn, ID>  tuple, where the data structure is :
ParentIdAndRdn :
{
    ID parentId,
    Rdn[] rdns,
    int nbChildren,
    int nbDescendant
}

we could inject the two ints after having read the value if this value
structure was :

AugmentedID
{
    ID id
    int nbChildren,
    int nbDescendant
}

I think this would be better. However, note that in txns branch, we
removed most of the generics and assumed that indices store UUID as
their value. I hope we go  with removal/add ( I guess this is what you
are doing now)

Should not be a big problem here. Generics are an issue anyway...

and serializing the ParentIdAndRdn ID and Rdn[] only.

I am kind of confused, I have a couple of questions:

1) Can you explain why we need to store nbChildren and nbDescedant but
do not need to serialize it?

We *do* serialize those elements. What I suggest is that we store thoseinformation in the values, not in the keys, as in any case, we will geta tuple <key, value>, so we will always have the extended informations.

What if the page storing an entry is
kicked out of cache and we loose nbChildren and nbDescendant?

Then we will read them from disk, if the cache does not contain thoseinformation, right ?


2)Connected with the above question, how do we maintain these
auxiliary data? Are there maintained only in memory or on disk as
well? Do they have to absolutely correct to get consistent data out of
the server?

On memory and on disk. They are absolutely critical as they are used tobrowe the tree while doing a One/Subtree scope search.


3) Lets say I have<rdn1, rdn2,rdn3>  and<rdn1,rdn2, rdn4>  . Thread 1
adds<rdn1, rdn2,rdn3, rdn5>  and thread 2 add<rdn1,rdn2, rdn4,rdn6>
concurrently.
These two threads run concurrently but they should not
logically conflict. Now they have to read nbDescendants of<rdn1,rdn2>
and increment it but they have to do it in such a way that the net
effect of these two threads is serialized.

yep.

I am not sure how to do
this.

do you mean that the indexes aren't gloablly protected when we modifysome data ? I thought that a modification will protect all the index itwill update until they have all been updated, then the next modificationcan operate.

We don't really care if that serialize the modifications, because theserver does not have to be fast when we inject data, only when we readdata. At least, we could think about a serialization over the operationson one index like the RdnIndex (as an entry modification may update morethan one entry in this index).


Is that a problem ?

--
Regards,
Cordialement,
Emmanuel Lécharny
www.iktek.com

Re: Question about JDBM key/value replacement

Reply via email to