On Fri, Sep 12, 2014 at 9:46 AM, Somnath Roy <somnath....@sandisk.com> wrote:
>
> Make perfect sense Sage..
>
> Regarding striping of filedata, You are saying KeyValue interface will do the 
> following for me?
>
> 1. Say in case of rbd image of order 4 MB, a write request coming to 
> Key/Value interface, it will  chunk the object (say full 4MB) in smaller 
> sizes (configurable ?) and stripe it as multiple key/value pair ?


Yes, and the stripe size can be configurated.

>
>
> 2. Also, while reading it will take care of accumulating and send it back.



Do you have any other idea? By the way, could you tell more about your
key/value interface. I'm doing some jobs for NVMe interface with intel
NVMe SSD.

>
>
>
> Thanks & Regards
> Somnath
>
>
> -----Original Message-----
> From: Sage Weil [mailto:sw...@redhat.com]
> Sent: Thursday, September 11, 2014 6:31 PM
> To: Somnath Roy
> Cc: Haomai Wang (haomaiw...@gmail.com); ceph-users@lists.ceph.com; 
> ceph-de...@vger.kernel.org
> Subject: Re: Regarding key/value interface
>
> Hi Somnath,
>
> On Fri, 12 Sep 2014, Somnath Roy wrote:
> >
> > Hi Sage/Haomai,
> >
> > If I have a key/value backend that support transaction, range queries
> > (and I don?t need any explicit caching etc.) and I want to replace
> > filestore (and leveldb omap) with that,  which interface you recommend
> > me to derive from , directly ObjectStore or  KeyValueDB ?
> >
> > I have already integrated this backend by deriving from ObjectStore
> > interfaces earlier (pre keyvalueinteface days) but not tested
> > thoroughly enough to see what functionality is broken (Basic
> > functionalities of RGW/RBD are working fine).
> >
> > Basically, I want to know what are the advantages (and disadvantages)
> > of deriving it from the new key/value interfaces ?
> >
> > Also, what state is it in ? Is it feature complete and supporting all
> > the ObjectStore interfaces like clone and all ?
>
> Everything is supported, I think, for perhaps some IO hints that don't make 
> sense in a k/v context.  The big things that you get by using KeyValueStore 
> and plugging into the lower-level interface are:
>
>  - striping of file data across keys
>  - efficient clone
>  - a zillion smaller methods that aren't conceptually difficult to implement 
> bug tedious and to do so.
>
> The other nice thing about reusing this code is that you can use a leveldb or 
> rocksdb backend as a reference for testing or performance or whatever.
>
> The main thing that will be a challenge going forward, I predict, is making 
> storage of the object byte payload in key/value pairs efficient.  I think 
> KeyValuestore is doing some simple striping, but it will suffer for small 
> overwrites (like 512-byte or 4k writes from an RBD).  There are probably some 
> pretty simple heuristics and tricks that can be done to mitigate the most 
> common patterns, but there is no simple solution since the backends generally 
> don't support partial value updates (I assume yours doesn't either?).  But, 
> any work done here will benefit the other backends too so that would be a 
> win..
>
> sage
>
> ________________________________
>
> PLEASE NOTE: The information contained in this electronic mail message is 
> intended only for the use of the designated recipient(s) named above. If the 
> reader of this message is not the intended recipient, you are hereby notified 
> that you have received this message in error and that any review, 
> dissemination, distribution, or copying of this message is strictly 
> prohibited. If you have received this communication in error, please notify 
> the sender by telephone or e-mail (as shown above) immediately and destroy 
> any and all copies of this message in your possession (whether hard copies or 
> electronically stored copies).
>



-- 

Best Regards,

Wheat
_______________________________________________
ceph-users mailing list
ceph-users@lists.ceph.com
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

Reply via email to