Re: [ceph-users] basic questions about pool

pragya jain Tue, 15 Jul 2014 21:47:29 -0700

thank you very much, Karan, for your explanation.

Regards 
Pragya Jain



On Tuesday, 15 July 2014 1:53 PM, Karan Singh <karan.si...@csc.fi> wrote:
 

>
>
>Hi Pragya
>
>
>Let me try to answer these.
>
>
>1#  The decision is based on your use case (performance, reliability). If 
>you need high performance out of your cluster, the deployer will create a 
>pool on SSDs and assign this pool to applications that require higher I/O. 
>For example, if you integrate OpenStack with Ceph, you can instruct the 
>OpenStack configuration files to write data to a specific Ceph pool 
>(http://ceph.com/docs/master/rbd/rbd-openstack/#configuring-glance). 
>Similarly, you can tell CephFS and RadosGW which pool to use for data 
>storage.
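
As a rough illustration of the Glance case mentioned above: the sketch below renders the RBD-related options that the linked page describes for glance-api.conf. The pool name "images", the user "glance" and the ceph.conf path are assumptions taken from common examples, and the section the options belong in differs between OpenStack releases, so treat this as a sketch rather than a drop-in config.

```python
import configparser
import io


def glance_rbd_settings(pool: str, user: str = "glance") -> str:
    """Render RBD-related glance-api.conf options as INI text.

    The pool and user names are illustrative assumptions; the section
    used here ([DEFAULT]) may differ in your OpenStack release.
    """
    cfg = configparser.ConfigParser()
    cfg["DEFAULT"] = {
        "default_store": "rbd",           # store images in Ceph RBD
        "rbd_store_user": user,           # cephx user Glance connects as
        "rbd_store_pool": pool,           # the Ceph pool that receives images
        "rbd_store_ceph_conf": "/etc/ceph/ceph.conf",
    }
    buf = io.StringIO()
    cfg.write(buf)
    return buf.getvalue()


if __name__ == "__main__":
    print(glance_rbd_settings("images"))
```

The point is only that the pool is chosen in the application's configuration, not by the end user.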
>
>
>2#  Usually the end user (a client of the Ceph cluster) does not care where 
>the data is stored, which pool it is using, or what the real 
>physical location of the data is. The end user will ask for specific 
>performance, reliability and availability. It is the job of the Ceph admin 
>to fulfil their storage requirements using Ceph features such as SSDs, 
>erasure coding, replication level etc.
>
>
>
>
>Block Device :- End user will tell the application (Qemu / KVM, 
>OpenStack etc.) which pool it should use for data storage. rbd is the default 
>pool for block devices.
>CephFS :- End user will mount CephFS as a filesystem and use it from there. 
>Its default pools are data and metadata.
>RadosGW :- End user will store objects using the S3 or Swift API. 
>
>
>
>
>
>- Karan Singh -
>
>On 15 Jul 2014, at 07:42, pragya jain <prag_2...@yahoo.co.in> wrote:
>
>>Thank you very much, Craig, for your clear explanation of my questions. 
>>
>>
>>Now I am very clear about the concept of pools in ceph.
>>
>>
>>But I have two small questions:
>>1. How does the deployer decide that a particular type of information will be 
>>stored in a particular pool? Are there any settings that a deployer should 
>>make when creating a pool to determine which type of data will be 
>>stored in which pool?
>>
>>
>>2. How does an end user specify which pool his/her data will be stored in? 
>>How can an end user find out which pools are stored on SSDs or on 
>>HDDs, and what the properties of a particular pool are? 
>>
>>
>>Thanks again. Please help me clear up these points as well. 
>>
>>
>>Regards
>>Pragya Jain
>>
>>
>>
>>On Sunday, 13 July 2014 5:04 AM, Craig Lewis <cle...@centraldesktop.com> 
>>wrote:
>> 
>>
>>>
>>>
>>>I'll answer out of order.
>>>
>>>
>>>#2: rbd is used for RBD images.  data and metadata are used by CephFS.  
>>>RadosGW's default pools will be created the first time radosgw starts up.  
>>>If you aren't using RBD or CephFS, you can ignore those pools.
>>>
>>>
>>>#1: RadosGW will use several pools to segregate its data.  There are a 
>>>couple of pools for storing user/subuser information, as well as pools for 
>>>storing the actual data.  I'm using federation, and I have a total of 18 
>>>pools that RadosGW is using in some form.  Pools are a way to logically 
>>>separate your data, and pools can also have different replication/storage 
>>>settings.  For example, I could say that the .rgw.buckets.index pool needs 
>>>4x replication and is only stored on SSDs, while .rgw.buckets is 3x 
>>>replication on HDDs.
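
To make the example above concrete, here is a minimal sketch that builds (but does not run) the ceph CLI commands for two pools with different replica counts and CRUSH rulesets. The pg_num values and ruleset IDs are made-up placeholders, and it assumes CRUSH rulesets selecting SSD or HDD hosts already exist. The setting is called crush_ruleset in releases of this era; later releases renamed it crush_rule.

```python
import shlex


def pool_commands(name, pg_num, replicas, crush_ruleset):
    """Return the ceph CLI invocations for one pool, as argument lists.

    Nothing is executed here; the caller decides whether to run them.
    pg_num and crush_ruleset are placeholders for your own cluster values.
    """
    return [
        ["ceph", "osd", "pool", "create", name, str(pg_num)],
        ["ceph", "osd", "pool", "set", name, "size", str(replicas)],
        ["ceph", "osd", "pool", "set", name, "crush_ruleset", str(crush_ruleset)],
    ]


if __name__ == "__main__":
    # Hypothetical values: ruleset 1 = SSD hosts, ruleset 0 = HDD hosts.
    plan = [
        (".rgw.buckets.index", 64, 4, 1),
        (".rgw.buckets", 512, 3, 0),
    ]
    for name, pg_num, replicas, ruleset in plan:
        for cmd in pool_commands(name, pg_num, replicas, ruleset):
            print(shlex.join(cmd))
```

Printing the commands first makes it easy to review them before running anything against a live cluster.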
>>>
>>>
>>>#3: In addition to #1, you can set up different pools to actually store user 
>>>data in RadosGW.  For example, an end user may have some very important data 
>>>that you want replicated 4 times, and some other data that needs to be 
>>>stored on SSDs for low latency.  Using CRUSH, you would create some 
>>>rados pools with those specs.  Then you'd set up some placement targets in 
>>>RadosGW that use those pools.  A user that cares will specify a placement 
>>>target when they create a bucket.  That way they can decide what the storage 
>>>requirements are.  If they don't care, then they can just use the default.
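
A rough sketch of what the placement-target mapping described above might look like. The key names (placement_targets, placement_pools, index_pool, data_pool) follow the region and zone JSON layout as I understand it for federated RadosGW of this era, so please compare against the output of "radosgw-admin region get" and "radosgw-admin zone get" on your own cluster before relying on them. The target names and pool names below are hypothetical.

```python
import json


def placement_fragments(targets):
    """Build region and zone JSON fragments for RadosGW placement targets.

    targets maps a placement target name to (index_pool, data_pool).
    Key names are an assumption about the region/zone JSON layout.
    """
    region = {
        "placement_targets": [{"name": name, "tags": []} for name in targets],
    }
    zone = {
        "placement_pools": [
            {"key": name, "val": {"index_pool": index_pool, "data_pool": data_pool}}
            for name, (index_pool, data_pool) in targets.items()
        ],
    }
    return region, zone


if __name__ == "__main__":
    # Hypothetical targets: a default one and a low-latency SSD one.
    region, zone = placement_fragments({
        "default-placement": (".rgw.buckets.index", ".rgw.buckets"),
        "ssd-placement": (".rgw.buckets.index", ".rgw.buckets.ssd"),
    })
    print(json.dumps(region, indent=2))
    print(json.dumps(zone, indent=2))
```

The region fragment lists which targets a user may ask for, and the zone fragment says which rados pools back each target.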
>>>
>>>
>>>Does that help?
>>>
>>>
>>>
>>>
>>>
>>>On Thu, Jul 10, 2014 at 11:34 PM, pragya jain <prag_2...@yahoo.co.in> wrote:
>>>
>>>>Hi all,
>>>>
>>>>
>>>>I have some very basic questions about pools in ceph.
>>>>
>>>>
>>>>According to the Ceph documentation, when we deploy a Ceph cluster with a 
>>>>radosgw instance on top of it, Ceph creates pools by default to store the 
>>>>data, or the deployer can create pools according to the requirements.
>>>>
>>>>
>>>>Now, my question is:
>>>>1. What is the relevance of multiple pools in a cluster?
>>>>i.e. why should a deployer create multiple pools in a cluster? What are 
>>>>the benefits of creating multiple pools?
>>>>
>>>>
>>>>2. According to the docs, the default pools are data, metadata, and rbd.
>>>>What is the difference among these three types of pools?
>>>>
>>>>
>>>>3. When a system deployer has deployed a Ceph cluster with a radosgw 
>>>>interface and starts providing services to end users, so that an end user 
>>>>can create an account on the Ceph cluster and store/retrieve their 
>>>>data to/from the cluster, does the end user need to be concerned about the 
>>>>pools created in the cluster?
>>>>
>>>>
>>>>Could somebody please help me clear up these questions?
>>>>
>>>>
>>>>regards
>>>>Pragya Jain
>>>>_______________________________________________
>>>>ceph-users mailing list
>>>>ceph-users@lists.ceph.com
>>>>http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>>>>
>>>>
>>>
>>>
>>
>
>
>

