Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-11-21 Thread Danny Al-Gaaf
Hi,

no, I have it basically ready but I have to run some tests before.
You'll have it in the next days!

Danny

Am 21.11.2012 01:23, schrieb Sage Weil:
> If you haven't gotten to this yet, I'll go ahead and jump on it..
> let me know!
> 
> Thanks- sage
> 
> 
> On Thu, 9 Aug 2012, Danny Kukawka wrote:
> 
>> Remove btrfs specific keys and replace them by more generic keys
>> to be able to replace btrfs with e.g. xfs or ext4 easily.
>> 
>> Add new key to define the osd fs type: 'fstype', which can get 
>> defined in the [osd] section for all OSDs.
>> 
>> Replace: - 'btrfs devs' -> 'devs' - 'btrfs path' -> 'fs path' -
>> 'btrfs options' -> 'fs options' - mkcephfs: replace --mkbtrfs
>> with --mkfs - init-ceph: replace --btrfs with --fsmount,
>> --nobtrfs with --nofsmount, --btrfsumount with --fsumount
>> 
>> Update documentation, manpage and example config files.
>> 
>> Signed-off-by: Danny Kukawka  --- 
>> doc/man/8/mkcephfs.rst  |   17 +++- 
>> man/mkcephfs.8  |   15 +++ 
>> src/ceph.conf.twoosds   |7 ++-- 
>> src/init-ceph.in|   50
>> +- src/mkcephfs.in
>> |   60 +-- src/sample.ceph.conf
>> |   15 --- src/test/cli/osdmaptool/ceph.conf.withracks |3
>> +- 7 Dateien ge?ndert, 95 Zeilen hinzugef?gt(+), 72 Zeilen
>> entfernt(-)
>> 
>> diff --git a/doc/man/8/mkcephfs.rst b/doc/man/8/mkcephfs.rst 
>> index ddc378a..dd3fbd5 100644 --- a/doc/man/8/mkcephfs.rst +++
>> b/doc/man/8/mkcephfs.rst @@ -70,20 +70,15 @@ Options default is
>> ``/etc/ceph/keyring`` (or whatever is specified in the config
>> file).
>> 
>> -.. option:: --mkbtrfs +.. option:: --mkfs
>> 
>> -   Create and mount the any btrfs file systems specified in the 
>> -   ceph.conf for OSD data storage using mkfs.btrfs. The "btrfs
>> devs" -   and (if it differs from "osd data") "btrfs path"
>> options must be -   defined. +   Create and mount any file system
>> specified in the ceph.conf for +   OSD data storage using mkfs.
>> The "devs" and (if it differs from +   "osd data") "fs path"
>> options must be defined.
>> 
>> **NOTE** Btrfs is still considered experimental.  This option -
>> can ease some configuration pain, but is the use of btrfs is not 
>> -   required when ``osd data`` directories are mounted manually
>> by the -   adminstrator. - -   **NOTE** This option is deprecated
>> and will be removed in a future -   release. +   can ease some
>> configuration pain, but is not required when +   ``osd data``
>> directories are mounted manually by the adminstrator.
>> 
>> .. option:: --no-copy-conf
>> 
>> diff --git a/man/mkcephfs.8 b/man/mkcephfs.8 index
>> 8544a01..22a5335 100644 --- a/man/mkcephfs.8 +++
>> b/man/mkcephfs.8 @@ -32,7 +32,7 @@ level margin:
>> \\n[rst2man-indent\\n[rst2man-indent-level]] . .SH SYNOPSIS .nf 
>> -\fBmkcephfs\fP [ \-c \fIceph.conf\fP ] [ \-\-mkbtrfs ] [ \-a,
>> \-\-all\-hosts [ \-k +\fBmkcephfs\fP [ \-c \fIceph.conf\fP ] [
>> \-\-mkfs ] [ \-a, \-\-all\-hosts [ \-k 
>> \fI/path/to/admin.keyring\fP ] ] .fi .sp @@ -111,19 +111,16 @@
>> config file). .UNINDENT .INDENT 0.0 .TP -.B \-\-mkbtrfs -Create
>> and mount the any btrfs file systems specified in the -ceph.conf
>> for OSD data storage using mkfs.btrfs. The "btrfs devs" -and (if
>> it differs from "osd data") "btrfs path" options must be +.B
>> \-\-mkfs +Create and mount any file systems specified in the 
>> +ceph.conf for OSD data storage using mkfs.*. The "devs" +and (if
>> it differs from "osd data") "fs path" options must be defined. 
>> .sp \fBNOTE\fP Btrfs is still considered experimental.  This
>> option -can ease some configuration pain, but is the use of btrfs
>> is not +can ease some configuration pain, but the use of this
>> option is not required when \fBosd data\fP directories are
>> mounted manually by the adminstrator. -.sp -\fBNOTE\fP This
>> option is deprecated and will be removed in a future -release. 
>> .UNINDENT .INDENT 0.0 .TP diff --git a/src/ceph.conf.twoosds
>> b/src/ceph.conf.twoosds index c0cfc68..05ca754 100644 ---
>> a/src/ceph.conf.twoosds +++ b/src/ceph.conf.twoosds @@ -67,7
>> +67,8 @@ debug journal = 20 log dir = /data/cosd$id osd data =
>> /mnt/osd$id -btrfs options = "flushoncommit,usertrans" + fs
>> options = "flushoncommit,usertrans" +fstype = btrfs ;user =
>> root
>> 
>> ;osd journal = /mnt/osd$id/journal @@ -75,8 +76,8 @@ osd journal
>> = "/dev/disk/by-path/pci-:05:02.0-scsi-6:0:0:0" ;filestore
>> max sync interval = 1
>> 
>> -btrfs devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0" 
>> -;   btrfs devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0
>> \ +  devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0" +;
>> devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0 \ ;
>> /dev/disk/by-path/pci-:05:01.0-scsi-3:0:0:0 \ ;
>> /dev/disk/by-path/pci-:05:01.0-scsi-4:0:0:0 \ ;
>> /dev/disk/by-path/pci-:05:0

Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-11-20 Thread Sage Weil
If you haven't gotten to this yet, I'll go ahead and jump on it.. let me 
know!

Thanks-
sage


On Thu, 9 Aug 2012, Danny Kukawka wrote:

> Remove btrfs specific keys and replace them by more generic
> keys to be able to replace btrfs with e.g. xfs or ext4 easily.
> 
> Add new key to define the osd fs type: 'fstype', which can get
> defined in the [osd] section for all OSDs.
> 
> Replace:
> - 'btrfs devs' -> 'devs'
> - 'btrfs path' -> 'fs path'
> - 'btrfs options' -> 'fs options'
> - mkcephfs: replace --mkbtrfs with --mkfs
> - init-ceph: replace --btrfs with --fsmount, --nobtrfs
> with --nofsmount, --btrfsumount with --fsumount
> 
> Update documentation, manpage and example config files.
> 
> Signed-off-by: Danny Kukawka 
> ---
>  doc/man/8/mkcephfs.rst  |   17 +++-
>  man/mkcephfs.8  |   15 +++
>  src/ceph.conf.twoosds   |7 ++--
>  src/init-ceph.in|   50 +-
>  src/mkcephfs.in |   60 
> +--
>  src/sample.ceph.conf|   15 ---
>  src/test/cli/osdmaptool/ceph.conf.withracks |3 +-
>  7 Dateien ge?ndert, 95 Zeilen hinzugef?gt(+), 72 Zeilen entfernt(-)
> 
> diff --git a/doc/man/8/mkcephfs.rst b/doc/man/8/mkcephfs.rst
> index ddc378a..dd3fbd5 100644
> --- a/doc/man/8/mkcephfs.rst
> +++ b/doc/man/8/mkcephfs.rst
> @@ -70,20 +70,15 @@ Options
> default is ``/etc/ceph/keyring`` (or whatever is specified in the
> config file).
>  
> -.. option:: --mkbtrfs
> +.. option:: --mkfs
>  
> -   Create and mount the any btrfs file systems specified in the
> -   ceph.conf for OSD data storage using mkfs.btrfs. The "btrfs devs"
> -   and (if it differs from "osd data") "btrfs path" options must be
> -   defined.
> +   Create and mount any file system specified in the ceph.conf for 
> +   OSD data storage using mkfs. The "devs" and (if it differs from 
> +   "osd data") "fs path" options must be defined.
>  
> **NOTE** Btrfs is still considered experimental.  This option
> -   can ease some configuration pain, but is the use of btrfs is not
> -   required when ``osd data`` directories are mounted manually by the
> -   adminstrator.
> -
> -   **NOTE** This option is deprecated and will be removed in a future
> -   release.
> +   can ease some configuration pain, but is not required when 
> +   ``osd data`` directories are mounted manually by the adminstrator.
>  
>  .. option:: --no-copy-conf
>  
> diff --git a/man/mkcephfs.8 b/man/mkcephfs.8
> index 8544a01..22a5335 100644
> --- a/man/mkcephfs.8
> +++ b/man/mkcephfs.8
> @@ -32,7 +32,7 @@ level margin: \\n[rst2man-indent\\n[rst2man-indent-level]]
>  .
>  .SH SYNOPSIS
>  .nf
> -\fBmkcephfs\fP [ \-c \fIceph.conf\fP ] [ \-\-mkbtrfs ] [ \-a, \-\-all\-hosts 
> [ \-k
> +\fBmkcephfs\fP [ \-c \fIceph.conf\fP ] [ \-\-mkfs ] [ \-a, \-\-all\-hosts [ 
> \-k
>  \fI/path/to/admin.keyring\fP ] ]
>  .fi
>  .sp
> @@ -111,19 +111,16 @@ config file).
>  .UNINDENT
>  .INDENT 0.0
>  .TP
> -.B \-\-mkbtrfs
> -Create and mount the any btrfs file systems specified in the
> -ceph.conf for OSD data storage using mkfs.btrfs. The "btrfs devs"
> -and (if it differs from "osd data") "btrfs path" options must be
> +.B \-\-mkfs
> +Create and mount any file systems specified in the
> +ceph.conf for OSD data storage using mkfs.*. The "devs"
> +and (if it differs from "osd data") "fs path" options must be
>  defined.
>  .sp
>  \fBNOTE\fP Btrfs is still considered experimental.  This option
> -can ease some configuration pain, but is the use of btrfs is not
> +can ease some configuration pain, but the use of this option is not
>  required when \fBosd data\fP directories are mounted manually by the
>  adminstrator.
> -.sp
> -\fBNOTE\fP This option is deprecated and will be removed in a future
> -release.
>  .UNINDENT
>  .INDENT 0.0
>  .TP
> diff --git a/src/ceph.conf.twoosds b/src/ceph.conf.twoosds
> index c0cfc68..05ca754 100644
> --- a/src/ceph.conf.twoosds
> +++ b/src/ceph.conf.twoosds
> @@ -67,7 +67,8 @@
>   debug journal = 20
>   log dir = /data/cosd$id
>   osd data = /mnt/osd$id
> - btrfs options = "flushoncommit,usertrans"
> + fs options = "flushoncommit,usertrans"
> + fstype = btrfs
>  ;user = root
>  
>  ;osd journal = /mnt/osd$id/journal
> @@ -75,8 +76,8 @@
>   osd journal = "/dev/disk/by-path/pci-:05:02.0-scsi-6:0:0:0"
>  ;filestore max sync interval = 1
>  
> - btrfs devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0"
> -;btrfs devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0 \
> + devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0"
> +;devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0 \
>  ;   /dev/disk/by-path/pci-:05:01.0-scsi-3:0:0:0 \
>  ;   /dev/disk/by-path/pci-:05:01.0-scsi-4:0:0:0 \
>  ;   /dev/disk/by-path/pci-:05:01.0-scsi-5:0:0:0 \
> diff --git a/

Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-11-15 Thread Sage Weil
On Thu, 15 Nov 2012, Danny Al-Gaaf wrote:
> Hi Sage,
> 
> Am 15.11.2012 01:12, schrieb Sage Weil:
> > Hi Danny,
> > 
> > Have you had a chance to work on this?  I'd like to include this 
> > in bobtail.  If you don't have time we can go ahead an implement it, but 
> > I'd like avoid duplicating effort.
> 
> I already work on it. Do you have a deadline for bobtail?

Release is ~3 weeks off, but it is technically frozen.  This week would be 
best so that we can make sure it is well tested.

Thanks!
sage
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-11-15 Thread Danny Al-Gaaf
Hi Sage,

Am 15.11.2012 01:12, schrieb Sage Weil:
> Hi Danny,
> 
> Have you had a chance to work on this?  I'd like to include this 
> in bobtail.  If you don't have time we can go ahead an implement it, but 
> I'd like avoid duplicating effort.

I already work on it. Do you have a deadline for bobtail?

Danny

> Thanks!
> sage

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-11-14 Thread Sage Weil
Hi Danny,

Have you had a chance to work on this?  I'd like to include this 
in bobtail.  If you don't have time we can go ahead an implement it, but 
I'd like avoid duplicating effort.

Thanks!
sage


On Fri, 2 Nov 2012, Danny Al-Gaaf wrote:
> Hi Sage,
> 
> sorry for the late reply, was absent some weeks and busy with other issues.
> 
> Am 17.08.2012 01:40, schrieb Sage Weil:
> > On Thu, 16 Aug 2012, Tommi Virtanen wrote:
> >> On Thu, Aug 16, 2012 at 3:32 PM, Sage Weil  wrote:
> >>> As for the new options, I suggest:
> >>>
> >>>  * osd fs type
> >>>  * osd fs devs   (will work for mkcephfs, not for new stuff)
> >>>  * osd fs path
> >>>  * osd fs options
> >>
> >> What does osd_fs_path mean, and how is it different from the osd_data dir?
> > 
> > The idea was that you might wand the fs mounted somewhere other that 
> > osd_data.  I'm not sure it's useful; we may as well drop that...
> > 
> >> I'm expecting to need both mkfs-time options (btrfs metadata block
> >> size etc) and mount-time options (noatime etc).
> >>
> >> It would be nice if there was a way to set the options for all
> >> fstypes, and then just toggle which one is used (by default). That
> >> avoids bugs like trying to mkfs/mount btrfs with xfs-specific options,
> >> and vice versa.
> >>
> >> I'm not sure how well our config system will handle dynamic variable
> >> names -- ceph-authtool was fine with me just putting data in
> >> osd_crush_location, and we don't need to access these variables from
> >> C++, so it should be fine. If you really wanted to, you could probably
> >> cram the them into a single variable, with ad hoc structured data in
> >> the string value, but that's ugly.. Or just hardcode the list of
> >> possible filesystems, and then it's not dynamic variable names
> >> anymore.
> > 
> > Yeah, ceph-conf will happily take anything.  The C++ code has to do 
> > slightly more work to get arbitrary config fields, but that's not an 
> > issue.
> > 
> >> So I'm dreaming of something like:
> >>
> >> [osd]
> >> # what mount options will be passed when an osd data disk is using
> >> # one of these filesystems; these are passed to mount -o
> >> osd mount options btrfs = herp,foo=bar
> >> osd mount options xfs = noatime,derp
> >>
> >> # what mkfs options are used when creating new osd data disk
> >> # filesystems
> >> osd mkfs options btrfs = --hur
> >> osd mkfs options xfs = --dur
> >>
> >> # what fstype to use by default when mkfs'ing; mounting will detect
> >> # what's there (with blkid) and work with anything
> >> osd mkfs type = btrfs
> 
> I will prepare a patch with these for the current mkcephfs and init-ceph
> incl. aliases for the old keys and cmdline options where possible.
> 
> >> # this may go away with "mkcephfs 2.0", and it will have to get more
> >> # complex if we provide something for journals too, etc, because you
> >> # may want to pair specific data disks to specific journals (DH has
> >> # this need).. haven't had time to think it through, which is why i'm
> >> # leaning toward "and here's a hook where you run something on the
> >> # host that calls ceph-disk-prepare etc on all the disks you want",
> >> # and using uuids to match journals to data disks -- this work has
> >> # not yet started)
> >> osd fs devs = /dev/sdb /dev/sdc
> > 
> > This all looks good to me.  What do you think, Danny?
> 
> This part (osd fs devs) is for a new mkcepfs.2.0 if I understand you
> correctly. Sounds okay for me atm. (Tommi: Are there any new information
> on this? Did you already start to work on this?)
> 
> Danny
> 
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majord...@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 
> 
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-11-02 Thread Sage Weil
On Fri, 2 Nov 2012, Danny Al-Gaaf wrote:
> Hi Sage,
> 
> sorry for the late reply, was absent some weeks and busy with other issues.
> 
> Am 17.08.2012 01:40, schrieb Sage Weil:
> > On Thu, 16 Aug 2012, Tommi Virtanen wrote:
> >> On Thu, Aug 16, 2012 at 3:32 PM, Sage Weil  wrote:
> >>> As for the new options, I suggest:
> >>>
> >>>  * osd fs type
> >>>  * osd fs devs   (will work for mkcephfs, not for new stuff)
> >>>  * osd fs path
> >>>  * osd fs options
> >>
> >> What does osd_fs_path mean, and how is it different from the osd_data dir?
> > 
> > The idea was that you might wand the fs mounted somewhere other that 
> > osd_data.  I'm not sure it's useful; we may as well drop that...
> > 
> >> I'm expecting to need both mkfs-time options (btrfs metadata block
> >> size etc) and mount-time options (noatime etc).
> >>
> >> It would be nice if there was a way to set the options for all
> >> fstypes, and then just toggle which one is used (by default). That
> >> avoids bugs like trying to mkfs/mount btrfs with xfs-specific options,
> >> and vice versa.
> >>
> >> I'm not sure how well our config system will handle dynamic variable
> >> names -- ceph-authtool was fine with me just putting data in
> >> osd_crush_location, and we don't need to access these variables from
> >> C++, so it should be fine. If you really wanted to, you could probably
> >> cram the them into a single variable, with ad hoc structured data in
> >> the string value, but that's ugly.. Or just hardcode the list of
> >> possible filesystems, and then it's not dynamic variable names
> >> anymore.
> > 
> > Yeah, ceph-conf will happily take anything.  The C++ code has to do 
> > slightly more work to get arbitrary config fields, but that's not an 
> > issue.
> > 
> >> So I'm dreaming of something like:
> >>
> >> [osd]
> >> # what mount options will be passed when an osd data disk is using
> >> # one of these filesystems; these are passed to mount -o
> >> osd mount options btrfs = herp,foo=bar
> >> osd mount options xfs = noatime,derp
> >>
> >> # what mkfs options are used when creating new osd data disk
> >> # filesystems
> >> osd mkfs options btrfs = --hur
> >> osd mkfs options xfs = --dur
> >>
> >> # what fstype to use by default when mkfs'ing; mounting will detect
> >> # what's there (with blkid) and work with anything
> >> osd mkfs type = btrfs
> 
> I will prepare a patch with these for the current mkcephfs and init-ceph
> incl. aliases for the old keys and cmdline options where possible.

We ended up using slightly different option names for ceph-disk-prpare.  
Let's make them match!  They are:

 osd fs type = 
 osd fs mkfs arguments  = ...
 osd fs mount options  = ...

fstype would be ext4, xfs, btrfs.

Sorry, I lost track of this thread as well... :)

sage


> 
> >> # this may go away with "mkcephfs 2.0", and it will have to get more
> >> # complex if we provide something for journals too, etc, because you
> >> # may want to pair specific data disks to specific journals (DH has
> >> # this need).. haven't had time to think it through, which is why i'm
> >> # leaning toward "and here's a hook where you run something on the
> >> # host that calls ceph-disk-prepare etc on all the disks you want",
> >> # and using uuids to match journals to data disks -- this work has
> >> # not yet started)
> >> osd fs devs = /dev/sdb /dev/sdc
> > 
> > This all looks good to me.  What do you think, Danny?
> 
> This part (osd fs devs) is for a new mkcepfs.2.0 if I understand you
> correctly. Sounds okay for me atm. (Tommi: Are there any new information
> on this? Did you already start to work on this?)
> 
> Danny
> 
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majord...@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
> 
> 
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-11-02 Thread Danny Al-Gaaf
Hi Sage,

sorry for the late reply, was absent some weeks and busy with other issues.

Am 17.08.2012 01:40, schrieb Sage Weil:
> On Thu, 16 Aug 2012, Tommi Virtanen wrote:
>> On Thu, Aug 16, 2012 at 3:32 PM, Sage Weil  wrote:
>>> As for the new options, I suggest:
>>>
>>>  * osd fs type
>>>  * osd fs devs   (will work for mkcephfs, not for new stuff)
>>>  * osd fs path
>>>  * osd fs options
>>
>> What does osd_fs_path mean, and how is it different from the osd_data dir?
> 
> The idea was that you might wand the fs mounted somewhere other that 
> osd_data.  I'm not sure it's useful; we may as well drop that...
> 
>> I'm expecting to need both mkfs-time options (btrfs metadata block
>> size etc) and mount-time options (noatime etc).
>>
>> It would be nice if there was a way to set the options for all
>> fstypes, and then just toggle which one is used (by default). That
>> avoids bugs like trying to mkfs/mount btrfs with xfs-specific options,
>> and vice versa.
>>
>> I'm not sure how well our config system will handle dynamic variable
>> names -- ceph-authtool was fine with me just putting data in
>> osd_crush_location, and we don't need to access these variables from
>> C++, so it should be fine. If you really wanted to, you could probably
>> cram the them into a single variable, with ad hoc structured data in
>> the string value, but that's ugly.. Or just hardcode the list of
>> possible filesystems, and then it's not dynamic variable names
>> anymore.
> 
> Yeah, ceph-conf will happily take anything.  The C++ code has to do 
> slightly more work to get arbitrary config fields, but that's not an 
> issue.
> 
>> So I'm dreaming of something like:
>>
>> [osd]
>> # what mount options will be passed when an osd data disk is using
>> # one of these filesystems; these are passed to mount -o
>> osd mount options btrfs = herp,foo=bar
>> osd mount options xfs = noatime,derp
>>
>> # what mkfs options are used when creating new osd data disk
>> # filesystems
>> osd mkfs options btrfs = --hur
>> osd mkfs options xfs = --dur
>>
>> # what fstype to use by default when mkfs'ing; mounting will detect
>> # what's there (with blkid) and work with anything
>> osd mkfs type = btrfs

I will prepare a patch with these for the current mkcephfs and init-ceph
incl. aliases for the old keys and cmdline options where possible.

>> # this may go away with "mkcephfs 2.0", and it will have to get more
>> # complex if we provide something for journals too, etc, because you
>> # may want to pair specific data disks to specific journals (DH has
>> # this need).. haven't had time to think it through, which is why i'm
>> # leaning toward "and here's a hook where you run something on the
>> # host that calls ceph-disk-prepare etc on all the disks you want",
>> # and using uuids to match journals to data disks -- this work has
>> # not yet started)
>> osd fs devs = /dev/sdb /dev/sdc
> 
> This all looks good to me.  What do you think, Danny?

This part (osd fs devs) is for a new mkcepfs.2.0 if I understand you
correctly. Sounds okay for me atm. (Tommi: Are there any new information
on this? Did you already start to work on this?)

Danny

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-16 Thread Sage Weil
On Thu, 16 Aug 2012, Tommi Virtanen wrote:
> On Thu, Aug 16, 2012 at 3:32 PM, Sage Weil  wrote:
> > As for the new options, I suggest:
> >
> >  * osd fs type
> >  * osd fs devs   (will work for mkcephfs, not for new stuff)
> >  * osd fs path
> >  * osd fs options
> 
> What does osd_fs_path mean, and how is it different from the osd_data dir?

The idea was that you might wand the fs mounted somewhere other that 
osd_data.  I'm not sure it's useful; we may as well drop that...

> I'm expecting to need both mkfs-time options (btrfs metadata block
> size etc) and mount-time options (noatime etc).
> 
> It would be nice if there was a way to set the options for all
> fstypes, and then just toggle which one is used (by default). That
> avoids bugs like trying to mkfs/mount btrfs with xfs-specific options,
> and vice versa.
> 
> I'm not sure how well our config system will handle dynamic variable
> names -- ceph-authtool was fine with me just putting data in
> osd_crush_location, and we don't need to access these variables from
> C++, so it should be fine. If you really wanted to, you could probably
> cram the them into a single variable, with ad hoc structured data in
> the string value, but that's ugly.. Or just hardcode the list of
> possible filesystems, and then it's not dynamic variable names
> anymore.

Yeah, ceph-conf will happily take anything.  The C++ code has to do 
slightly more work to get arbitrary config fields, but that's not an 
issue.

> So I'm dreaming of something like:
> 
> [osd]
> # what mount options will be passed when an osd data disk is using
> # one of these filesystems; these are passed to mount -o
> osd mount options btrfs = herp,foo=bar
> osd mount options xfs = noatime,derp
> 
> # what mkfs options are used when creating new osd data disk
> # filesystems
> osd mkfs options btrfs = --hur
> osd mkfs options xfs = --dur
> 
> # what fstype to use by default when mkfs'ing; mounting will detect
> # what's there (with blkid) and work with anything
> osd mkfs type = btrfs
> 
> # this may go away with "mkcephfs 2.0", and it will have to get more
> # complex if we provide something for journals too, etc, because you
> # may want to pair specific data disks to specific journals (DH has
> # this need).. haven't had time to think it through, which is why i'm
> # leaning toward "and here's a hook where you run something on the
> # host that calls ceph-disk-prepare etc on all the disks you want",
> # and using uuids to match journals to data disks -- this work has
> # not yet started)
> osd fs devs = /dev/sdb /dev/sdc

This all looks good to me.  What do you think, Danny?

sage

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-16 Thread Tommi Virtanen
On Thu, Aug 16, 2012 at 3:52 PM, Tommi Virtanen  wrote:
> I'm not sure how well our config system will handle dynamic variable
> names -- ceph-authtool was fine with me just putting data in

*ceph-conf
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-16 Thread Tommi Virtanen
On Thu, Aug 16, 2012 at 3:32 PM, Sage Weil  wrote:
> As for the new options, I suggest:
>
>  * osd fs type
>  * osd fs devs   (will work for mkcephfs, not for new stuff)
>  * osd fs path
>  * osd fs options

What does osd_fs_path mean, and how is it different from the osd_data dir?

I'm expecting to need both mkfs-time options (btrfs metadata block
size etc) and mount-time options (noatime etc).

It would be nice if there was a way to set the options for all
fstypes, and then just toggle which one is used (by default). That
avoids bugs like trying to mkfs/mount btrfs with xfs-specific options,
and vice versa.

I'm not sure how well our config system will handle dynamic variable
names -- ceph-authtool was fine with me just putting data in
osd_crush_location, and we don't need to access these variables from
C++, so it should be fine. If you really wanted to, you could probably
cram the them into a single variable, with ad hoc structured data in
the string value, but that's ugly.. Or just hardcode the list of
possible filesystems, and then it's not dynamic variable names
anymore.

So I'm dreaming of something like:

[osd]
# what mount options will be passed when an osd data disk is using
# one of these filesystems; these are passed to mount -o
osd mount options btrfs = herp,foo=bar
osd mount options xfs = noatime,derp

# what mkfs options are used when creating new osd data disk
# filesystems
osd mkfs options btrfs = --hur
osd mkfs options xfs = --dur

# what fstype to use by default when mkfs'ing; mounting will detect
# what's there (with blkid) and work with anything
osd mkfs type = btrfs

# this may go away with "mkcephfs 2.0", and it will have to get more
# complex if we provide something for journals too, etc, because you
# may want to pair specific data disks to specific journals (DH has
# this need).. haven't had time to think it through, which is why i'm
# leaning toward "and here's a hook where you run something on the
# host that calls ceph-disk-prepare etc on all the disks you want",
# and using uuids to match journals to data disks -- this work has
# not yet started)
osd fs devs = /dev/sdb /dev/sdc
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-16 Thread Sage Weil
On Fri, 10 Aug 2012, Sage Weil wrote:
> On Fri, 10 Aug 2012, Danny Kukawka wrote:
> > Am 10.08.2012 17:54, schrieb Sage Weil:
> > > On Thu, 9 Aug 2012, Danny Kukawka wrote:
> > >> Remove btrfs specific keys and replace them by more generic
> > >> keys to be able to replace btrfs with e.g. xfs or ext4 easily.
> > >>
> > >> Add new key to define the osd fs type: 'fstype', which can get
> > >> defined in the [osd] section for all OSDs.
> > >>
> > >> Replace:
> > >> - 'btrfs devs' -> 'devs'
> > >> - 'btrfs path' -> 'fs path'
> > >> - 'btrfs options' -> 'fs options'
> > >> - mkcephfs: replace --mkbtrfs with --mkfs
> > >> - init-ceph: replace --btrfs with --fsmount, --nobtrfs
> > >> with --nofsmount, --btrfsumount with --fsumount
> > >>
> > >> Update documentation, manpage and example config files.
> > > 
> > > Maybe this should keep the old options as well, so that --mkbtrfs is an 
> > > alias for --mkfs --btrfs...
> > 
> > I can add this to the patch, no problem!
> > 
> > > Tommi, is this kind of invocation compatible with your notion of what 
> > > mkcephfs 2.0 should be?  If we can jump to the target interface and 
> > > rewrite the implementation in terms of the new tools that would capture 
> > > the best of both worlds.
> > 
> > If you can point me to some documentation how the target interface
> > works, I could take a look at adapting mkcephfs to the new tools as soon
> > as the new interface/workflow is ready.
> 
> It's not defined yet.  I know Tommi has something in his head, but I'm not 
> sure if it's something similar to the current one (a cluster-wide 
> ceph.conf) or something else.  He's off today, so we probably have to wait 
> to find out.

Okay, I just synced up with Tommi.  The goal is for the new 'mkcephfs' 
functionality to generalize to both cluster setup and expansion, and to 
avoid the pain associated with explicitly defining osd ids, paths, and 
devices for each osd.  (IDs will by dynamic, hotplugging will be possible, 
etc.).  The result will be the ability to define a file specifying 
information like

 - hosts that will be monitors, osds, and/or mons
 - config options to put in the ceph.conf, or config file fragment

To instantiate osds, you could either:

 - explicitly run ceph-disk-prepare on the appropriate nodes, devices 
   (specifying fs type etc)
 - plug in a properly tagged disk that will be magically consumed
 - possibly provide a list of devices/paths in the file above and let the 
   mkcephfs replacement run it.

Point being, it won't be a drop-in replacement for mkcephfs (with a 
ceph.conf as input) because the current approach makes you allocate osd 
ids and such, and the new approach won't require that.

That being the case, I'm all for generalizing the current mkcephfs to 
other file systems as an interim step.  I think the requirements there are 
just that the old options continue to work, and pick good names.  That is,

 * conf: make 'btrfs ...' options alias to new options
 * mkcephfs: make --mkbtrfs an alias for '--mkfs' (or --mkosdfs?)
 * init-ceph: make the --btrfs alias --fsmount, etc.

As for the new options, I suggest:

 * osd fs type
 * osd fs devs   (will work for mkcephfs, not for new stuff)
 * osd fs path
 * osd fs options

The new stuff can definitely use 'osd fs type' and 'osd fs option' (e.g., 
in [osd] section).  It's less clear whether the others will remain useful, 
since they are generally tied to specific osd instances, which live in 
[osd.$id] sections, which the new stuff is won't require of you.

My hope is for the both methods to be present in bobtail, and mkcephfs to 
be deprecated by cuttlefish.  Does that sound reasonable?

Thanks!
sage
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-10 Thread Mandell Degerness
Comment below in-line:

On Fri, Aug 10, 2012 at 9:12 AM, Sage Weil  wrote:
> On Fri, 10 Aug 2012, Danny Kukawka wrote:
>> Am 10.08.2012 17:54, schrieb Sage Weil:
>> > On Thu, 9 Aug 2012, Danny Kukawka wrote:
>> >> Remove btrfs specific keys and replace them by more generic
>> >> keys to be able to replace btrfs with e.g. xfs or ext4 easily.
>> >>
>> >> Add new key to define the osd fs type: 'fstype', which can get
>> >> defined in the [osd] section for all OSDs.
>> >>
>> >> Replace:
>> >> - 'btrfs devs' -> 'devs'
>> >> - 'btrfs path' -> 'fs path'
>> >> - 'btrfs options' -> 'fs options'
>> >> - mkcephfs: replace --mkbtrfs with --mkfs

I'm just a little concerned about this as --mkfs in other parts of the
project refers to the on file system changes to prepare the directory
structure for use by the daemons.  I actually prefer the usage here, I
only want to point out the confusing CLI parameter collision.

>> >> - init-ceph: replace --btrfs with --fsmount, --nobtrfs
>> >> with --nofsmount, --btrfsumount with --fsumount
>> >>
>> >> Update documentation, manpage and example config files.
>> >
>> > Maybe this should keep the old options as well, so that --mkbtrfs is an
>> > alias for --mkfs --btrfs...
>>
>> I can add this to the patch, no problem!
>>
>> > Tommi, is this kind of invocation compatible with your notion of what
>> > mkcephfs 2.0 should be?  If we can jump to the target interface and
>> > rewrite the implementation in terms of the new tools that would capture
>> > the best of both worlds.
>>
>> If you can point me to some documentation how the target interface
>> works, I could take a look at adapting mkcephfs to the new tools as soon
>> as the new interface/workflow is ready.
>
> It's not defined yet.  I know Tommi has something in his head, but I'm not
> sure if it's something similar to the current one (a cluster-wide
> ceph.conf) or something else.  He's off today, so we probably have to wait
> to find out.
>
> sage
> --
> To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
> the body of a message to majord...@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-10 Thread Sage Weil
On Fri, 10 Aug 2012, Danny Kukawka wrote:
> Am 10.08.2012 17:54, schrieb Sage Weil:
> > On Thu, 9 Aug 2012, Danny Kukawka wrote:
> >> Remove btrfs specific keys and replace them by more generic
> >> keys to be able to replace btrfs with e.g. xfs or ext4 easily.
> >>
> >> Add new key to define the osd fs type: 'fstype', which can get
> >> defined in the [osd] section for all OSDs.
> >>
> >> Replace:
> >> - 'btrfs devs' -> 'devs'
> >> - 'btrfs path' -> 'fs path'
> >> - 'btrfs options' -> 'fs options'
> >> - mkcephfs: replace --mkbtrfs with --mkfs
> >> - init-ceph: replace --btrfs with --fsmount, --nobtrfs
> >> with --nofsmount, --btrfsumount with --fsumount
> >>
> >> Update documentation, manpage and example config files.
> > 
> > Maybe this should keep the old options as well, so that --mkbtrfs is an 
> > alias for --mkfs --btrfs...
> 
> I can add this to the patch, no problem!
> 
> > Tommi, is this kind of invocation compatible with your notion of what 
> > mkcephfs 2.0 should be?  If we can jump to the target interface and 
> > rewrite the implementation in terms of the new tools that would capture 
> > the best of both worlds.
> 
> If you can point me to some documentation how the target interface
> works, I could take a look at adapting mkcephfs to the new tools as soon
> as the new interface/workflow is ready.

It's not defined yet.  I know Tommi has something in his head, but I'm not 
sure if it's something similar to the current one (a cluster-wide 
ceph.conf) or something else.  He's off today, so we probably have to wait 
to find out.

sage
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-10 Thread Danny Kukawka
Am 10.08.2012 17:54, schrieb Sage Weil:
> On Thu, 9 Aug 2012, Danny Kukawka wrote:
>> Remove btrfs specific keys and replace them by more generic
>> keys to be able to replace btrfs with e.g. xfs or ext4 easily.
>>
>> Add new key to define the osd fs type: 'fstype', which can get
>> defined in the [osd] section for all OSDs.
>>
>> Replace:
>> - 'btrfs devs' -> 'devs'
>> - 'btrfs path' -> 'fs path'
>> - 'btrfs options' -> 'fs options'
>> - mkcephfs: replace --mkbtrfs with --mkfs
>> - init-ceph: replace --btrfs with --fsmount, --nobtrfs
>> with --nofsmount, --btrfsumount with --fsumount
>>
>> Update documentation, manpage and example config files.
> 
> Maybe this should keep the old options as well, so that --mkbtrfs is an 
> alias for --mkfs --btrfs...

I can add this to the patch, no problem!

> Tommi, is this kind of invocation compatible with your notion of what 
> mkcephfs 2.0 should be?  If we can jump to the target interface and 
> rewrite the implementation in terms of the new tools that would capture 
> the best of both worlds.

If you can point me to some documentation how the target interface
works, I could take a look at adapting mkcephfs to the new tools as soon
as the new interface/workflow is ready.

Danny



signature.asc
Description: OpenPGP digital signature


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-10 Thread Sage Weil
On Thu, 9 Aug 2012, Danny Kukawka wrote:
> Remove btrfs specific keys and replace them by more generic
> keys to be able to replace btrfs with e.g. xfs or ext4 easily.
> 
> Add new key to define the osd fs type: 'fstype', which can get
> defined in the [osd] section for all OSDs.
> 
> Replace:
> - 'btrfs devs' -> 'devs'
> - 'btrfs path' -> 'fs path'
> - 'btrfs options' -> 'fs options'
> - mkcephfs: replace --mkbtrfs with --mkfs
> - init-ceph: replace --btrfs with --fsmount, --nobtrfs
> with --nofsmount, --btrfsumount with --fsumount
> 
> Update documentation, manpage and example config files.

Maybe this should keep the old options as well, so that --mkbtrfs is an 
alias for --mkfs --btrfs...

Tommi, is this kind of invocation compatible with your notion of what 
mkcephfs 2.0 should be?  If we can jump to the target interface and 
rewrite the implementation in terms of the new tools that would capture 
the best of both worlds.

Thanks!
sage


> 
> Signed-off-by: Danny Kukawka 
> ---
>  doc/man/8/mkcephfs.rst  |   17 +++-
>  man/mkcephfs.8  |   15 +++
>  src/ceph.conf.twoosds   |7 ++--
>  src/init-ceph.in|   50 +-
>  src/mkcephfs.in |   60 
> +--
>  src/sample.ceph.conf|   15 ---
>  src/test/cli/osdmaptool/ceph.conf.withracks |3 +-
>  7 Dateien ge?ndert, 95 Zeilen hinzugef?gt(+), 72 Zeilen entfernt(-)
> 
> diff --git a/doc/man/8/mkcephfs.rst b/doc/man/8/mkcephfs.rst
> index ddc378a..dd3fbd5 100644
> --- a/doc/man/8/mkcephfs.rst
> +++ b/doc/man/8/mkcephfs.rst
> @@ -70,20 +70,15 @@ Options
> default is ``/etc/ceph/keyring`` (or whatever is specified in the
> config file).
>  
> -.. option:: --mkbtrfs
> +.. option:: --mkfs
>  
> -   Create and mount the any btrfs file systems specified in the
> -   ceph.conf for OSD data storage using mkfs.btrfs. The "btrfs devs"
> -   and (if it differs from "osd data") "btrfs path" options must be
> -   defined.
> +   Create and mount any file system specified in the ceph.conf for 
> +   OSD data storage using mkfs. The "devs" and (if it differs from 
> +   "osd data") "fs path" options must be defined.
>  
> **NOTE** Btrfs is still considered experimental.  This option
> -   can ease some configuration pain, but is the use of btrfs is not
> -   required when ``osd data`` directories are mounted manually by the
> -   adminstrator.
> -
> -   **NOTE** This option is deprecated and will be removed in a future
> -   release.
> +   can ease some configuration pain, but is not required when 
> +   ``osd data`` directories are mounted manually by the adminstrator.
>  
>  .. option:: --no-copy-conf
>  
> diff --git a/man/mkcephfs.8 b/man/mkcephfs.8
> index 8544a01..22a5335 100644
> --- a/man/mkcephfs.8
> +++ b/man/mkcephfs.8
> @@ -32,7 +32,7 @@ level margin: \\n[rst2man-indent\\n[rst2man-indent-level]]
>  .
>  .SH SYNOPSIS
>  .nf
> -\fBmkcephfs\fP [ \-c \fIceph.conf\fP ] [ \-\-mkbtrfs ] [ \-a, \-\-all\-hosts 
> [ \-k
> +\fBmkcephfs\fP [ \-c \fIceph.conf\fP ] [ \-\-mkfs ] [ \-a, \-\-all\-hosts [ 
> \-k
>  \fI/path/to/admin.keyring\fP ] ]
>  .fi
>  .sp
> @@ -111,19 +111,16 @@ config file).
>  .UNINDENT
>  .INDENT 0.0
>  .TP
> -.B \-\-mkbtrfs
> -Create and mount the any btrfs file systems specified in the
> -ceph.conf for OSD data storage using mkfs.btrfs. The "btrfs devs"
> -and (if it differs from "osd data") "btrfs path" options must be
> +.B \-\-mkfs
> +Create and mount any file systems specified in the
> +ceph.conf for OSD data storage using mkfs.*. The "devs"
> +and (if it differs from "osd data") "fs path" options must be
>  defined.
>  .sp
>  \fBNOTE\fP Btrfs is still considered experimental.  This option
> -can ease some configuration pain, but is the use of btrfs is not
> +can ease some configuration pain, but the use of this option is not
>  required when \fBosd data\fP directories are mounted manually by the
>  adminstrator.
> -.sp
> -\fBNOTE\fP This option is deprecated and will be removed in a future
> -release.
>  .UNINDENT
>  .INDENT 0.0
>  .TP
> diff --git a/src/ceph.conf.twoosds b/src/ceph.conf.twoosds
> index c0cfc68..05ca754 100644
> --- a/src/ceph.conf.twoosds
> +++ b/src/ceph.conf.twoosds
> @@ -67,7 +67,8 @@
>   debug journal = 20
>   log dir = /data/cosd$id
>   osd data = /mnt/osd$id
> - btrfs options = "flushoncommit,usertrans"
> + fs options = "flushoncommit,usertrans"
> + fstype = btrfs
>  ;user = root
>  
>  ;osd journal = /mnt/osd$id/journal
> @@ -75,8 +76,8 @@
>   osd journal = "/dev/disk/by-path/pci-:05:02.0-scsi-6:0:0:0"
>  ;filestore max sync interval = 1
>  
> - btrfs devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0"
> -;btrfs devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0 \
> + devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0"
> +;devs = "/dev/disk/by

Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-09 Thread Xiaopong Tran

On 08/10/2012 09:53 AM, Sage Weil wrote:

On Thu, 9 Aug 2012, Tommi Virtanen wrote:

On Thu, Aug 9, 2012 at 10:03 AM, Danny Kukawka  wrote:

So you mean chef?! Will there be an alternative to simply setup a
cluster from console?

We (SUSE) are already working on an own chef ceph cookbook. But from
what I've seen till now it's really hard and more laborious to initially
setup a cluster with chef than with mkcephfs.


I've written about this on the mailing list several times. We see a
lot of demand for Chef, but don't want to tie our hands -- Canonical
is working on Juju Charms, and I would like to see a mkcephfs
replacement that relies on just SSH connections from a workstation
node. We've made an explicit effort to improve the product as a whole,
and to make the Chef cookbook as thin as possible.

For some reason, the threading is broken on the archive, but this is a
fragment of the most recent thread that talked about this:

http://thread.gmane.org/gmane.comp.file-systems.ceph.devel/8263/focus=8265


I think the real question is whether the planned "mkcephfs 2.0" is going
to capture the equivalent functionality of being able to enumerate up
front which osds will exist and which disks/journals they will use, and to
bring them up.  Assuming it will (IMHO it needs to), can we make that
compatible with the current way that mkcephfs is invoked (i.e., a
ceph.conf file and a few command line args)?  Not everyone (and I daresay
probably a minority) of users will be using Chef/Juju/Puppet/whatever,
regardless of whether or not we feel that is the right way to do things
and try to push them in that direction.

In any case, since the new osd hotplugging stuff isn't available now and
is probably still a sprint or two off, I think we should consider applying
this patch.  We aren't recommending btrfs across the board, and cluster
bringup is currently painful on ext4/xfs/etc.  And, Danny already did the
work.  :)

sage
--


+1.

Rgds,

Xiaopong


--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-09 Thread Sage Weil
On Thu, 9 Aug 2012, Tommi Virtanen wrote:
> On Thu, Aug 9, 2012 at 10:03 AM, Danny Kukawka  
> wrote:
> > So you mean chef?! Will there be an alternative to simply setup a
> > cluster from console?
> >
> > We (SUSE) are already working on an own chef ceph cookbook. But from
> > what I've seen till now it's really hard and more laborious to initially
> > setup a cluster with chef than with mkcephfs.
> 
> I've written about this on the mailing list several times. We see a
> lot of demand for Chef, but don't want to tie our hands -- Canonical
> is working on Juju Charms, and I would like to see a mkcephfs
> replacement that relies on just SSH connections from a workstation
> node. We've made an explicit effort to improve the product as a whole,
> and to make the Chef cookbook as thin as possible.
> 
> For some reason, the threading is broken on the archive, but this is a
> fragment of the most recent thread that talked about this:
> 
> http://thread.gmane.org/gmane.comp.file-systems.ceph.devel/8263/focus=8265

I think the real question is whether the planned "mkcephfs 2.0" is going 
to capture the equivalent functionality of being able to enumerate up 
front which osds will exist and which disks/journals they will use, and to 
bring them up.  Assuming it will (IMHO it needs to), can we make that 
compatible with the current way that mkcephfs is invoked (i.e., a 
ceph.conf file and a few command line args)?  Not everyone (and I daresay 
probably a minority) of users will be using Chef/Juju/Puppet/whatever, 
regardless of whether or not we feel that is the right way to do things 
and try to push them in that direction.

In any case, since the new osd hotplugging stuff isn't available now and 
is probably still a sprint or two off, I think we should consider applying 
this patch.  We aren't recommending btrfs across the board, and cluster 
bringup is currently painful on ext4/xfs/etc.  And, Danny already did the 
work.  :)

sage
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-09 Thread Tommi Virtanen
On Thu, Aug 9, 2012 at 10:03 AM, Danny Kukawka  wrote:
> So you mean chef?! Will there be an alternative to simply setup a
> cluster from console?
>
> We (SUSE) are already working on an own chef ceph cookbook. But from
> what I've seen till now it's really hard and more laborious to initially
> setup a cluster with chef than with mkcephfs.

I've written about this on the mailing list several times. We see a
lot of demand for Chef, but don't want to tie our hands -- Canonical
is working on Juju Charms, and I would like to see a mkcephfs
replacement that relies on just SSH connections from a workstation
node. We've made an explicit effort to improve the product as a whole,
and to make the Chef cookbook as thin as possible.

For some reason, the threading is broken on the archive, but this is a
fragment of the most recent thread that talked about this:

http://thread.gmane.org/gmane.comp.file-systems.ceph.devel/8263/focus=8265
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-09 Thread Danny Kukawka
Am 09.08.2012 18:53, schrieb Tommi Virtanen:
> On Thu, Aug 9, 2012 at 9:46 AM, Jim Schutt  wrote:
>> I'm embarrassed to admit I haven't been keeping up with this,
>> but I seem to recall that early versions didn't handle a
>> journal on a partition.  Did I get that wrong, or maybe that
>> capability exists now?  In the past I've found it to have a
>> small performance benefit, and would hate to lose it.
> 
> It's still not quite ready for prime time. Journal placement was cut
> out to allow us to focus on the distributed aspects of it first; those
> are now working pretty well. Putting much effort on the alternative is
> probably not a good bet.
> 
> This ticket needs to get fixed (and defined better, first!) to have
> more flexibility with journals:
> http://tracker.newdream.net/issues/2398

So you mean chef?! Will there be an alternative to simply setup a
cluster from console?

We (SUSE) are already working on an own chef ceph cookbook. But from
what I've seen till now it's really hard and more laborious to initially
setup a cluster with chef than with mkcephfs.

Danny



signature.asc
Description: OpenPGP digital signature


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-09 Thread Tommi Virtanen
On Thu, Aug 9, 2012 at 9:49 AM, Danny Kukawka  wrote:
> And where can I find this new "OSD hotplugging" style init? Is there any
> documentation?

mkcephfs does not use the new style. The Chef cookbooks we have do. If
you use the Juju Charms that Canonical has been working on, those
should also use the new way now.

Here's the installation documents using Chef:
http://ceph.com/docs/master/install/chef/
http://ceph.com/docs/master/config-cluster/chef/
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-09 Thread Tommi Virtanen
On Thu, Aug 9, 2012 at 9:46 AM, Jim Schutt  wrote:
> I'm embarrassed to admit I haven't been keeping up with this,
> but I seem to recall that early versions didn't handle a
> journal on a partition.  Did I get that wrong, or maybe that
> capability exists now?  In the past I've found it to have a
> small performance benefit, and would hate to lose it.

It's still not quite ready for prime time. Journal placement was cut
out to allow us to focus on the distributed aspects of it first; those
are now working pretty well. Putting much effort on the alternative is
probably not a good bet.

This ticket needs to get fixed (and defined better, first!) to have
more flexibility with journals:
http://tracker.newdream.net/issues/2398
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-09 Thread Danny Kukawka
Am 09.08.2012 18:26, schrieb Tommi Virtanen:
> On Thu, Aug 9, 2012 at 8:42 AM, Danny Kukawka  wrote:
>> Remove btrfs specific keys and replace them by more generic
>> keys to be able to replace btrfs with e.g. xfs or ext4 easily.
>>
>> Add new key to define the osd fs type: 'fstype', which can get
>> defined in the [osd] section for all OSDs.
> 
> I'm going to say let's not do this. In fact, I've wanted to remove the
> "btrfs devs" option for a while now, as it keeps leading people down
> the wrong path:
> https://github.com/ceph/ceph/commits/kill-btrfs-devs
> 
> mkcephfs is not a viable route forward. For example, it is unable to
> expand a pre-existing cluster.
> 
> The new "OSD hotplugging" style init is much, much nicer. And does
> more than just mkfs & mount.

And where can I find this new "OSD hotplugging" style init? Is there any
documentation?

Danny




signature.asc
Description: OpenPGP digital signature


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-09 Thread Jim Schutt

On 08/09/2012 10:26 AM, Tommi Virtanen wrote:

mkcephfs is not a viable route forward. For example, it is unable to
expand a pre-existing cluster.

The new "OSD hotplugging" style init is much, much nicer. And does
more than just mkfs&  mount.


I'm embarrassed to admit I haven't been keeping up with this,
but I seem to recall that early versions didn't handle a
journal on a partition.  Did I get that wrong, or maybe that
capability exists now?  In the past I've found it to have a
small performance benefit, and would hate to lose it.

Thanks -- Jim

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: [PATCH] make mkcephfs and init-ceph osd filesystem handling more flexible

2012-08-09 Thread Tommi Virtanen
On Thu, Aug 9, 2012 at 8:42 AM, Danny Kukawka  wrote:
> Remove btrfs specific keys and replace them by more generic
> keys to be able to replace btrfs with e.g. xfs or ext4 easily.
>
> Add new key to define the osd fs type: 'fstype', which can get
> defined in the [osd] section for all OSDs.

I'm going to say let's not do this. In fact, I've wanted to remove the
"btrfs devs" option for a while now, as it keeps leading people down
the wrong path:
https://github.com/ceph/ceph/commits/kill-btrfs-devs

mkcephfs is not a viable route forward. For example, it is unable to
expand a pre-existing cluster.

The new "OSD hotplugging" style init is much, much nicer. And does
more than just mkfs & mount.

>
> Replace:
> - 'btrfs devs' -> 'devs'
> - 'btrfs path' -> 'fs path'
> - 'btrfs options' -> 'fs options'
> - mkcephfs: replace --mkbtrfs with --mkfs
> - init-ceph: replace --btrfs with --fsmount, --nobtrfs
> with --nofsmount, --btrfsumount with --fsumount
>
> Update documentation, manpage and example config files.
>
> Signed-off-by: Danny Kukawka 
> ---
>  doc/man/8/mkcephfs.rst  |   17 +++-
>  man/mkcephfs.8  |   15 +++
>  src/ceph.conf.twoosds   |7 ++--
>  src/init-ceph.in|   50 +-
>  src/mkcephfs.in |   60 
> +--
>  src/sample.ceph.conf|   15 ---
>  src/test/cli/osdmaptool/ceph.conf.withracks |3 +-
>  7 Dateien geändert, 95 Zeilen hinzugefügt(+), 72 Zeilen entfernt(-)
>
> diff --git a/doc/man/8/mkcephfs.rst b/doc/man/8/mkcephfs.rst
> index ddc378a..dd3fbd5 100644
> --- a/doc/man/8/mkcephfs.rst
> +++ b/doc/man/8/mkcephfs.rst
> @@ -70,20 +70,15 @@ Options
> default is ``/etc/ceph/keyring`` (or whatever is specified in the
> config file).
>
> -.. option:: --mkbtrfs
> +.. option:: --mkfs
>
> -   Create and mount the any btrfs file systems specified in the
> -   ceph.conf for OSD data storage using mkfs.btrfs. The "btrfs devs"
> -   and (if it differs from "osd data") "btrfs path" options must be
> -   defined.
> +   Create and mount any file system specified in the ceph.conf for
> +   OSD data storage using mkfs. The "devs" and (if it differs from
> +   "osd data") "fs path" options must be defined.
>
> **NOTE** Btrfs is still considered experimental.  This option
> -   can ease some configuration pain, but is the use of btrfs is not
> -   required when ``osd data`` directories are mounted manually by the
> -   adminstrator.
> -
> -   **NOTE** This option is deprecated and will be removed in a future
> -   release.
> +   can ease some configuration pain, but is not required when
> +   ``osd data`` directories are mounted manually by the adminstrator.
>
>  .. option:: --no-copy-conf
>
> diff --git a/man/mkcephfs.8 b/man/mkcephfs.8
> index 8544a01..22a5335 100644
> --- a/man/mkcephfs.8
> +++ b/man/mkcephfs.8
> @@ -32,7 +32,7 @@ level margin: \\n[rst2man-indent\\n[rst2man-indent-level]]
>  .
>  .SH SYNOPSIS
>  .nf
> -\fBmkcephfs\fP [ \-c \fIceph.conf\fP ] [ \-\-mkbtrfs ] [ \-a, \-\-all\-hosts 
> [ \-k
> +\fBmkcephfs\fP [ \-c \fIceph.conf\fP ] [ \-\-mkfs ] [ \-a, \-\-all\-hosts [ 
> \-k
>  \fI/path/to/admin.keyring\fP ] ]
>  .fi
>  .sp
> @@ -111,19 +111,16 @@ config file).
>  .UNINDENT
>  .INDENT 0.0
>  .TP
> -.B \-\-mkbtrfs
> -Create and mount the any btrfs file systems specified in the
> -ceph.conf for OSD data storage using mkfs.btrfs. The "btrfs devs"
> -and (if it differs from "osd data") "btrfs path" options must be
> +.B \-\-mkfs
> +Create and mount any file systems specified in the
> +ceph.conf for OSD data storage using mkfs.*. The "devs"
> +and (if it differs from "osd data") "fs path" options must be
>  defined.
>  .sp
>  \fBNOTE\fP Btrfs is still considered experimental.  This option
> -can ease some configuration pain, but is the use of btrfs is not
> +can ease some configuration pain, but the use of this option is not
>  required when \fBosd data\fP directories are mounted manually by the
>  adminstrator.
> -.sp
> -\fBNOTE\fP This option is deprecated and will be removed in a future
> -release.
>  .UNINDENT
>  .INDENT 0.0
>  .TP
> diff --git a/src/ceph.conf.twoosds b/src/ceph.conf.twoosds
> index c0cfc68..05ca754 100644
> --- a/src/ceph.conf.twoosds
> +++ b/src/ceph.conf.twoosds
> @@ -67,7 +67,8 @@
> debug journal = 20
> log dir = /data/cosd$id
> osd data = /mnt/osd$id
> -   btrfs options = "flushoncommit,usertrans"
> +   fs options = "flushoncommit,usertrans"
> +   fstype = btrfs
>  ;  user = root
>
>  ;  osd journal = /mnt/osd$id/journal
> @@ -75,8 +76,8 @@
> osd journal = "/dev/disk/by-path/pci-:05:02.0-scsi-6:0:0:0"
>  ;  filestore max sync interval = 1
>
> -   btrfs devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0"
> -;  btrfs devs = "/dev/disk/by-path/pci-:05:01.0-scsi-2:0:0:0 \
> +   devs = "/dev/dis