Error 5 when trying to mount Ceph 0.47.1

2012-05-25 Thread Nam Dang
Hi, I've been working with Ceph for a couple of weeks. At the moment, I'm trying to set up a small cluster with 1 monitor, 1 MDS and 6 OSDs. However, I cannot mount ceph to the system no matter which node I'm executing the mounting command on. My nodes run Ubuntu 11.10 with kernel

Re: ceph rbd crashes/stalls while random write 4k blocks

2012-05-25 Thread Stefan Priebe - Profihost AG
On 24.05.2012 16:19, Florian Haas wrote: On Thu, May 24, 2012 at 4:09 PM, Stefan Priebe - Profihost AG s.pri...@profihost.ag wrote: Take a look at these to see if anything looks familiar: http://oss.sgi.com/bugzilla/show_bug.cgi?id=922

Re: ceph rbd crashes/stalls while random write 4k blocks

2012-05-25 Thread Florian Haas
On Fri, May 25, 2012 at 8:47 AM, Stefan Priebe - Profihost AG s.pri...@profihost.ag wrote: On 24.05.2012 16:19, Florian Haas wrote: On Thu, May 24, 2012 at 4:09 PM, Stefan Priebe - Profihost AG s.pri...@profihost.ag wrote: Take a look at these to see if anything looks familiar:

Re: ceph rbd crashes/stalls while random write 4k blocks

2012-05-25 Thread Stefan Priebe - Profihost AG
On 25.05.2012 09:33, Florian Haas wrote: On Fri, May 25, 2012 at 8:47 AM, Stefan Priebe - Profihost AG s.pri...@profihost.ag wrote: On 24.05.2012 16:19, Florian Haas wrote: On Thu, May 24, 2012 at 4:09 PM, Stefan Priebe - Profihost AG s.pri...@profihost.ag wrote: Take a look at these to

Re: poor OSD performance using kernel 3.4

2012-05-25 Thread Stefan Priebe - Profihost AG
On 25.05.2012 03:53, Mark Nelson wrote: On 05/24/2012 02:05 PM, Stefan Priebe wrote: 3.2.18 works fine too. Honestly I don't know if tmpfs journal with dio = false would lead to that kind of behavior. Anything interesting in the logs if you turn debugging up? Just stuff like this. But

ceph: Check PagePrivate(page) before dereference, page-private

2012-05-25 Thread Yan, Zheng
I got lots of NULL pointer dereference Oopses when compiling the kernel on ceph. The bug occurs because the kernel page migration routine replaces some pages in the page cache with new pages, and these new pages' private can be non-zero. Signed-off-by: Zheng Yan zheng.z@intel.com --- fs/ceph/addr.c |5

Re: 'rbd map' asynchronous behavior

2012-05-25 Thread Andrey Korolyov
Hi, The newer kernel rbd driver throws quite strange messages on map|unmap compared to the 3.2 branch: rbd map 'path' # device appears as /dev/rbd1 instead of rbd0, then rbd unmap /dev/rbd1 # causes following trace, w/ vanilla 3.4.0 from kernel.org: [ 99.700802] BUG: scheduling while atomic:

Re: poor OSD performance using kernel 3.4

2012-05-25 Thread Stefan Priebe - Profihost AG
Some speed tests with different kernel versions. The same applies to other FS like btrfs. I used rados -p data bench 100 write -t 16 for all tests and a freshly created FS. Mount options were always: noatime,nodiratime,nobarrier. 3.0.30 with XFS: speed is always between 120 and 160 MB/s Total

Re: poor OSD performance using kernel 3.4

2012-05-25 Thread Stefan Priebe - Profihost AG
Even with v3.3-rc1 it is pretty often 0. On 25.05.2012 13:31, Stefan Priebe - Profihost AG wrote: Some speed tests with different kernel versions. The same applies to other FS like btrfs. I used rados -p data bench 100 write -t 16 for all tests and a freshly created FS. Mount options were

Problem building RPM from spec file

2012-05-25 Thread Guido Winkelmann
I keep having problems getting ceph built as an RPM. Now the install phase fails with these messages: /usr/bin/install -c -m 644 ceph-osd.8 ceph-mds.8 ceph-mon.8 mkcephfs.8 ceph-fuse.8 ceph-syn.8 crushtool.8 osdmaptool.8 monmaptool.8 ceph-conf.8 ceph-run.8 ceph.8 mount.ceph.8 radosgw.8

Re: RBD format changes and layering

2012-05-25 Thread Alex Elder
On 05/24/2012 06:05 PM, Josh Durgin wrote: RBD object format changes = In this message I respond only to the first part of your message. I'll look at the layering stuff separately. To enable us to add more features to rbd, including copy-on-write cloning via layering,

Re: poor OSD performance using kernel 3.4

2012-05-25 Thread Alexandre DERUMIER
Hi Stefan, do you get the same performance with reads? Have you run iostat? How long does it take to flush from the journal to the disks? - Original message - From: Stefan Priebe - Profihost AG s.pri...@profihost.ag To: Mark Nelson mark.nel...@inktank.com Cc: ceph-devel@vger.kernel.org Sent:

Re: ceph: Check PagePrivate(page) before dereference, page-private

2012-05-25 Thread Sage Weil
Hi Yan, On Fri, 25 May 2012, Yan, Zheng wrote: I got lots of NULL pointer dereference Oopses when compiling the kernel on ceph. The bug occurs because the kernel page migration routine replaces some pages in the page cache with new pages, and these new pages' private can be non-zero. Signed-off-by: Zheng

Re: Problem building RPM from spec file

2012-05-25 Thread Sage Weil
On Fri, 25 May 2012, Guido Winkelmann wrote: I keep having problems getting ceph built as an RPM. Now the install phase fails with these messages: /usr/bin/install -c -m 644 ceph-osd.8 ceph-mds.8 ceph-mon.8 mkcephfs.8 ceph-fuse.8 ceph-syn.8 crushtool.8 osdmaptool.8 monmaptool.8

Re: Error 5 when trying to mount Ceph 0.47.1

2012-05-25 Thread Greg Farnum
On Thursday, May 24, 2012 at 10:58 PM, Nam Dang wrote: Hi, I've been working with Ceph for a couple of weeks. At the moment, I'm trying to set up a small cluster with 1 monitor, 1 MDS and 6 OSDs. However, I cannot mount ceph to the system no matter which node I'm executing the

Re: RBD format changes and layering

2012-05-25 Thread Josh Durgin
On 05/24/2012 04:39 PM, Yehuda Sadeh wrote: On Thu, May 24, 2012 at 4:05 PM, Josh Durgin josh.dur...@inktank.com wrote: RBD object format changes = To enable us to add more features to rbd, including copy-on-write cloning via layering, we need to change to rbd header

Re: 'rbd map' asynchronous behavior

2012-05-25 Thread Greg Farnum
That looks like a bug that isn't familiar to Josh or me. Can you create a report in the tracker and provide as much debug info as you can come up with? :) On Friday, May 25, 2012 at 3:15 AM, Andrey Korolyov wrote: Hi, The newer kernel rbd driver throws quite strange messages on map|unmap,

Re: RBD format changes and layering

2012-05-25 Thread Josh Durgin
On 05/25/2012 07:57 AM, Alex Elder wrote: On 05/24/2012 06:05 PM, Josh Durgin wrote: RBD object format changes = In this message I respond only to the first part of your message. I'll look at the layering stuff separately. To enable us to add more features to rbd,

Re: RBD format changes and layering

2012-05-25 Thread Greg Farnum
On Thursday, May 24, 2012 at 4:05 PM, Josh Durgin wrote: snip One thing that's not addressed in the earlier design is how to make images read-only. The simplest way would be to only support layering on top of snapshots, which are read-only by definition. Another way would be to allow

Re: RBD format changes and layering

2012-05-25 Thread Josh Durgin
On 05/25/2012 01:55 PM, Greg Farnum wrote: On Thursday, May 24, 2012 at 4:05 PM, Josh Durgin wrote: snip One thing that's not addressed in the earlier design is how to make images read-only. The simplest way would be to only support layering on top of snapshots, which are read-only by

Re: RBD format changes and layering

2012-05-25 Thread Sage Weil
On Fri, 25 May 2012, Josh Durgin wrote: On 05/25/2012 07:57 AM, Alex Elder wrote: /** * Get the metadata about the image required to do I/O * to it. In the future this may include extra information for * features that require it, like encryption/compression type. * This extra data

Re: RBD format changes and layering

2012-05-25 Thread Josh Durgin
On 05/25/2012 03:26 PM, Sage Weil wrote: On Fri, 25 May 2012, Josh Durgin wrote: On 05/25/2012 07:57 AM, Alex Elder wrote: /** * Get the metadata about the image required to do I/O * to it. In the future this may include extra information for * features that require it, like

Question regarding API doc

2012-05-25 Thread Xiaopong Tran
I'm looking at the description in this API: http://ceph.com/docs/master/api/librados/#rados_objects_list_next For the parameters entry and key, the doc says (caller must free). I looked it up in the code and found this statement in the doc a bit misleading. Is the doc outdated, or did I miss