CBT on an existing cluster

2016-01-05 Thread Deneau, Tom
Having trouble getting a reply from c...@cbt.com so trying ceph-devel list... To get familiar with CBT, I first wanted to use it on an existing cluster. (i.e., not have CBT do any cluster setup). Is there a .yaml example that illustrates how to use cbt to run for example, its radosbench

RE: ceph-mon terminated with status 28

2015-12-15 Thread Deneau, Tom
Brad -- The issue is in tracker now.. http://tracker.ceph.com/issues/14088 -- Tom > -Original Message- > From: Brad Hubbard [mailto:bhubb...@redhat.com] > Sent: Monday, December 14, 2015 3:47 PM > To: Deneau, Tom > Cc: ceph-devel@vger.kernel.org > Subject: Re: c

RE: ceph-mon terminated with status 28

2015-12-14 Thread Deneau, Tom
Thanks, Brad. That was the problem. Is there a reason why we don't log more descriptive info for this kind of failure? -- Tom > -Original Message- > From: Brad Hubbard [mailto:bhubb...@redhat.com] > Sent: Sunday, December 13, 2015 4:19 PM > To: Deneau, Tom >

ceph-mon terminated with status 28

2015-12-13 Thread Deneau, Tom
I am trying to understand the following failure: A small cluster was running fine, and then was left unused for a while. When I went to try to use it again, the mon socket wasn't there and I could see that ceph-mon was not running. I saw the lines below at the end of dmesg output. When I tried

tarball for 10.0.0

2015-11-30 Thread Deneau, Tom
I did not see the source tarball for 10.0.0 at http://download.ceph.com/tarballs/ceph-10.0.0.tar.gz -- Tom Deneau -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majord...@vger.kernel.org More majordomo info at

keyring issues, 9.1.0

2015-10-22 Thread Deneau, Tom
My current situation as I upgrade to v9.1.0 is that client.admin keyring seems to work fine, for instance for ceph status command. But commands that use client.bootstrap-osd such as /usr/bin/ceph --cluster ceph --name client.bootstrap-osd --keyring /var/lib/ceph/bootstrap-osd/ceph.keyring

osd activation under 9.1.0

2015-10-16 Thread Deneau, Tom
Using 9.1.0 I am getting the error shown below at ceph-deploy osd activate time. + ceph-deploy --overwrite-conf osd activate Intel-2P-Sandy-Bridge-04:/var/local//dev/sdf2:/dev/sdf1 ... [][WARNIN] INFO:ceph-disk:Running command: /usr/bin/ceph-osd --cluster ceph --mkfs --mkkey -i 4 --monmap

RE: osd activation under 9.1.0

2015-10-16 Thread Deneau, Tom
> -Original Message- > From: Sage Weil [mailto:s...@newdream.net] > Sent: Friday, October 16, 2015 4:35 PM > To: Deneau, Tom > Cc: ceph-devel@vger.kernel.org > Subject: Re: osd activation under 9.1.0 > > On Fri, 16 Oct 2015, Deneau, Tom wrote: > > Usin

symbol lookup error v9.1.0

2015-10-16 Thread Deneau, Tom
On an ubuntu trusty system, * I installed v9.1.0 and could bring up a single node cluster with it. * I did a git checkout of v9.1.0, followed by ./autogen.sh; ./configure; make Then when I try to run for example the rados I just built using "./src/.libs/rados -v" I get

RE: v9.1.0 Infernalis release candidate released

2015-10-14 Thread Deneau, Tom
I tried an rpmbuild on Fedora21 from the tarball which seemed to work ok. But having trouble doing "ceph-deploy --overwrite-conf mon create-initial" with 9.1.0". This is using ceph-deploy version 1.5.24. Is this part of the "needs Fedora 22 or later" story? -- Tom [myhost][DEBUG ] create a done

RE: v9.1.0 Infernalis release candidate released

2015-10-14 Thread Deneau, Tom
-bin (2.19-0ubuntu6.6) ... [][WARNIN] usermod: user 'ceph' does not exist Any suggestions for recovering from this situation? -- Tom > -Original Message- > From: Sage Weil [mailto:s...@newdream.net] > Sent: Wednesday, October 14, 2015 12:40 PM > To: Deneau, Tom > Cc: c

RE: v9.1.0 Infernalis release candidate released

2015-10-14 Thread Deneau, Tom
> -Original Message- > From: Sage Weil [mailto:s...@newdream.net] > Sent: Wednesday, October 14, 2015 3:59 PM > To: Deneau, Tom > Cc: ceph-devel@vger.kernel.org > Subject: RE: v9.1.0 Infernalis release candidate released > > On Wed, 14 Oct 2015, Deneau, Tom wrote

RE: v9.1.0 Infernalis release candidate released

2015-10-14 Thread Deneau, Tom
> -Original Message- > From: Sage Weil [mailto:s...@newdream.net] > Sent: Wednesday, October 14, 2015 4:30 PM > To: Deneau, Tom > Cc: ceph-devel@vger.kernel.org > Subject: RE: v9.1.0 Infernalis release candidate released > > On Wed, 14 Oct 2015, Deneau, Tom wr

RE: throttles

2015-10-13 Thread Deneau, Tom
> -Original Message- > From: Sage Weil [mailto:s...@newdream.net] > Sent: Tuesday, October 13, 2015 7:44 AM > To: Deneau, Tom > Cc: ceph-devel@vger.kernel.org > Subject: Re: throttles > > On Mon, 12 Oct 2015, Deneau, Tom wrote: > > Looking at the perf c

RE: throttles

2015-10-13 Thread Deneau, Tom
, 2015 10:57 AM > To: Deneau, Tom; Sage Weil > Cc: ceph-devel@vger.kernel.org > Subject: RE: throttles > > BTW, you can completely turn off these throttles ( other than the > filestore throttle ) by setting the value to 0. > > Thanks & Regards > Somnath > > --

dump_historic_ops, slow requests

2015-10-12 Thread Deneau, Tom
I have a small ceph cluster (3 nodes, 5 osds each, journals all just partitions on the spinner disks) and I have noticed that when I hit it with a bunch of rados bench clients all doing writes of large (40M objects) with --no-cleanup, the rados bench commands seem to finish OK but I often get

throttles

2015-10-12 Thread Deneau, Tom
Looking at the perf counters on my osds, I see wait counts for the following throttle related perf counters: (This is from trying to benchmark using multiple rados bench client processes). throttle-filestore_bytes throttle-msgr_dispatch_throttler-client throttle-osd_client_bytes

RE: perf counters from a performance discrepancy

2015-10-08 Thread Deneau, Tom
> -Original Message- > From: Sage Weil [mailto:sw...@redhat.com] > Sent: Wednesday, October 07, 2015 9:48 PM > To: Deneau, Tom > Cc: Mark Nelson; Gregory Farnum; ceph-devel@vger.kernel.org > Subject: RE: perf counters from a performance discrepancy > > > I fi

RE: perf counters from a performance discrepancy

2015-10-07 Thread Deneau, Tom
> -Original Message- > From: Deneau, Tom > Sent: Wednesday, September 23, 2015 3:05 PM > To: 'Mark Nelson'; Gregory Farnum; Sage Weil > Cc: ceph-devel@vger.kernel.org > Subject: RE: perf counters from a performance discrepancy > > > > > -Original M

perf counters from a performance discrepancy

2015-09-23 Thread Deneau, Tom
Hi all -- Looking for guidance with perf counters... I am trying to see whether the perf counters can tell me anything about the following discrepancy I populate a number of 40k size objects in each of two pools, poolA and poolB. Both pools cover osds on a single node, 5 osds total. *

RE: perf counters from a performance discrepancy

2015-09-23 Thread Deneau, Tom
I will be out of office for a week but will put this on the list of things to try when I get back. -- Tom > -Original Message- > From: Samuel Just [mailto:sj...@redhat.com] > Sent: Wednesday, September 23, 2015 3:28 PM > To: Deneau, Tom > Cc: Mark Nelson; Gregory Farnum;

RE: perf counters from a performance discrepancy

2015-09-23 Thread Deneau, Tom
> -Original Message- > From: Gregory Farnum [mailto:gfar...@redhat.com] > Sent: Wednesday, September 23, 2015 3:39 PM > To: Deneau, Tom > Cc: ceph-devel@vger.kernel.org > Subject: Re: perf counters from a performance discrepancy > > On Wed, Sep 23, 2015 at 9:33

RE: perf counters from a performance discrepancy

2015-09-23 Thread Deneau, Tom
> -Original Message- > From: Mark Nelson [mailto:mnel...@redhat.com] > Sent: Wednesday, September 23, 2015 1:43 PM > To: Gregory Farnum; Sage Weil > Cc: Deneau, Tom; ceph-devel@vger.kernel.org > Subject: Re: perf counters from a performance discrepancy > > >

RE: rados bench object not correct errors on v9.0.3

2015-08-26 Thread Deneau, Tom
-Original Message- From: Dałek, Piotr [mailto:piotr.da...@ts.fujitsu.com] Sent: Wednesday, August 26, 2015 2:02 AM To: Sage Weil; Deneau, Tom Cc: ceph-devel@vger.kernel.org; ceph-us...@ceph.com Subject: RE: rados bench object not correct errors on v9.0.3 -Original Message

rados bench object not correct errors on v9.0.3

2015-08-25 Thread Deneau, Tom
-Original Message- From: ceph-devel-ow...@vger.kernel.org [mailto:ceph-devel- ow...@vger.kernel.org] On Behalf Of Sage Weil Sent: Monday, August 24, 2015 12:45 PM To: ceph-annou...@ceph.com; ceph-devel@vger.kernel.org; ceph-us...@ceph.com; ceph-maintain...@ceph.com Subject: v9.0.3

RE: rados bench object not correct errors on v9.0.3

2015-08-25 Thread Deneau, Tom
-Original Message- From: Sage Weil [mailto:sw...@redhat.com] Sent: Tuesday, August 25, 2015 12:43 PM To: Deneau, Tom Cc: ceph-devel@vger.kernel.org; ceph-us...@ceph.com; piotr.da...@ts.fujitsu.com Subject: Re: rados bench object not correct errors on v9.0.3 On Tue, 25 Aug 2015

RE: rados bench object not correct errors on v9.0.3

2015-08-25 Thread Deneau, Tom
-Original Message- From: ceph-devel-ow...@vger.kernel.org [mailto:ceph-devel- ow...@vger.kernel.org] On Behalf Of Deneau, Tom Sent: Tuesday, August 25, 2015 1:24 PM To: Sage Weil Cc: ceph-devel@vger.kernel.org; ceph-us...@ceph.com; piotr.da...@ts.fujitsu.com Subject: RE: rados

registering for tracker.ceph.com

2015-07-23 Thread Deneau, Tom
I wanted to register for tracker.ceph.com to enter a few issues but never got the confirming email and my registration is now in some stuck state (not complete but name/email in use so can't re-register). Any suggestions? -- Tom Deneau -- To unsubscribe from this list: send the line unsubscribe

building just src/tools/rados

2015-07-22 Thread Deneau, Tom
Is there a make command that would build just the src/tools or even just src/tools/rados ? -- Tom Deneau -- To unsubscribe from this list: send the line unsubscribe ceph-devel in the body of a message to majord...@vger.kernel.org More majordomo info at

9.0.2 test/perf_local.cc on non-x86 architectures

2015-07-21 Thread Deneau, Tom
I was trying to do an rpmbuild of v9.0.2 for aarch64 and got the following error: test/perf_local.cc: In function 'double div32()': test/perf_local.cc:396:31: error: impossible constraint in 'asm' cc); Probably should have an if defined (__i386__) around it. -- Tom

RE: osd suicide timeout

2015-07-13 Thread Deneau, Tom
To: Deneau, Tom Cc: ceph-devel Subject: Re: osd suicide timeout On Fri, Jul 10, 2015 at 10:45 PM, Deneau, Tom tom.den...@amd.com wrote: I have an osd log file from an osd that hit a suicide timeout (with the previous 1 events logged). (On this node I have also seen this suicide timeout

RE: osd suicide timeout

2015-07-13 Thread Deneau, Tom
- From: Gregory Farnum [mailto:g...@gregs42.com] Sent: Monday, July 13, 2015 11:45 AM To: Deneau, Tom Cc: ceph-devel Subject: Re: osd suicide timeout heartbeat_map reset_timeout 'OSD::osd_op_tp thread 0x3ff6eb0efd0' had suicide timed out after 150 So that's the OSD's op thread, which

osd suicide timeout

2015-07-10 Thread Deneau, Tom
I have an osd log file from an osd that hit a suicide timeout (with the previous 1 events logged). (On this node I have also seen this suicide timeout happen once before and also a sync_entry timeout. I can see that 6 minutes or so before that osd died, other osds on the same node were

RE: load-gen from an osd node

2015-07-08 Thread Deneau, Tom
-ops on the command line. -- Tom Deneau -Original Message- From: Gregory Farnum [mailto:g...@gregs42.com] Sent: Wednesday, July 01, 2015 11:05 AM To: Deneau, Tom Cc: ceph-devel Subject: Re: load-gen from an osd node Hmm, the only changes I see between those two versions are some

osd aborts, sync entry timeout and suicide timeout

2015-07-06 Thread Deneau, Tom
I had a small (4 nodes, 19 OSDs) cluster that I was running a sort of stress test on over the weekend. Let's call the 4 nodes, A, B, C and D. (Node A had the monitor running on it). Anyway, node C died with a hardware problem, and, I think at about that same time two of the 5 osds on node B

pgs stuck undersized and degraded

2015-06-29 Thread Deneau, Tom
On a very small (3 node) cluster, I have one pool with a replication size of 3 that is showing some stuck PGs. This pool has 64 pgs and the other pgs in the pool seem fine, mapped to 3 osds each. And all the pgs in other pools are also fine. Why would these pgs be stuck with 2 ? The osd crush

RE: load-gen from an osd node

2015-06-29 Thread Deneau, Tom
Oh, I just noticed that the client nodes I spoke of where load-gen actually worked were running 0.94, not 9.0.1. And when I upgrade them to 9.0.1, load-gen no longer works. So more likely this is just a problem with newer rados load-gens -- Tom -Original Message- From: Deneau

load-gen from an osd node

2015-06-26 Thread Deneau, Tom
I am running 9.0.1 and I noticed when I run rados load-gen from one of the osd nodes, it creates the objects but then always reports a throughput of 0 MB/sec. But if I run it from a separate client node, it works fine. Why would this be? I'm not sure but I thought in earlier versions load-gen

RE: deleting objects from a pool

2015-06-25 Thread Deneau, Tom
- From: Podoski, Igor [mailto:igor.podo...@ts.fujitsu.com] Sent: Thursday, June 25, 2015 1:06 AM To: Deneau, Tom; Dałek, Piotr; ceph-devel Subject: RE: deleting objects from a pool Hi, It appears, that cleanup can be used as a purge: rados -p poolname cleanup --prefix Regards, Igor

deleting objects from a pool

2015-06-24 Thread Deneau, Tom
I have benchmarking situations where I want to leave a pool around but delete a lot of objects from the pool. Is there any really fast way to do that? I noticed rados rmpool is fast but I don't want to remove the pool. I have been spawning multiple threads, each deleting a subset of the objects

RE: deleting objects from a pool

2015-06-24 Thread Deneau, Tom
-Original Message- From: ceph-devel-ow...@vger.kernel.org [mailto:ceph-devel- ow...@vger.kernel.org] On Behalf Of Deneau, Tom Sent: Wednesday, June 24, 2015 6:44 PM I have benchmarking situations where I want to leave a pool around but delete a lot of objects from the pool

erasure pool with isa plugin

2015-06-22 Thread Deneau, Tom
If one has a cluster with some nodes that can run with the ISA plugin and some that cannot, is there a way to define a pool such that the ISA-capable nodes can use the ISA plugin and the others can use say the jerasure plugin? -- Tom Deneau -- To unsubscribe from this list: send the line

osd pool erasure code stripe width

2015-06-19 Thread Deneau, Tom
I am trying to understand the use of osd pool erasure code stripe width For example, I have a single-node system with a k=2,m=1 ec pool and I write a single 40M object to this pool using rados bench. But when I look on the disk, I still see only the 3 20M pieces for this object. Where does the

RE: osd pool erasure code stripe width

2015-06-19 Thread Deneau, Tom
] Sent: Friday, June 19, 2015 5:47 PM To: Deneau, Tom; ceph-devel Subject: Re: osd pool erasure code stripe width Hi Tom, A stripe width of 4KB (the default) means the object is encoded 4KB at a time. It does not show in the file written to disk. Cheers On 19/06/2015 22:11, Deneau, Tom

rados bench throughput with no disk or network activity

2015-05-28 Thread Deneau, Tom
I've noticed that * with a single node cluster with 4 osds * and running rados bench rand on that same node so no network traffic * with a number of objects small enough so that everything is in the cache so no disk traffic we still peak out at about 1600 MB/sec. And the cpu is 40%

RE: rados bench throughput with no disk or network activity

2015-05-28 Thread Deneau, Tom
-Original Message- From: Gregory Farnum [mailto:g...@gregs42.com] Sent: Thursday, May 28, 2015 6:18 PM To: Deneau, Tom Cc: ceph-devel Subject: Re: rados bench throughput with no disk or network activity On Thu, May 28, 2015 at 4:09 PM, Deneau, Tom tom.den...@amd.com wrote

RE: journal writes when running rados bench seq on ec pool

2015-05-20 Thread Deneau, Tom
Hi -- I never saw an explanation for the writes that are occurring during ecpool reads. -- Tom -Original Message- From: Somnath Roy [mailto:somnath@sandisk.com] Sent: Wednesday, May 13, 2015 6:48 PM To: Deneau, Tom; ceph-devel Subject: RE: journal writes when running rados

RE: ceph tell osd bench

2015-04-23 Thread Deneau, Tom
-Original Message- From: Gregory Farnum [mailto:g...@gregs42.com] Sent: Thursday, April 23, 2015 12:37 PM To: Deneau, Tom Cc: ceph-devel Subject: Re: ceph tell osd bench On Thu, Apr 23, 2015 at 6:58 AM, Deneau, Tom tom.den...@amd.com wrote: While running ceph tell osd bench

ceph tell osd bench

2015-04-23 Thread Deneau, Tom
While running ceph tell osd bench and playing around with the total_bytes and block_size parameters, I have noticed that if the total_bytes written is less than about 0.5G, the bytes/sec is much higher. Why is that? -- Tom Deneau -- To unsubscribe from this list: send the line unsubscribe

hashing variation in rados bench runs

2015-04-20 Thread Deneau, Tom
I have been trying to run rados bench runs and I've noticed a lot of variations from run to run. The runs generally write data with --no-cleanup then read it back (seq), dropping the caches in between I admit this is on a single node cluster with 5 data disks so maybe not realistic but... In

RE: ms_crc_data false

2015-04-09 Thread Deneau, Tom
-Original Message- From: Gregory Farnum [mailto:g...@gregs42.com] Sent: Wednesday, April 08, 2015 6:26 PM To: Deneau, Tom Cc: ceph-devel Subject: Re: ms_crc_data false On Wed, Apr 8, 2015 at 3:38 PM, Deneau, Tom tom.den...@amd.com wrote: With 0.93, I tried ceph tell 'osd

ms_crc_data false

2015-04-08 Thread Deneau, Tom
With 0.93, I tried ceph tell 'osd.*' injectargs '--ms_crc_data=false' '--ms_crc_header=false' and saw the changes reflected in ceph admin-daemon But having done that, perf top still shows time being spent in crc32 routines. Is there some other parameter that needs changing? -- Tom Deneau

RE: ms_crc_data false

2015-04-08 Thread Deneau, Tom
-Original Message- From: Sage Weil [mailto:s...@newdream.net] Sent: Wednesday, April 08, 2015 5:40 PM To: Deneau, Tom Cc: ceph-devel Subject: Re: ms_crc_data false On Wed, 8 Apr 2015, Deneau, Tom wrote: With 0.93, I tried ceph tell 'osd.*' injectargs '--ms_crc_data=false

rpmbuild version

2015-03-28 Thread Deneau, Tom
Starting from a ceph git checkout I want to create a source tar.bz2 to feed to rpmbuild. I see there is a make dist-bzip2 to make the actual source tar but if I want the built binaries to identify themselves as a custom version, what is the best way to do that? -- Tom Deneau -- To unsubscribe

seg fault in ceph-osd on aarch64

2015-03-26 Thread Deneau, Tom
I've been exercising the the 64-bit arm (aarch64) version of ceph. This is from self-built rpms from the v0.93 snapshot. The cluster is a single system with 6 hard drives, one osd each. I've been letting it run with some rados bench and rados load-gen loops and running bonnie++ on an rbd mount.

RE: seg fault in ceph-osd on aarch64

2015-03-26 Thread Deneau, Tom
any suggestions for stress tests, etc that might make this happen sooner? -- Tom -Original Message- From: Sage Weil [mailto:s...@newdream.net] Sent: Thursday, March 26, 2015 12:17 PM To: Deneau, Tom Cc: ceph-devel Subject: Re: seg fault in ceph-osd on aarch64 On Thu, 26 Mar 2015

RE: packages on download.ceph.com

2015-03-11 Thread Deneau, Tom
-Original Message- From: Sage Weil [mailto:sw...@redhat.com] Sent: Monday, March 09, 2015 4:05 PM To: Danny Al-Gaaf Cc: Mark Nelson; Deneau, Tom; ceph-devel Subject: Re: packages on download.ceph.com On Mon, 9 Mar 2015, Danny Al-Gaaf wrote: Am 09.03.2015 um 20:35 schrieb Mark

RE: packages on download.ceph.com

2015-03-09 Thread Deneau, Tom
-Original Message- From: Danny Al-Gaaf [mailto:danny.al-g...@bisect.de] Sent: Monday, March 09, 2015 3:46 PM To: Mark Nelson; Deneau, Tom; ceph-devel Subject: Re: packages on download.ceph.com Am 09.03.2015 um 20:35 schrieb Mark Nelson: On 03/09/2015 02:06 PM, Deneau, Tom

packages on download.ceph.com

2015-03-09 Thread Deneau, Tom
I'm trying to gather information on what it would take to get packages for an architecture other than x86_64 up on http://download.ceph.com -- Tom Deneau, AMD -- To unsubscribe from this list: send the line unsubscribe ceph-devel in the body of a message to majord...@vger.kernel.org More

RE: [ceph-users] who is using radosgw with civetweb?

2015-02-26 Thread Deneau, Tom
Robert -- We are still having trouble with this. Can you share your [client.radosgw.gateway] section of ceph.conf and were there any other special things to be aware of? -- Tom -Original Message- From: ceph-devel-ow...@vger.kernel.org [mailto:ceph-devel-ow...@vger.kernel.org] On

RE: using radosgw with mod_proxy_fcgi

2015-02-16 Thread Deneau, Tom
-Original Message- From: Yehuda Sadeh-Weinraub [mailto:yeh...@redhat.com] Sent: Wednesday, February 11, 2015 7:10 PM To: Deneau, Tom Cc: ceph-devel@vger.kernel.org Subject: Re: using radosgw with mod_proxy_fcgi - Original Message - From: Tom Deneau tom.den...@amd.com To: ceph-devel

using radosgw with mod_proxy_fcgi

2015-02-11 Thread Deneau, Tom
I am a Ceph novice and have the rados and rbd setups working and would like to use the radosgw stack. I am running on a platform (aarch64) for which there are no pre-built binaries of the ceph patched apache and the ceph patched mod_fastcgi. But since I gather from the mail lists that the

RE: using ceph-deploy on build after make install

2015-02-04 Thread Deneau, Tom
Message- From: john.sp...@inktank.com [mailto:john.sp...@inktank.com] On Behalf Of John Spray Sent: Wednesday, February 04, 2015 2:35 AM To: Deneau, Tom Cc: ceph-devel@vger.kernel.org Subject: Re: using ceph-deploy on build after make install I suspect that your clue is Failed to execute

using ceph-deploy on build after make install

2015-02-03 Thread Deneau, Tom
New to ceph building but here is my situation... I have been successfully able to build ceph starting from git checkout firefly (also successful from git checkout master). After building, I am able to run vstarth.sh from the source directory as ./vstart.sh -d -n -x (or with -X). I can then