[ceph-users] Re: ceph osd crush move exception

2022-05-05 Thread Eugen Block
Hi, can you share your 'ceph osd tree' so it easier to understand what might be going wrong. I didn't check the script in detail, what exactly do you mean by extending? Do you create new hosts in a different root of the osd tree? Do those new hosts get PGs assigned although they're in a d

[ceph-users] Importance of CEPHADM_CHECK_KERNEL_VERSION

2022-05-05 Thread E Taka
Hello all, how important is it to use the same Linux kernel version on all Hosts? Background is, that new hosts are installed with the actual Ubuntu server 22.04 while the older ones run with Ubuntu 20.04. In other words: may I disable this check: ceph cephadm config-check disable kernel_versio

[ceph-users] Re: Stretch cluster questions

2022-05-05 Thread Eneko Lacunza
Hi Gregory Thanks for your confirmation. I hope I can start some tests today. Cheers El 5/5/22 a las 5:19, Gregory Farnum escribió: On Wed, May 4, 2022 at 1:25 AM Eneko Lacunza wrote: Hi Gregory, El 3/5/22 a las 22:30, Gregory Farnum escribió: On Mon, Apr 25, 2022 at 12:57 AM

[ceph-users] Re: Unbalanced Cluster

2022-05-05 Thread Erdem Agaoglu
Hi David, I think you're right with your option 2. 512 pgs is just too few. You're also right with the "inflation" but you should add your erasure bits to the calculation, so 9x512=4608. With 144 OSDs, you would average 32 pgs per OSD. Some old advice for that number was around 100. But your cur

[ceph-users] Recover from "Module 'progress' has failed"

2022-05-05 Thread Kuhring, Mathias
Dear Ceph community, We are having an issue with the MGR progress module:     Module 'progress' has failed: ('e7fb29e3-9caf-4b20-b930-cee8474526bb',) We are currently on ceph version 16.2.7 (dd0603118f56ab514f133c8d2e3adfc983942503) pacific (stable). I'm aware that there are already issues an

[ceph-users] Telemetry Dashboards tech talk today at 1pm EST

2022-05-05 Thread Yaarit Hatuka
Hi everyone, Join us today (17:00 UTC / 1pm EST) for a tech talk on our Telemetry Dashboards [1], with a focus on crash telemetry work, and interesting use cases for both users and developers. Please add any questions you have here [2]. Thanks! Yaarit [1] https://telemetry-public.ceph.com/ [2] h

[ceph-users] Re: Telemetry Dashboards tech talk today at 1pm EST

2022-05-05 Thread Yaarit Hatuka
To join the meeting on a computer or mobile phone: https://bluejeans.com/908675367/browser To join via Phone: 1) Dial: +1 408 740 7256 +1 888 240 2560(US Toll Free) +1 408 317 9253(Alternate Number) (see all numbers - http://bluejeans.com/numbers) 2) Enter C

[ceph-users] How to make ceph syslog items approximate ceph -w ?

2022-05-05 Thread Harry G. Coin
Using Quincy I'm getting a much worse lag owing to ceph syslog message volume, though without obvious system errors. In the usual case of no current/active hardware errors and no software crashes:  what config settings can I pick so that what appears in syslog is as close to what would appear

[ceph-users] Re: Unbalanced Cluster

2022-05-05 Thread David Schulz
Hi Richard, Thanks for that.  It never occurred to me that we'd need at least 10 servers for that shape of EC.  We will certainly push to get that new server in now. -Dave On 2022-05-04 5:07 p.m., Richard Bade wrote: > [△EXTERNAL] > > > > Hi David, > I think that part of the problem with unbal

[ceph-users] Re: Unbalanced Cluster

2022-05-05 Thread David Schulz
Hi Erdem, The balancer was driving all the weights to 1.0 so I turned it off. The OSDs were creeping up to the 90% full threshold with it turned on. I've been playing whackamole with the OSDs for a week trying to keep the cluster from locking all writes when a single OSD goes over 90%. I

[ceph-users] Re: Unbalanced Cluster

2022-05-05 Thread Richard Bade
Hi David, Something else you could try with that other pool, if it contains little or no data, is to reduce the PG number. This does cause some backfill operations as it does a pg merge but this doesn't take long if the pg is virtually empty. The autoscaler has a mode where it can make recommendati

[ceph-users] Re: Unbalanced Cluster

2022-05-05 Thread Anthony D'Atri
> The balancer was driving all the weights to 1.0 so I turned it off. Which weights (CRUSH or reweight?) And which balancer? Assuming the ceph-mgr balancer module in upmap mode, you’d want the reweight values to be 1.000 since it uses the newer pg-upmap functionality to distribute capac

[ceph-users] add host error

2022-05-05 Thread Rafael Quaglio
Hi, In a fresh quincy install, when I type the command to add a host, I get this error: ceph orch host add gpuno02 10.0.100.2 Error EINVAL: Command ['which', 'python3'] failed. which: no python3 in (/sbin:/bin:/usr/sbin:/usr/bin) The output of the which command: which python3 /usr/bin/

[ceph-users] Re: Unbalanced Cluster

2022-05-05 Thread Jeremy Austin
On Thu, May 5, 2022 at 11:15 AM Anthony D'Atri wrote: > > > This calculator can help when you have multiple pools: > > https://old.ceph.com/pgcalc/ Did an EC-aware version of this calculator ever escape the Red Hat paywall? Thanks, -- Jeremy Austin jhaus...@gmail.com _