[ceph-users] Re: frequent Monitor down

2020-10-28 Thread Eugen Block
Have you looked into syslog and mon logs? Zitat von Andrei Mikhailovsky : Hello everyone, I am having regular messages that the Monitors are going down and up: 2020-10-27T09:50:49.032431+ mon .arh-ibstorage2-ib ( mon .1) 2248 : cluster [WRN] Health check failed: 1/4 mons down, quorum

[ceph-users] Re: frequent Monitor down

2020-10-28 Thread Eugen Block
. Andrei - Original Message - From: "Eugen Block" To: "ceph-users" Sent: Wednesday, 28 October, 2020 11:51:20 Subject: [ceph-users] Re: frequent Monitor down Have you looked into syslog and mon logs? Zitat von Andrei Mikhailovsky : Hello everyone, I am having regul

[ceph-users] Re: frequent Monitor down

2020-10-28 Thread Andrei Mikhailovsky
Yes, I have, Eugen, I see no obvious reason / error / etc. I see a lot of entries relating to Compressing as well as monitor going down. Andrei - Original Message - > From: "Eugen Block" > To: "ceph-users" > Sent: Wednesday, 28 October, 2020 11:51:

[ceph-users] Re: frequent Monitor down

2020-10-28 Thread Andrei Mikhailovsky
drei - Original Message - > From: "Eugen Block" > To: "Andrei Mikhailovsky" > Cc: "ceph-users" > Sent: Wednesday, 28 October, 2020 20:19:15 > Subject: Re: [ceph-users] Re: frequent Monitor down > Why do you have 4 MONs in the first place?

[ceph-users] Re: frequent Monitor down

2020-10-29 Thread Marc Roos
Really? First time I read this here, afaik you can get a split brain like this. -Original Message- Sent: Thursday, October 29, 2020 12:16 AM To: Eugen Block Cc: ceph-users Subject: [ceph-users] Re: frequent Monitor down Eugen, I've got four physical servers and I've instal

[ceph-users] Re: frequent Monitor down

2020-10-29 Thread Tony Liu
> From: Marc Roos > Sent: Thursday, October 29, 2020 1:42 AM > To: andrei ; eblock > Cc: ceph-users > Subject: [ceph-users] Re: frequent Monitor down > > Really? First time I read this here, afaik you can get a split brain > like this. > > > > -Origina

[ceph-users] Re: frequent Monitor down

2020-10-29 Thread Janne Johansson
Den tors 29 okt. 2020 kl 20:16 skrev Tony Liu : > Typically, the number of nodes is 2n+1 to cover n failures. > It's OK to have 4 nodes, from failure covering POV, it's the same > as 3 nodes. 4 nodes will cover 1 failure. If 2 nodes down, the > cluster is down. It works, just not make much sense.

[ceph-users] Re: frequent Monitor down

2020-10-30 Thread Frank Schilder
Schilder AIT Risø Campus Bygning 109, rum S14 From: Janne Johansson Sent: 29 October 2020 22:07:45 To: Tony Liu Cc: Marc Roos; ceph-users Subject: [ceph-users] Re: frequent Monitor down Den tors 29 okt. 2020 kl 20:16 skrev Tony Liu : > Typically, the number