On 2017-01-27 11:47, Hans Deragon wrote:
On 2017-01-24 14:48, Adam Borowski wrote:

On Tue, Jan 24, 2017 at 01:57:24PM -0500, Hans Deragon wrote:

If I remove 'ro' from the options, I cannot get the filesystem mounted
because of the following error: "BTRFS: missing devices(1) exceeds the
limit(0), writeable mount is not allowed". So I am stuck. I can only
mount the filesystem as read-only, which prevents me from adding a disk.

A known problem: you get only one shot at fixing the filesystem, but
that's not because of some damage but because the check for whether the
fs is in good enough shape to mount is oversimplistic.

Here's a patch: if you apply it and recompile, you'll be able to mount
degraded rw.

Note that it removes a safety harness: here, the harness got tangled up
and keeps you from recovering when it shouldn't, but it _has_ valid uses.

Meow!

Greetings,

Ok, that solution will solve my problem in the short run, i.e. getting
my raid1 up again.

However, as a user, I am seeking an easy, no-maintenance raid
solution.  I wish that if a drive fails, the btrfs filesystem would still
mount rw and leave the OS running, but warn the user of the failing
disk and easily allow the addition of a new drive to reintroduce
redundancy.  Are there any plans within the btrfs community to implement
such a feature?  A year from now, when the other drive fails, will I hit
this problem again, i.e. my OS failing to start, booting into a terminal,
and being unable to reintroduce a new drive without recompiling the
kernel?
Before I make any suggestions regarding this, I should point out that mounting read-write when a device is missing is what caused this issue in the first place. Doing so is extremely dangerous in any RAID setup, regardless of your software stack. The filesystem is expected to store things reliably when a write succeeds, and if you've got a broken RAID array, claiming that you can store things reliably is generally a lie. MD and LVM both have things in place to mitigate most of the risk, but even there it's still risky. Yes, it's not convenient to have to deal with a system that won't boot, but it's at least a whole lot easier to deal with on Linux than in most other operating systems.

Now, the first step to reliable BTRFS usage is using up-to-date kernels. If you're actually serious about using BTRFS, you should be doing this anyway though. Assuming you're keeping up-to-date on the kernel, then you won't hit this same problem again (or at least you shouldn't, since multiple people now have checks for this in their regression testing suites for BTRFS).

The second is proper monitoring. A well set up monitoring system will, most of the time, let you know that a disk is failing before it gets to the point of simply disappearing from the system. There is currently no BTRFS-specific monitoring tool, but it's really easy to set up automated monitoring for stuff like this. It's impractical for me to cover exact configuration here, since I don't know how much background you have dealing with this kind of thing (and you're probably using systemd since it's Ubuntu, and I have near zero background dealing with recurring task scheduling with that). I can however cover a list of what you should be monitoring and roughly how often; rough sketches of the checks follow after the list.

1. SMART status from the storage devices. You'll need smartmontools for this. In general, I'd suggest using smartctl through cron or a systemd timer unit to monitor this instead of smartd. A basic command line that will work on all modern SATA disks to perform the check you want is:

smartctl -H /dev/sda

You'll need one call for each disk; just replace /dev/sda with each device. Note that this should be the device itself, not the partitions. If that command spits out a warning (or returns with an exit code other than 0), something's wrong and you should at least investigate (and possibly look at replacing the disk). I would suggest checking SMART status at least daily, and potentially much more frequently. When the self-checks in the disk firmware start failing (which is what this is checking), it generally means that failure is imminent, usually within a couple of days at most.

2. BTRFS scrub. If you're serious about data safety, you should be running a scrub on the filesystem regularly. As a general rule, once a week is reasonable unless you have marginal hardware or are seriously paranoid. Make sure to check the results later with the 'btrfs scrub status' command. It will tell you if it found any errors, and how many it was able to fix. Isolated single errors are generally not a sign of imminent failure; it's when they start happening regularly, or you see a whole lot at once, that you're in trouble. Scrub will also fix most synchronization issues between devices in a RAID set.

3. BTRFS device stats. BTRFS stores per-device error counters in the filesystem. These track cumulative errors since the last time they were reset, including errors encountered during normal operation. You should be checking these regularly. I'm a bit paranoid, so most of my systems check every hour; daily is usually sufficient for most people. There are a couple of options for checking these. The newest versions of btrfs-progs (which are not in Ubuntu yet) have a switch that will change the exit code if any counter is non-zero. The other option (which works regardless of btrfs-progs version) is to use a script to check the output.

4. Filesystem mount flags. When BTRFS encounters a severe error (I'm not sure about the full list of things that will trigger this, except that it doesn't include read errors if they get corrected, which they should if you're using RAID), it will remount the filesystem read-only. This is a safety measure to prevent the kernel or the rest of the system from making any issues with the filesystem worse. You should monitor the mount options for the filesystem to know when this happens (note that the response _SHOULD NOT_ be to remount the FS writable again; if the kernel remounted it read-only, something is seriously wrong). A number of monitoring tools can automate checking this one for you (as well as other stuff like disk usage), and it's also pretty easy to find scripts that do this on the internet, since this is pretty standard behavior among a wide variety of Linux filesystems.
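To make item 1 concrete, here's a minimal sketch of the kind of SMART check I mean, meant to be run daily from cron or a systemd timer. The device list and the assumption that local mail delivery to root works are things you'd adapt to your own setup:

#!/bin/sh
# Check the overall SMART health of each disk; adjust the device list to match your system.
for dev in /dev/sda /dev/sdb; do
    if ! smartctl -H "$dev" >/dev/null 2>&1; then
        # A non-zero exit code means smartctl reported a problem; send a note to root.
        echo "SMART health check failed on $dev" | mail -s "SMART warning" root
    fi
done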
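For item 2, the weekly scrub itself can be kicked off from cron with something along these lines; /mnt/data is just a placeholder for wherever your filesystem is mounted, and the path to the btrfs binary may differ on your system:

# Example /etc/cron.d entry: start a scrub every Sunday at 03:00 and wait
# for it to finish (-B keeps it in the foreground so cron captures the output).
0 3 * * 0  root  /bin/btrfs scrub start -B /mnt/data

You'd still want to look at 'btrfs scrub status /mnt/data' afterwards, or make sure the output of that cron job ends up somewhere you'll actually read it.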
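For item 3, since the exit-code switch isn't in Ubuntu's btrfs-progs yet, here's a rough sketch of the script approach, which works with any version; /mnt/data is again a placeholder:

#!/bin/sh
# 'btrfs device stats' prints one counter per line with the value in the last
# field; sum them all and complain if the total is non-zero.
errors=$(btrfs device stats /mnt/data | awk '{ sum += $NF } END { print sum + 0 }')
if [ "$errors" -gt 0 ]; then
    echo "BTRFS device stats report $errors errors on /mnt/data" | mail -s "BTRFS device errors" root
fi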
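And for item 4, checking whether the filesystem has been remounted read-only can be as simple as looking at its options in /proc/mounts; /mnt/data is the placeholder mountpoint again:

#!/bin/sh
# Field 2 of /proc/mounts is the mountpoint and field 4 is the option string;
# if 'ro' shows up there, the filesystem is mounted read-only.
if awk '$2 == "/mnt/data" { print $4 }' /proc/mounts | grep -qw ro; then
    echo "/mnt/data is mounted read-only" | mail -s "BTRFS filesystem went read-only" root
fi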