Re: booting from BTRFS works only with one device in the pool

Hugo Mills Mon, 01 Feb 2016 15:16:41 -0800

On Mon, Feb 01, 2016 at 11:02:42PM +0000, Duncan wrote:
> Hugo Mills posted on Mon, 01 Feb 2016 22:11:20 +0000 as excerpted:
> 
> > On Mon, Feb 01, 2016 at 10:31:47PM +0100, Hendrik Friedel wrote:
> >> Hello,
> >> 
> >> I am running CentOS from a btrfs root.
> >> This worked fine until I added a device to that pool:
> >> btrfs device add /dev/sda3 /
> >> reboot
> >> 
> >> This now causes the errors:
> >> BTRFS: failed to read chunk tree on sdb3 BTRFS: open_ctree failed
> >> 
> >> Here  I am stuck in a recovery prompt.
> >> 
> >> btrfs fi show displays the file system correctly with 2.1GiB used for
> >> sdb3 and 0.00GiB used on sda3
> > 
> > By far the simplest and most reliable method of doing this is to
> > use an initramfs with the command "btrfs dev scan" in it somewhere
> > before mounting. Most of the major distributions already have an
> > initramfs set up (as does yours, I see), and will install the correct
> > commands in the initramfs if you install the btrfs-progs package
> > (btrfs-tools in Debian derivatives).
> > 
> > The other approach is to add "device=/dev/sda3,device=/dev/sdb3" to
> > the "rootflags=" option on the grub command line, but that's going to
> > break if your kernel or hardware decides to reorder your devices.
> 
> As a btrfs user with a two-device btrfs root filesystem who has had to 
> deal with this same problem myself...
> 
> Unless the problem has been fixed in very recent kernels, device= doesn't 
> work in rootflags= for btrfs on the kernel commandline in any case.  I 
> don't know why, but I suspect it might be the fact that with 
> rootflags=device=, there's at least two =, and the kernel may split it at 
> the wrong = and instead of seeing a rootflags= option it knows what to do 
> with, it sees a rootflags=device= option that it ignores as making no 
> sense, making it a kernel commandline parsing bug.  Alternatively, 
> another poster hinted that the kernel may get it correct, but btrfs for 
> some reason can't use device= in the form it gets passed in from 
> rootflags=, while it can use it when passed in from userspace, which 
> would make it a btrfs bug.
> 
> Either way, (1) the problem isn't new and has been there for quite some 
> time, since early in the kernel 3.x era at least, but (2) other options 
> such as degraded _can_ be passed successfully by rootflags=, so btrfs can 
> use rootflags= in general, it just has problems with device=, for 
> whatever reason.
> 
> Which means...
> 
> > Do the sensible thing and just use the initramfs solution.
> 
> Exactly.
> 
> For over a decade I used reiserfs on / and was able to avoid an initr* 
> entirely.  And single-device btrfs / works without an initr*.  But multi-
> device... not so much.  While I definitely wasn't thrilled at the 
> prospect of having to complicate my boot process with an initr* that 
> _shouldn't_ be necessary, and had to spend some time learning about initr* 
> (I had initially dumped initr* early in my Linux experience and hadn't 
> had to learn about them until I tried multi-device btrfs /) so as to be 
> able to satisfy myself that I could work with it in both normal operation 
> and disaster recovery scenarios...
> 
> At this point there's really only two choices if you want multi-device 
> btrfs /, either use an initr* and have your multi-device btrfs /, or give 
> up on multi-device btrfs / and be satisfied with a more traditional 
> single-device /, btrfs or otherwise.
> 
> Well, unless you're a kernel-level dev sufficiently motivated to look 
> into the problem, devise a patch to solve the problem, and get that patch 
> thru the approval process and into kernel mainline.  In that case there's 
> three options, and I dearly wish someone with the necessary kernel level 
> coder qualification would take that third option, making initr*less multi-
> device btrfs / a viable option for the rest of us who in the mean time 
> are stuck with only the first two options!


   The problem with that approach is that you're going to have to put
policy into the kernel (which devices do you bother to scan? -- we've
hit this already with long scan times caused by timeouts on /dev/fd0
and /dev/cdrom0). That on its own is, I think, enough to doom any
attempt at putting device scan into the kernel.

   An initramfs has been a requirement for so many configurations for
so long, and has largely been a solved problem for the major
distributions, I'm really bemused about people's adamant resistance to
having one. It's *less* trouble than a kernel-based solution, because
at least it's easier to debug. With a decent initramfs builder, you
can get a recovery shell to poke around and fix things (or at least
determine what's broken).

   Hugo, wearing the Hat of Limited Sympathy.

-- 
Hugo Mills             | What's a Nazgûl like you doing in a place like this?
hugo@... carfax.org.uk |
http://carfax.org.uk/  |
PGP: E2AB1DE4          |                                                Illiad

signature.asc
Description: Digital signature

Re: booting from BTRFS works only with one device in the pool

Reply via email to