Re: [btrfs-progs] Bug in mkfs.btrfs -r

Qu Wenruo Fri, 01 Sep 2017 06:55:07 -0700


On 2017年09月01日 20:47, Austin S. Hemmelgarn wrote:

On 2017-09-01 08:19, Qu Wenruo wrote:
On 2017年09月01日 20:05, Austin S. Hemmelgarn wrote:
On 2017-09-01 07:49, Qu Wenruo wrote:
On 2017年09月01日 19:28, Austin S. Hemmelgarn wrote:
On 2017-08-31 20:13, Qu Wenruo wrote:
On 2017年09月01日 01:27, Goffredo Baroncelli wrote:
Hi All,
I found a bug in mkfs.btrfs, when it is used the option '-r'. Itseems that it is not visible the full disk.
Despite the new bug you found, -r has several existing bugs.
Is this actually a bug though? Every other filesystem creationtool that I know of that offers functionality like this generatesthe filesystem just large enough to contain the data you want init, so I would argue that making this use the whole device isactually breaking consistency with other tools, not to mentionremoving functionality that is useful (even aside from the systemimage generation use case I mentioned, there are other practicalapplications (seed 'device' generation comes to mind).
Well, then documentation bug.

And I'm not sure the chunk size is correct or optimized.
Even for btrfs-convert, which will make data chunks very scattered,we still try to make a large chunk to cover scattered data extents.
For a one-shot or read-only filesystem though, a maximally sizedchunk is probably suboptimal.
Not exactly.
Current kernel (and btrfs-progs also tries to follow kernel chunkallocator's behavior) will not make a chunk larger than 10% of RW space.
So for small filesystem chunk won't be too maximally sized.
Are you sure about this? I've got a couple of sub 10GB BTRFS volumesthat definitely have more than one 1GB data chunk.


Yes, check the following code:

        /* we don't want a chunk larger than 10% of writeable space */
        max_chunk_size = min(div_factor(fs_devices->total_rw_bytes, 1),
                             max_chunk_size);
Which is in __btrfs_alloc_chunk() function in fs/btrfs/volumes.c

Suppose you use this to generate a base image for a system in theform of a seed device. This actually ends up being a pretty easy wayto get factory reset functionality. It's also a case where you wantthe base image to take up as little space as possible, so that theend-user usable storage space is as much as possible. In that case,if your base image doesn't need an exact multiple of 1GB for datachunks, then using 1GB data chunks is not the best choice for atleast the final data chunk (because the rest of that 1GB getswasted). A similar argument applies for metadata.
Yes, your example makes sense. (despite of above 10% limit I mentioned).

The problem is, no one really knows how the image will be used.
Maybe it will be used as normal btrfs (with fi resize), or with yourpurpose.
We can't save users from making poor choices. If we could, we wouldn'thave anywhere near as many e-mails on the list from people who aretrying to recover data from their broken filesystems because they haveno backups.
The only case I can find where '-r' is a win is when you need thefilesystem to be as small as possible with no free space. The momentyou need free space, it's actually faster to just create the filesystem,resize it to the desired size, and then copy in your data (I've actuallybenchmarked this, and while it's not _much_ difference in time spent,there is a measurable difference, with my guess being that theallocation code is doing more work in userspace than in the kernel). Ata minimum, I think it's probably worth documenting this fact.

I still remember some time ago, other guys told me that the mainadvantage of -r is we don't need root privilege to mount.

Anyway, documentation is important, but we need to first know thecorrect or designed behavior of -r.


At least mkfs.ext4 -d option doesn't limit the size.

In my test, 1G file with mkfs.ext -d still shows about 900M+ availablespace.

For normal btrfs case, although it may not cause much problem, but itwill not be the optimized use case and may need extra manual balance.
Actually, until the first write to the filesystem, it will still be anoptimal layout. Once you start writing to any BTRFS filesystem that hasan optimal layout though, it immediately becomes non-optimal, andthere's not really anything we can do about that unless we allow chunksthat are already allocated to be resized on the fly (which is a bad ideafor multiple reasons).
At least to me, it's not the case for chunk created by -r option.
BTW, seed device is RO anyway, how much or how less spare space wehave is not a problem at all.
That really depends on how you look at it. Aside from the aboveexample, there's the rather specific question of why you would notwant to avoid wasting space. The filesystem is read-only, whichmeans that any 'free space' on that filesystem is completelyunusable, can't be reclaimed for anything else, and in general isjust a waste.
Still same problem above.
What if the seed device is de-attached and then be used as normal btrfs?
So to me, even follow other tools -r, we should follow the normalextent allocator behavior to create data/metadata, and then set thedevice size to end of its dev extents.
I don't entirely agree, but I think I've made my point well enoughabove.
Yes, you did make your point clear, and I agree that use cases youmentioned exist and wasted space also exists.
But since we don't really know what the image will be used, I preferto keep everything to use kernel (or btrfs-progs) chunk allocator tomake the behavior consistent.
So my point is more about consistent behavior of btrfs-progs andkernel, and less maintenance.(That's to say, my goal for mkfs.btrfs -r is just to do mkfs, mount,cp without privilege)
Perhaps we could add some tool then to take a BTRFS filesystem andrestructure it to have an optimal layout? On first examination, theresize command actually sounds like a reasonable place to do this,possibly add a 'min' keyword (similar to 'max') that can also adjustchunk sizes to get the smallest possible filesystem. The biggest thingI'm worried about here is that there are numerous use cases for optimalfilesystems of minimal size, and changing the behavior of the -r optionwill remove the only currently available way to get such filesystems.


Yes, when we're going to cover all possible cases, we're doomed.

So I'll just make it as simple as possible for now.

If some one really wants to do that, resize subcommand seems to be agood place to start.


Thanks,
Qu

--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [btrfs-progs] Bug in mkfs.btrfs -r

Reply via email to