On 2017-04-06 23:25, John Petrini wrote:
> Interesting. That's the first time I'm hearing this. If that's the
> case I feel like it's a stretch to call it RAID10 at all. It sounds a
> lot more like basic replication, similar to Ceph, only Ceph understands
> failure domains and therefore can be configured to handle device
> failure (albeit at a higher level).
Yeah, the stacking is a bit odd, and there are some rather annoying caveats that make most of the names other than raid5/raid6 misleading. In fact, when run on more than two disks, BTRFS raid1 mode is closer to what most people think of as RAID10 than BTRFS raid10 mode is, although it stripes at a much higher level.

> I do of course keep backups, but I chose RAID10 for the mix of
> performance and reliability. It doesn't seem worth losing 50% of
> my usable space for the performance gain alone.

> Thank you for letting me know about this. Knowing that, I think I may
> have to reconsider my choice here. I've really been enjoying the
> flexibility of BTRFS, which is why I switched to it in the first place,
> but with experimental RAID5/6 and what you've just told me I'm
> beginning to doubt that it's the right choice.
There are some other options in how you configure it. Most of the more useful operational modes actually require stacking BTRFS on top of LVM or MD. I'm rather fond of running BTRFS raid1 on top of LVM RAID0 volumes, which, while it provides no better data safety than BTRFS raid10 mode, gets noticeably better performance. You can also reverse that stacking to get something more like traditional RAID10, but then you lose the self-correcting aspect of BTRFS.
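
To make the layering a bit more concrete, here's a rough sketch of that first setup, assuming four disks already set up as PVs in a single volume group. The VG name, device names, and sizes are all placeholders, and in practice you'd probably just run the handful of commands by hand rather than script them:

#!/usr/bin/env python3
# Rough sketch: two LVM raid0 LVs (the striping layer), with BTRFS raid1
# across them (the mirroring/self-healing layer).  The VG name, devices,
# and sizes below are made up -- don't run this against disks you care about.
import subprocess

VG = "vg0"                      # assumed volume group spanning all four disks
PAIRS = [                       # each raid0 LV stripes across one pair of PVs
    ("fast0", ["/dev/sda1", "/dev/sdb1"]),
    ("fast1", ["/dev/sdc1", "/dev/sdd1"]),
]
SIZE = "500G"

def run(cmd):
    print("+", " ".join(cmd))
    subprocess.run(cmd, check=True)

# The striping layer: one LVM raid0 logical volume per pair of devices.
for name, pvs in PAIRS:
    run(["lvcreate", "--type", "raid0", "--stripes", str(len(pvs)),
         "-L", SIZE, "-n", name, VG] + pvs)

# The redundancy layer: BTRFS raid1 (data and metadata) across the two LVs.
run(["mkfs.btrfs", "-d", "raid1", "-m", "raid1"]
    + ["/dev/{}/{}".format(VG, name) for name, _ in PAIRS])

Reversing the stacking is just the mirror image of this: LVM or MD raid1 pairs underneath, with '-d raid0' at the BTRFS level on top.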

> What's more concerning is that I haven't found a good way to monitor
> BTRFS. I might be able to accept that the array can only handle a
> single drive failure if I were confident that I could detect it, but so
> far I haven't found a good solution for this.
This I can actually give some advice on. There are a couple of options, but the easiest is to find a piece of generic monitoring software that can check the return code of external programs, and then write some simple scripts to perform the checks on BTRFS (there's a rough sketch of one such script after the list). The things you want to keep an eye on are:

1. Output of 'btrfs dev stats'. If you've got a new enough copy of btrfs-progs, you can pass '--check' and the return code will be non-zero if any of the error counters isn't zero. If you've got to use an older version, you'll instead have to write a script to parse the output (I will comment that this is much easier in a language like Perl or Python than it is in bash). You want to watch for steady increases in error counts or sudden large jumps. Single intermittent errors are worth tracking, but they tend to happen more frequently the larger the array is.

2. Results from 'btrfs scrub'. This is somewhat tricky because scrub is either asynchronous or blocks for a _long_ time. The simplest option I've found is to fire off an asynchronous scrub to run during down-time, and then schedule recurring checks with 'btrfs scrub status'. On the plus side, 'btrfs scrub status' already returns non-zero if the scrub found errors.

3. Watch the filesystem flags. Some monitoring software can easily do this for you (Monit, for example, can watch for changes in the flags). The general idea here is that BTRFS will go read-only if it hits certain serious errors, so you can watch for that transition and send a notification when it happens. This is also a worthwhile check because the filesystem flags should not change during normal operation of any filesystem.

4. Watch SMART status on the drives and run regular self-tests. Most of the time, issues will show up here before they show up in the FS, so by watching this, you may have an opportunity to replace devices before the filesystem ends up completely broken.

5. If you're feeling really ambitious, watch the kernel logs for errors from BTRFS and whatever storage drivers you use. This is the least reliable item on this list to automate, so I wouldn't suggest relying on it by itself.
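
To make the "simple scripts" part concrete, here's a rough sketch of the kind of check I mean for items 1 and 2 -- something any monitor that can test an external program's return code can call on a schedule. The mountpoint is just an example, and it assumes Python 3:

#!/usr/bin/env python3
# Rough sketch of a BTRFS health check for a return-code based monitor.
# Exits 0 if things look clean, non-zero otherwise.
import subprocess
import sys

MOUNTPOINT = "/mnt/data"   # adjust to your filesystem

def run(cmd):
    return subprocess.run(cmd, capture_output=True, text=True)

failed = False

# Item 1: device error counters.  With a new enough btrfs-progs you can
# instead run 'btrfs dev stats --check' and just test its return code;
# parsing the counters as below also works on older versions.
stats = run(["btrfs", "dev", "stats", MOUNTPOINT])
if stats.returncode != 0:
    failed = True
    print(stats.stderr.strip())
for line in stats.stdout.splitlines():
    # Lines look like: "[/dev/sdb1].write_io_errs   0"
    fields = line.split()
    if len(fields) == 2 and fields[1].isdigit() and int(fields[1]) != 0:
        failed = True
        print("non-zero error counter:", line)

# Item 2: result of the last (or currently running) scrub.  'scrub status'
# already returns non-zero if the scrub found errors.
scrub = run(["btrfs", "scrub", "status", MOUNTPOINT])
if scrub.returncode != 0:
    failed = True
    print(scrub.stdout.strip() or scrub.stderr.strip())

sys.exit(1 if failed else 0)

The scrub itself still gets kicked off separately (cron during your quiet hours is fine); this just picks up the result afterwards.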

The first two items are BTRFS specific. The rest, however, are standard things you should be monitoring regardless of what type of storage stack you have. Of these, item 3 will trigger immediately in the event of a catastrophic device failure, while items 1, 2, and 5 will provide better coverage of slow failures, and item 4 covers both.

As far as what to use to actually track these, that really depends on your use case. For tracking on an individual-system basis, I'd suggest Monit: it's efficient, easy to configure, provides some degree of error resilience, and can cover a lot of monitoring tasks beyond this sort of thing. If you want some kind of centralized monitoring, I'd probably go with Nagios, but that's more because it's the standard for that type of thing than because I've used it myself (I much prefer per-system decentralized monitoring, with only the checks that systems are online centralized).