Re: [PATCH v3 1/5] btrfs: scrub: Introduce full stripe lock for RAID56

Qu Wenruo Sun, 02 Apr 2017 17:43:42 -0700


At 04/01/2017 01:26 AM, David Sterba wrote:

On Fri, Mar 31, 2017 at 10:03:28AM +0800, Qu Wenruo wrote:



At 03/30/2017 06:31 PM, David Sterba wrote:

On Thu, Mar 30, 2017 at 09:03:21AM +0800, Qu Wenruo wrote:

+static int lock_full_stripe(struct btrfs_fs_info *fs_info, u64 bytenr)
+{
+       struct btrfs_block_group_cache *bg_cache;
+       struct btrfs_full_stripe_locks_tree *locks_root;
+       struct full_stripe_lock *existing;
+       u64 fstripe_start;
+       int ret = 0;
+
+       bg_cache = btrfs_lookup_block_group(fs_info, bytenr);
+       if (!bg_cache)
+               return -ENOENT;
+


When starting to scrub a chunk, we've already increased a ref for block group,
could you please put a ASSERT to catch it?


Personally I prefer WARN_ON() than ASSERT().

ASSERT() always panic the modules and forces us to reset the system.
Wiping out any possibility to check the system.


I think the sematnics of WARN_ON and ASSERT are different, so it should
be decided case by case which one to use. Assert is good for 'never
happens' or catch errors at development time (wrong API use, invariant
condition that must always match).

Also the asserts are gone if the config option is unset, while WARN_ON
will stay in some form (verbose or not). Both are suitable for catching
problems, but the warning is for less critical errors so we want to know
when it happens but still can continue.

The above case looks like a candidate for ASSERT as the refcounts must
be correct, continuing with the warning could lead to other unspecified
problems.


I'm OK to use ASSERT() here, but current ASSERT() in btrfs can hide real
problem if CONFIG_BTRFS_ASSERT is not set.

When CONFIG_BTRFS_ASSERT is not set, ASSERT() just does thing, and
*continue* executing.

This forces us to build a fallback method.

For above case, if we simply do "ASSERT(bg_cache);" then for
BTRFS_CONFIG_ASSERT not set case (which is quite common for most
distributions) we will cause NULL pointer deference.

So here, we still need to do bg_cache return value check, but just
change "WARN_ON(1);" to "ASSERT(0);" like:
------
        bg_cache = btrfs_lookup_block_group(fs_info, bytenr);
        if (!bg_cache) {
                ASSERT(0); /* WARN_ON(1); */
                return -ENOENT;
        }
------

Can we make ASSERT() really catch problem no matter kernel config?
Current ASSERT() behavior is in fact forcing us to consider both
situation, which makes it less handy.


All agreed, I'm not very happy about how the current ASSERT is
implemented. We want to add more and potentially expensive checks during
debugging builds, but also want to make sure that code does not proceed
pass some points if the invariants and expected values do not hold.

BUG_ON does that but then we have tons of them already and some of them
are just a temporary error handling, while at some other places it
serves as the sanity checker. We'd probably need 3rd option, that would
behave like BUG_ON but named differently, so we can clearly see that
it's intentional, or we can annotate the BUG_ON by comments.

Right, until we have better solution, I'll use the above method andresend the first 2 patch.


BTW, I'm not quite a fan of a new BUG_ON with different name.

Currently WARN_ON/BUG_ON/ASSERT is already a little more complex than weneed.

I prefer to remove all error handler BUG_ON() step by step as we'realready doing, and then leaving BUG_ON() as a sanity check for reallyimpossible case.

And modify WARN_ON() to give less scary info (without backtrace)depending on kernel config for more or less foreseeable and lesscritical error case.


Thanks,
Qu


--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Re: [PATCH v3 1/5] btrfs: scrub: Introduce full stripe lock for RAID56

Reply via email to