On 02/20/2018 12:53 AM, David Sterba wrote:
On Thu, Feb 15, 2018 at 01:02:24PM +0800, Anand Jain wrote:
btrfs_close_extra_devices() is not exactly about just closing the opened
devices, but its also about free-ing the stale devices which may have
scanned into the btrfs_fs_devices::dev_list. The way it picks devices to
be freed is by going through the btrfs_fs_devices::dev_list and its seed
devices, and finding for devices which do not have the flag
BTRFS_DEV_STATE_IN_FS_METADATA nor if it is part of the replace target.

However, in the first place the way devices are scanned and added to the
btrfs_fs_devices::dev_list have changed for a long time now. During scan
when it finds matching fsid+uuid+devid it would add the device to
btrfs_fs_devices::dev_list. A matched device with higher generation number
overwrites the device with lower generation number during.

Further, the stale devices containing the stale fsid are removed at the
time of the scan itself.

So there isn't any opportunity that btrfs_close_extra_devices() can free
the stale device within the fsid which is being mounted.

Further about the btrfs_fs_devices::latest_bdev that
the btrfs_close_extra_devices() function assigns, is already assigned by
the function __btrfs_open_devices().

So as this function has no effect, delete it.

I think this is correct. Freeing stale devices as a side effect of mount
does not seem right anyway. I'm still not able to convince myself that
there's not an unexpected interaction of dev scan and dev replace, as
it relies on the state bits and other locks. If you have ideas were to
put some asserts or extra checks, please suggest.


 Thanks for comments. I took a fresh look after a long weekend.
 Here I found something new. Following test case [1] can simulate stale
 SB on the missing device which got deleted. Now if we do the fresh
 dev scan and mount the deleted device should not be kept in the
 device_list after the mount because that's stale. And as IN_FS_METADATA
 flag is not set on this device this function helps to clean up those
 devices. The dev scan free_stale can't catch this condition because
 we won't go deeper than the SB read. Sorry, my bad pls ignore this
 patch this is a very corner case or I had gone nuts when I investigated
 this.

[1]
  mkfs.btrfs -fq -mraid1 -draid1 /dev/sde /dev/sdf /dev/sdg
  modprobe -r btrfs
  mount -o degraded,device=/dev/sde /dev/sdf /btrfs
  btrfs dev del 3 /btrfs
  umount /btrfs
  btrfs dev scan /dev/sde /dev/sdf /dev/sdg
  mount /dev/sde /btrfs
  wipefs -a /dev/sdg <-- this should work as sdg is not part of
                         fsid mounted anymore.

 I am thinking to rename this function to btrfs_free_extra_devices()
 and add comments.

 Also about the fs_devices::latest_bdev we certainly assign it in
 __btrfs_open_devices(), but there we don't care if it picks the
 replacing target. And suppose if replacing_target has the highest
 (or equal) generation number compared to other we shall read the sys
 and chunk tree from it. So during the mount context we use and
 reconstruct using the replacing target, but at the end of mount
 context, we use the non-replace target as the latest_bdev.

Thanks, Anand


--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to