On Mon, Dec 11, 2017 at 04:55:31PM -0500, Josef Bacik wrote:
> From: Josef Bacik <jba...@fb.com>
> 
> Now that we have metadata counters in the VM, we need to provide a way to kick
> writeback on dirty metadata.  Introduce super_operations->write_metadata.  
> This
> allows file systems to deal with writing back any dirty metadata we need based
> on the writeback needs of the system.  Since there is no inode to key off of 
> we
> need a list in the bdi for dirty super blocks to be added.  From there we can
> find any dirty sb's on the bdi we are currently doing writeback on and call 
> into
> their ->write_metadata callback.
> 
> Signed-off-by: Josef Bacik <jba...@fb.com>
> Reviewed-by: Jan Kara <j...@suse.cz>
> Reviewed-by: Tejun Heo <t...@kernel.org>
> ---
>  fs/fs-writeback.c                | 72 
> ++++++++++++++++++++++++++++++++++++----
>  fs/super.c                       |  6 ++++
>  include/linux/backing-dev-defs.h |  2 ++
>  include/linux/fs.h               |  4 +++
>  mm/backing-dev.c                 |  2 ++
>  5 files changed, 80 insertions(+), 6 deletions(-)
> 
> diff --git a/fs/fs-writeback.c b/fs/fs-writeback.c
> index 987448ed7698..fba703dff678 100644
> --- a/fs/fs-writeback.c
> +++ b/fs/fs-writeback.c
> @@ -1479,6 +1479,31 @@ static long writeback_chunk_size(struct bdi_writeback 
> *wb,
>       return pages;
>  }
>  
> +static long writeback_sb_metadata(struct super_block *sb,
> +                               struct bdi_writeback *wb,
> +                               struct wb_writeback_work *work)
> +{
> +     struct writeback_control wbc = {
> +             .sync_mode              = work->sync_mode,
> +             .tagged_writepages      = work->tagged_writepages,
> +             .for_kupdate            = work->for_kupdate,
> +             .for_background         = work->for_background,
> +             .for_sync               = work->for_sync,
> +             .range_cyclic           = work->range_cyclic,
> +             .range_start            = 0,
> +             .range_end              = LLONG_MAX,
> +     };
> +     long write_chunk;
> +
> +     write_chunk = writeback_chunk_size(wb, work);
> +     wbc.nr_to_write = write_chunk;
> +     sb->s_op->write_metadata(sb, &wbc);
> +     work->nr_pages -= write_chunk - wbc.nr_to_write;
> +
> +     return write_chunk - wbc.nr_to_write;

Ok, writeback_chunk_size() returns a page count. We've already gone
through the "metadata is not page sized" dance on the dirty
accounting side, so how are we supposed to use pages to account for
metadata writeback?

And, from what I can tell, if work->sync_mode = WB_SYNC_ALL or
work->tagged_writepages is set, this will basically tell us to flush
the entire dirty metadata cache because write_chunk will get set to
LONG_MAX.

IOWs, this would appear to me to change sync() behaviour quite
dramatically on filesystems where ->write_metadata is implemented.
That is, instead of leaving all the metadata dirty in memory and
just forcing the journal to stable storage, filesystems will be told
to also write back all their dirty metadata before sync() returns,
even though it is not necessary to provide correct sync()
semantics....

Mind you, writeback invocation is so convoluted now I could easily
be mis-interpretting this code, but it does seem to me like this
code is going to have some unintended behaviours....

Cheers,

Dave.
-- 
Dave Chinner
da...@fromorbit.com
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to