Hi,

On 09/06/15 15:45, Bob Peterson wrote:
> ----- Original Message -----
>> Hi,
>>
>> On 05/06/15 15:49, Bob Peterson wrote:
>>> Hi,
>>>
>>> This patch allows the block allocation code to retain the buffers
>>> for the resource groups so they don't need to be re-read from the
>>> buffer cache with every request. This is a performance improvement
>>> that's especially noticeable when resource groups are very large.
>>> For example, with 2GB resource groups and 4K blocks, there can be
>>> 33 blocks for every resource group. This patch allows those 33
>>> buffers to be kept around rather than read in and thrown away with
>>> every operation. The buffers are released when the resource group
>>> is either synced or invalidated.
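
(For reference, assuming I have the on-disk layout right, the 33 comes
from GFS2 keeping two bitmap bits per block: 2GB / 4KB = 524,288
blocks per rgrp, i.e. 128KB of bitmap data, and with a little under
4KB of usable bitmap per 4KB block once the header is deducted, that
takes 33 bitmap blocks.)
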
>> The blocks should be cached between operations, so this should only
>> save looking up the cached blocks, with no changes to the actual
>> I/O. Does that mean that grab_cache_page() is slow, I wonder? Or is
>> this an issue of going around the retry loop due to lack of memory
>> at some stage?
>>
>> How does this interact with the rgrplvb support? I'd guess that with
>> that turned on, this is no longer an issue, because we'd only read
>> in the blocks for the rgrps that we are actually going to use?
>>
>> Steve.
> Hi,
>
> If you compare the two vmstat outputs in bugzilla #1154782, you'll
> see no significant difference in either memory usage or CPU usage.
> So I assume the page lookup is the "slow" part; not because it's
> such a slow thing, but because it's done 33 times per
> read-reference-invalidate cycle (33 pages to look up per rgrp).
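
For reference, the read side currently looks roughly like this
(paraphrased from memory from gfs2_rgrp_bh_get(), so treat the names
and details as approximate rather than the exact code):

	/* One buffer lookup per bitmap block: ~33 iterations for a
	 * 2GB rgrp, and each gfs2_meta_read() goes through a page
	 * cache lookup before it can return the cached buffer. */
	for (x = 0; x < length; x++) {
		struct gfs2_bitmap *bi = rgd->rd_bits + x;

		error = gfs2_meta_read(rgd->rd_gl, rgd->rd_addr + x,
				       0, &bi->bi_bh);
		if (error)
			goto fail;
	}
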

Regards,

Bob Peterson
Red Hat File Systems

That's true. However, as I understand the problem here, the issue is
not reading in the blocks for the rgrp that is eventually selected for
use, but reading in the blocks for the rgrps that we reject, for
whatever reason (full, congested, etc.). So with rgrplvb enabled, we
don't read those rgrps in off disk at all in most cases - so I was
wondering whether that solves the problem without needing this change?

Ideally I'd like to make the rgrplvb setting the default, since it is much more efficient. The question is how we can do that and still remain backward compatible? Not an easy one to answer :(
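
(For context: rgrplvb currently has to be requested explicitly at
mount time, e.g. "mount -t gfs2 -o rgrplvb /dev/foo /mnt/gfs2", with
placeholder device and mount point there. Presumably the backward
compatibility worry is that older nodes don't keep the LVB contents
up to date, so it can't simply be switched on by default in a mixed
cluster.)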

Also, if the page lookup is the slow thing, then we should look at
using pagevec_lookup() to get the pages in chunks, rather than doing
it individually (and indeed, multiple times per page when the block
size is less than the page size). We know that the blocks will always
be contiguous on disk, so we should be able to send down large I/Os,
rather than relying on the block stack to merge them as we do at the
moment, which should be a further improvement too. Something along the
lines of the sketch below, perhaps.
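
A rough sketch of the chunked lookup, assuming the pagevec API
(pagevec_init()/pagevec_lookup()/pagevec_release()); the function name
and the per-page handling are made up for illustration and are not
actual gfs2 code:

	#include <linux/pagemap.h>
	#include <linux/pagevec.h>

	/* Look up the pages backing one rgrp's bitmap blocks in
	 * batches of up to PAGEVEC_SIZE pages, instead of one
	 * page cache lookup per bitmap block. */
	static void rgrp_lookup_pages(struct address_space *mapping,
				      pgoff_t start, pgoff_t end)
	{
		struct pagevec pvec;
		unsigned int i, nr;

		pagevec_init(&pvec, 0);
		while (start <= end) {
			nr = pagevec_lookup(&pvec, mapping, start,
					    PAGEVEC_SIZE);
			if (!nr)
				break;
			for (i = 0; i < nr; i++) {
				struct page *page = pvec.pages[i];

				if (page->index > end) {
					pagevec_release(&pvec);
					return;
				}
				/* ... check/attach buffers for this
				 * page's blocks here ... */
				start = page->index + 1;
			}
			pagevec_release(&pvec);
		}
	}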

Steve.
