Hi,

Thats a good bit of detective work. I've added it to the -nmw tree.
Thanks,

Steve.

On Fri, 2013-12-13 at 08:31 -0500, Bob Peterson wrote:
> Hi,
> 
> This patch fixes a slab memory leak that sometimes can occur
> for files with a very short lifespan. The problem occurs when
> a dinode is deleted before it has gotten to the journal properly.
> In the leak scenario, the bd object is pinned for journal
> committment (queued to the metadata buffers queue: sd_log_le_buf)
> but is subsequently unpinned and dequeued before it finds its way
> to the ail or the revoke queue. In this rare circumstance, the bd
> object needs to be freed from slab memory, or it is forgotten.
> We have to be very careful how we do it, though, because
> multiple processes can call gfs2_remove_from_journal. In order to
> avoid double-frees, only the process that does the unpinning is
> allowed to free the bd.
> 
> Regards,
> 
> Bob Peterson
> Red Hat File Systems
> 
> Signed-off-by: Bob Peterson <rpete...@redhat.com> 
> ---
> diff --git a/fs/gfs2/meta_io.c b/fs/gfs2/meta_io.c
> index e57f608..c7f2469 100644
> --- a/fs/gfs2/meta_io.c
> +++ b/fs/gfs2/meta_io.c
> @@ -261,6 +261,7 @@ void gfs2_remove_from_journal(struct buffer_head *bh, 
> struct gfs2_trans *tr, int
>       struct address_space *mapping = bh->b_page->mapping;
>       struct gfs2_sbd *sdp = gfs2_mapping2sbd(mapping);
>       struct gfs2_bufdata *bd = bh->b_private;
> +     int was_pinned = 0;
>  
>       if (test_clear_buffer_pinned(bh)) {
>               trace_gfs2_pin(bd, 0);
> @@ -276,12 +277,16 @@ void gfs2_remove_from_journal(struct buffer_head *bh, 
> struct gfs2_trans *tr, int
>                       tr->tr_num_databuf_rm++;
>               }
>               tr->tr_touched = 1;
> +             was_pinned = 1;
>               brelse(bh);
>       }
>       if (bd) {
>               spin_lock(&sdp->sd_ail_lock);
>               if (bd->bd_tr) {
>                       gfs2_trans_add_revoke(sdp, bd);
> +             } else if (was_pinned) {
> +                     bh->b_private = NULL;
> +                     kmem_cache_free(gfs2_bufdata_cachep, bd);
>               }
>               spin_unlock(&sdp->sd_ail_lock);
>       }
> 


Reply via email to