On 18.04.19 г. 9:28 ч., Qu Wenruo wrote:
> [BUG]
> With kmalloc failure injection for submit_one_bio(), btrfs can crash like:
> 
>   BUG: unable to handle kernel NULL pointer dereference at 0000000000000038
>   #PF error: [WRITE]
>   PGD 0 P4D 0
>   Oops: 0002 [#1] PREEMPT SMP PTI
>   CPU: 1 PID: 247 Comm: kworker/u8:4 Not tainted 5.1.0-rc5-custom+ #19
>   Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 0.0.0 02/06/2015
>   Workqueue: writeback wb_workfn (flush-btrfs-6)
>   RIP: 0010:alloc_btrfs_bio+0x1e/0x30 [btrfs]
>   Code: 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 0f 1f 44 00 00 48 63 f6 48 63 ff 
> 48 8d 7c fe 18 be 40 8d 00 00 48 c1 e7 02 e8 a2 86 94 e0 <c7> 40 38 00 00 00 
> 00 c7 00 01 00 00 00 c3 0f 1f 40 00 0f 1f 44 00
>   Call Trace:
>    __btrfs_map_block+0x5ce/0x1210 [btrfs]
>    ? btrfs_bio_counter_inc_blocked+0x3a/0xc0 [btrfs]
>    btrfs_map_bio+0x9a/0x430 [btrfs]
>    btree_submit_bio_hook+0x82/0xb0 [btrfs]
>    submit_one_bio+0x95/0xc0 [btrfs]
>    copy_oldmem_page_encrypted+0x20/0x20
>    ? write_one_eb+0x18f/0x2a0 [btrfs]
>    ? end_extent_buffer_writeback+0x20/0x20 [btrfs]
>    ? btree_write_cache_pages+0x12c/0x350 [btrfs]
>    ? do_writepages+0x41/0xd0
>    ? __writeback_single_inode+0x54/0x650
>    ? writeback_sb_inodes+0x1f9/0x540
>    ? __writeback_inodes_wb+0x5d/0xb0
>    ? wb_writeback+0x340/0x4b0
>    ? wb_workfn+0x410/0x5d0
>    ? process_one_work+0x294/0x650
>    ? worker_thread+0x2d/0x3d0
>    ? process_one_work+0x650/0x650
>    ? kthread+0x112/0x130
>    ? kthread_park+0x80/0x80
>    ? ret_from_fork+0x3a/0x50
>   ---[ end trace b637169fb8b17c9c ]---
> 
> [CAUSE]
> We just forgot to check the return value of kmalloc.
> Surprisingly, all alloc_btrfs_bio() callers have handled memory
> allocation pretty well.
> 

The allocation uses the GFP_NOFAIL modified, which, according to the docs:

 * The VM implementation _must_ retry infinitely: the caller
 * cannot handle allocation failures. The allocation could block

 * indefinitely but will never return with failure. Testing for

 * failure is pointless.

The allocation requested is at least 128 bytes (assuming real_stripes is
0).

96 + 24 * total_stripes + 4 * real_stripes + 8 * total_stripes

Considering this I think it might be prudent to also remove the NOFAIL
flag altogether


> [FIX]
> Check and return if we failed memory allocation.
> 
> Signed-off-by: Qu Wenruo <w...@suse.com>

Though the change is fine:

Reviewed-by: Nikolay Borisov <nbori...@suse.com>


> ---
>  fs/btrfs/volumes.c | 2 ++
>  1 file changed, 2 insertions(+)
> 
> diff --git a/fs/btrfs/volumes.c b/fs/btrfs/volumes.c
> index 78bab7803bda..875d0eee1785 100644
> --- a/fs/btrfs/volumes.c
> +++ b/fs/btrfs/volumes.c
> @@ -5582,6 +5582,8 @@ static struct btrfs_bio *alloc_btrfs_bio(int 
> total_stripes, int real_stripes)
>               sizeof(u64) * (total_stripes),
>               GFP_NOFS|__GFP_NOFAIL);
>  
> +     if (!bbio)
> +             return NULL;
>       atomic_set(&bbio->error, 0);
>       refcount_set(&bbio->refs, 1);
>  
> 

Reply via email to