Re: Warnings and crash

2011-11-03 Thread Mitch Harder
On Wed, Nov 2, 2011 at 6:40 PM, Chris Mason chris.ma...@oracle.com wrote:
 On Thu, Nov 03, 2011 at 12:02:31AM +0100, David Sterba wrote:
 This one happened again, exactly same sequence of warnings and the crash
 at the end (same stack traces). It was in integration-scrub branch, ie.
 with all fixes on top.

 (I have captured sysrq-w, /proc/pid/stack of all processes and have the
 dmesg logs if anybody is interested.)

 Yes, I'd like to see the logs.  We need to figure out if we just failed
 with enospc in a critical spot or if there are other bigger problems.


I'm seeing similar warnings while testing the integration-scrub branch.

I'm testing on a 16-GB partition that is less than 20% full when I'm
seeing the warning.

Let me know if you're still working on this issue, and I'll try to
isolate the command that is running.

[ 8631.183234] device fsid aff24946-5596-4c19-a6c4-81809ae6bd94 devid
1 transid 7 /dev/sdb6
[ 8631.183483] btrfs: disk space caching is enabled
[ 8712.675332] btrfs: block rsv returned -28
[ 8712.675334] [ cut here ]
[ 8712.675362] WARNING: at fs/btrfs/extent-tree.c:5837
btrfs_alloc_free_block+0x3e9/0x400 [btrfs]()
[ 8712.675364] Hardware name: P35-DS3L
[ 8712.675365] Modules linked in: ipv6 snd_seq_midi snd_seq_dummy
snd_seq_oss snd_seq_midi_event snd_seq snd_pcm_oss snd_mixer_oss
lgdt330x cx88_dvb cx88_vp3054_i2c videobuf_dvb dvb_core rc_hauppauge
tuner_simple tuner_types tda9887 tda8290 tuner nvidia(P) ir_lirc_codec
lirc_dev ir_mce_kbd_decoder cx88_alsa cx8800 ir_sony_decoder
ir_jvc_decoder ir_rc6_decoder cx8802 cx88xx snd_ens1371 ir_rc5_decoder
ir_nec_decoder gameport snd_rawmidi snd_seq_device rc_core
i2c_algo_bit snd_ac97_codec videobuf_dma_sg tveeprom v4l2_common
videodev media v4l2_compat_ioctl32 videobuf_core btcx_risc iTCO_wdt
iTCO_vendor_support intel_agp intel_gtt ac97_bus snd_pcm snd_timer
i2c_i801 snd i2c_core snd_page_alloc ppdev parport_pc parport tpm_tis
tpm r8169 tpm_bios sr_mod pcspkr iscsi_tcp libiscsi_tcp libiscsi fuse
nfs nfs_acl auth_rpcgss lockd sunrpc btrfs zlib_deflate sl811_hcd
ohci_hcd uhci_hcd ehci_hcd
[ 8712.675409] Pid: 7917, comm: btrfs-transacti Tainted: P
3.1.0-integration-scrub+ #1
[ 8712.675411] Call Trace:
[ 8712.675417]  [81046e8f] warn_slowpath_common+0x7f/0xc0
[ 8712.675420]  [81046eea] warn_slowpath_null+0x1a/0x20
[ 8712.675428]  [a004a2e9] btrfs_alloc_free_block+0x3e9/0x400 [btrfs]
[ 8712.675435]  [a003b621] ? btrfs_leaf_free_space+0x61/0xb0 [btrfs]
[ 8712.675442]  [a0039069] __btrfs_cow_block+0x119/0x560 [btrfs]
[ 8712.675451]  [a0054070] ? btrfs_buffer_uptodate+0x50/0x70 [btrfs]
[ 8712.675458]  [a0039669] btrfs_cow_block+0x1b9/0x2f0 [btrfs]
[ 8712.675465]  [a003d640] btrfs_search_slot+0x5c0/0xa90 [btrfs]
[ 8712.675473]  [a0048b49]
btrfs_write_dirty_block_groups+0xd9/0x530 [btrfs]
[ 8712.675481]  [a0048952] ? btrfs_run_delayed_refs+0xf2/0x210 [btrfs]
[ 8712.675490]  [a0057475] commit_cowonly_roots+0x115/0x1e0 [btrfs]
[ 8712.675499]  [a0058a74]
btrfs_commit_transaction+0x404/0x8a0 [btrfs]
[ 8712.675508]  [a0057e31] ?
join_transaction.clone.24+0x21/0x240 [btrfs]
[ 8712.675511]  [81068e30] ? wake_up_bit+0x40/0x40
[ 8712.675520]  [a0051ee3] transaction_kthread+0x263/0x280 [btrfs]
[ 8712.675528]  [a0051c80] ? btrfs_congested_fn+0xb0/0xb0 [btrfs]
[ 8712.675536]  [a0051c80] ? btrfs_congested_fn+0xb0/0xb0 [btrfs]
[ 8712.675539]  [810688a6] kthread+0x96/0xa0
[ 8712.675542]  [816d3b54] kernel_thread_helper+0x4/0x10
[ 8712.675545]  [81068810] ? kthread_worker_fn+0x1a0/0x1a0
[ 8712.675547]  [816d3b50] ? gs_change+0xb/0xb
[ 8712.675549] ---[ end trace 234618ad5fea81a4 ]---
--
To unsubscribe from this list: send the line unsubscribe linux-btrfs in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Warnings and crash

2011-11-02 Thread David Sterba
This one happened again, exactly same sequence of warnings and the crash
at the end (same stack traces). It was in integration-scrub branch, ie.
with all fixes on top.

(I have captured sysrq-w, /proc/pid/stack of all processes and have the
dmesg logs if anybody is interested.)


david
--
To unsubscribe from this list: send the line unsubscribe linux-btrfs in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Re: Warnings and crash

2011-11-02 Thread Chris Mason
On Thu, Nov 03, 2011 at 12:02:31AM +0100, David Sterba wrote:
 This one happened again, exactly same sequence of warnings and the crash
 at the end (same stack traces). It was in integration-scrub branch, ie.
 with all fixes on top.
 
 (I have captured sysrq-w, /proc/pid/stack of all processes and have the
 dmesg logs if anybody is interested.)

Yes, I'd like to see the logs.  We need to figure out if we just failed
with enospc in a critical spot or if there are other bigger problems.

-chris

--
To unsubscribe from this list: send the line unsubscribe linux-btrfs in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html


Warnings and crash

2011-11-01 Thread David Sterba
the related source pointers of the warnings and crash (detailed below):

[ 1186.540073] WARNING: at fs/btrfs/extent-tree.c:5837 
btrfs_alloc_free_block+0x337/0x350 [btrfs]()
[ 1186.541076] WARNING: at fs/btrfs/extent-tree.c:4617 
__btrfs_free_extent+0x60d/0x6c0 [btrfs]()
[ 1186.541234] BUG: unable to handle kernel NULL pointer dereference at 
  (null)
[ 1186.541237] IP: [a005b881] map_private_extent_buffer+0x21/0x1f0 
[btrfs]


fs/btrfs/extent-tree.c:5837

This not a new warning, also known as 'the warn in use_block_rsv', happened
during xfstests/224.

The ratelimited printk patch (btrfs: ratelimit WARN_ON in use_block_rsv) is
part of the btrfs-next-stable branch pulled in, though is still meant as a
safety not to hang the box completely (and should go away in the future).

[ 1186.529033] btrfs: block rsv returned -28
[ 1186.534256] [ cut here ]
[ 1186.540073] WARNING: at fs/btrfs/extent-tree.c:5837 
btrfs_alloc_free_block+0x337/0x350 [btrfs]()
[ 1186.540075] Hardware name: Santa Rosa platform
[ 1186.540077] Modules linked in: btrfs aoe
[ 1186.540082] Pid: 5007, comm: btrfs-transacti Not tainted 3.1.0-default+ #64
[ 1186.540084] Call Trace:
[ 1186.540091]  [81057fdf] warn_slowpath_common+0x7f/0xc0
[ 1186.540095]  [8105803a] warn_slowpath_null+0x1a/0x20
[ 1186.540104]  [a002a9d7] btrfs_alloc_free_block+0x337/0x350 [btrfs]
[ 1186.540109]  [810913dd] ? trace_hardirqs_off+0xd/0x10
[ 1186.540113]  [810098d5] ? native_sched_clock+0x15/0x70
[ 1186.540118]  [810844ff] ? local_clock+0x6f/0x80
[ 1186.540126]  [a0017162] __btrfs_cow_block+0x152/0x800 [btrfs]
[ 1186.540130]  [810844ff] ? local_clock+0x6f/0x80
[ 1186.540133]  [810916fd] ? lock_release_holdtime+0x3d/0x1a0
[ 1186.540141]  [a0017ae1] btrfs_cow_block+0x2d1/0x410 [btrfs]
[ 1186.540150]  [a001cb88] btrfs_search_slot+0x728/0xe70 [btrfs]
[ 1186.540159]  [a0022f0b] lookup_inline_extent_backref+0x8b/0x480 
[btrfs]
[ 1186.540168]  [a00245f0] __btrfs_free_extent+0xd0/0x6c0 [btrfs]
[ 1186.540172]  [813628be] ? do_raw_spin_unlock+0x5e/0xb0
[ 1186.540182]  [a00287cf] run_clustered_refs+0x21f/0x850 [btrfs]
[ 1186.540192]  [a0028ec8] btrfs_run_delayed_refs+0xc8/0x220 [btrfs]
[ 1186.540195]  [813628be] ? do_raw_spin_unlock+0x5e/0xb0
[ 1186.540205]  [a0029178] btrfs_write_dirty_block_groups+0x158/0x540 
[btrfs]
[ 1186.540216]  [a0038ee5] commit_cowonly_roots+0x115/0x1e0 [btrfs]
[ 1186.540228]  [a0039ee7] btrfs_commit_transaction+0x477/0x8f0 
[btrfs]
[ 1186.540240]  [a00391a9] ? join_transaction+0x69/0x2b0 [btrfs]
[ 1186.540243]  [8107d480] ? wake_up_bit+0x40/0x40
[ 1186.540254]  [a0032a1b] transaction_kthread+0x28b/0x2a0 [btrfs]
[ 1186.540264]  [a0032790] ? btree_readpage+0x30/0x30 [btrfs]
[ 1186.540267]  [8107cf16] kthread+0xa6/0xb0
[ 1186.540272]  [81a1ea04] kernel_thread_helper+0x4/0x10
[ 1186.540276]  [81a15334] ? retint_restore_args+0x13/0x13
[ 1186.540279]  [8107ce70] ? __init_kthread_worker+0x70/0x70
[ 1186.540282]  [81a1ea00] ? gs_change+0x13/0x13
[ 1186.540284] ---[ end trace 6fa3da5fd65420af ]---
[ 1186.540433] btrfs: block rsv returned -28

ratelimited, I see just 2 occurences before, followed by

[ 1186.541076] WARNING: at fs/btrfs/extent-tree.c:4617 
__btrfs_free_extent+0x60d/0x6c0 [btrfs]()
[ 1186.541079] Hardware name: Santa Rosa platform
[ 1186.541080] Modules linked in: btrfs aoe
[ 1186.541084] Pid: 5007, comm: btrfs-transacti Tainted: GW   
3.1.0-default+ #64
[ 1186.541086] Call Trace:
[ 1186.541089]  [81057fdf] warn_slowpath_common+0x7f/0xc0
[ 1186.541093]  [8105803a] warn_slowpath_null+0x1a/0x20
[ 1186.541101]  [a0024b2d] __btrfs_free_extent+0x60d/0x6c0 [btrfs]
[ 1186.541105]  [813628be] ? do_raw_spin_unlock+0x5e/0xb0
[ 1186.541114]  [a0028949] run_clustered_refs+0x399/0x850 [btrfs]
[ 1186.541126]  [a0079800] ? btrfs_find_ref_cluster+0xb0/0x1a0 [btrfs]
[ 1186.541136]  [a0028ec8] btrfs_run_delayed_refs+0xc8/0x220 [btrfs]
[ 1186.541139]  [813628be] ? do_raw_spin_unlock+0x5e/0xb0
[ 1186.541149]  [a0029178] btrfs_write_dirty_block_groups+0x158/0x540 
[btrfs]
[ 1186.541160]  [a0038ee5] commit_cowonly_roots+0x115/0x1e0 [btrfs]
[ 1186.541172]  [a0039ee7] btrfs_commit_transaction+0x477/0x8f0 
[btrfs]
[ 1186.541183]  [a00391a9] ? join_transaction+0x69/0x2b0 [btrfs]
[ 1186.541186]  [8107d480] ? wake_up_bit+0x40/0x40
[ 1186.541196]  [a0032a1b] transaction_kthread+0x28b/0x2a0 [btrfs]
[ 1186.541207]  [a0032790] ? btree_readpage+0x30/0x30 [btrfs]
[ 1186.541210]  [8107cf16] kthread+0xa6/0xb0
[ 1186.541213]  [81a1ea04] kernel_thread_helper+0x4/0x10
[ 1186.541216]  [81a15334] ? retint_restore_args+0x13/0x13
[ 1186.541219]  [8107ce70