those are the last useful log outputs before the server locks up 

digging in /var/log/messages - you can see it stopped logging at 12:47, and I 
hard reset at 3:07

maybe I should have specified hard-lock-up instead of panic

2014-02-06T12:47:47.590784+11:00 store03 kernel: [ 4619.769346] ------------[ 
cut here ]------------
2014-02-06T12:47:47.590785+11:00 store03 kernel: [ 4619.769369] WARNING: CPU: 0 
PID: 3005 at /home/abuild/rpmbuild/BUILD/kernel-pae-3.11.6/l
inux-3.11/fs/btrfs/disk-io.c:482 btree_csum_one_bio.isra.48+0x93/0x110 [btrfs]()
2014-02-06T12:47:47.590893+11:00 store03 kernel: [ 4619.769399] Modules linked 
in: bonding hwmon_vid btrfs raid6_pq zlib_deflate xor libcrc32c joydev 
hid_generic iTCO_wdt iTCO_vendor_support coretemp pcspkr serio_raw i2c_i801 
ata_generic lpc_ich mfd_core usbhid mvsas libsas scsi_transport_sas e1000e ptp 
pps_core shpchp mperf sg dm_mod autofs4 ata_piix uhci_hcd ehci_pci ehci_hcd 
usbcore usb_common i915 fan thermal processor drm_kms_helper drm i2c_algo_bit 
button video thermal_sys scsi_dh_hp_sw scsi_dh_emc scsi_dh_rdac scsi_dh_alua 
scsi_dh
2014-02-06T12:47:47.590896+11:00 store03 kernel: [ 4619.769402] CPU: 0 PID: 
3005 Comm: btrfs-worker-1 Tainted: G        W    3.11.6-4-pae #1
2014-02-06T12:47:47.590898+11:00 store03 kernel: [ 4619.769403] Hardware name: 
PhoenixAward 945GM/945GM, BIOS 6.00 PG 08/13/2008
2014-02-06T12:47:47.590899+11:00 store03 kernel: [ 4619.769407]  00000009 
c06e075a 00000000 c0242c5e c085dbc8 00000000 00000bbd f8a06e34
2014-02-06T12:47:47.590901+11:00 store03 kernel: [ 4619.769411]  000001e2 
f8985503 f8985503 0000000d f5abaa5c d7c20a5c f1c97070 c0242d1b
2014-02-06T12:47:47.590903+11:00 store03 kernel: [ 4619.769415]  00000009 
00000000 f8985503 e85afce0 f5abaa5c e62a6c00 16c205ed c6b5476c
2014-02-06T12:47:47.590905+11:00 store03 kernel: [ 4619.769415] Call Trace:
2014-02-06T12:47:47.590906+11:00 store03 kernel: [ 4619.769424]  [<c0204ef9>] 
try_stack_unwind+0x179/0x190
2014-02-06T12:47:47.590908+11:00 store03 kernel: [ 4619.769430]  [<c0203e17>] 
dump_trace+0x47/0xf0
2014-02-06T12:47:47.590910+11:00 store03 kernel: [ 4619.769434]  [<c0204f4f>] 
show_trace_log_lvl+0x3f/0x50
2014-02-06T12:47:47.590911+11:00 store03 kernel: [ 4619.769437]  [<c0203f10>] 
show_stack_log_lvl+0x50/0xd0
2014-02-06T12:47:47.590913+11:00 store03 kernel: [ 4619.769441]  [<c0204f9f>] 
show_stack+0x1f/0x40
2014-02-06T12:47:47.590915+11:00 store03 kernel: [ 4619.769445]  [<c06e075a>] 
dump_stack+0x3e/0x4e
2014-02-06T12:47:47.590917+11:00 store03 kernel: [ 4619.769450]  [<c0242c5e>] 
warn_slowpath_common+0x7e/0xa0
2014-02-06T12:47:47.590918+11:00 store03 kernel: [ 4619.769454]  [<c0242d1b>] 
warn_slowpath_null+0x1b/0x20
2014-02-06T12:47:47.590920+11:00 store03 kernel: [ 4619.769472]  [<f8985503>] 
btree_csum_one_bio.isra.48+0x93/0x110 [btrfs]
2014-02-06T12:47:47.590922+11:00 store03 kernel: [ 4619.769555]  [<f898261f>] 
run_one_async_start+0x2f/0x40 [btrfs]
2014-02-06T12:47:47.590924+11:00 store03 kernel: [ 4619.769630]  [<f89bdcb7>] 
worker_loop+0x107/0x470 [btrfs]
^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@2014-02-06T15:07:05.120258+11:00
 store03 rsyslogd: [origin software="rsyslogd" swVersion="7.4.7" x-pid="418" 
x-info="http://www.rsyslog.com";] start
2014-02-06T15:07:05.127408+11:00 store03 kernel: [    0.000000] Initializing 
cgroup subsys cpuset

as I now can't mount (open_ctree failed)
Should I be mounting with -o recovery ?

On 06/02/14 15:35, Anand Jain wrote:
> 
> 
> your test case is same as in the patch below
> and the panic was due to null bdev (which matches
> in your logs).
> 
>   [RFC PATCH] btrfs: fix null pointer deference at
> btrfs_sysfs_add_one+0x105
> 
> 
> But in your logs below, there isn't a panic right ?
> wrong cut and paste ? or what did I miss?
> 
> 
> Thanks, Anand
> 
> 
> 
> On 02/06/14 11:40 AM, Thermionix wrote:
>> openSUSE 13.1 i686 8 device raid 10
>> when replacing a failed disk (new device is added)
>>
>> ~ # uname -r
>> 3.11.6-4-pae
>>
>> ~ # btrfs --version
>> Btrfs v3.12+20131125
>>
>> ~ # mount -o degraded /pool
>>
>> ~ # journalctl | tail
>>
>> Feb 06 12:22:51 store03 kernel: device label pool devid 4 transid 55050
>> /dev/sde
>> Feb 06 12:22:53 store03 kernel: btrfs: allowing degraded mounts
>> Feb 06 12:22:53 store03 kernel: btrfs: disk space caching is enabled
>> Feb 06 12:22:53 store03 kernel: btrfs: bdev (null) errs: wr 353, rd 1,
>> flush 17, corrupt 0, gen 0
>> Feb 06 12:23:16 store03 kernel: BTRFS debug (device sde): unlinked 1
>> orphans
>>
>> ~ # btrfs filesystem show /dev/disk/by-label/pool
>> Label: pool  uuid: 3e6ba20f-a4d0-40e4-88e7-a31c4930bcfe
>>          Total devices 9 FS bytes used 5.19TiB
>>          devid    1 size 1.36TiB used 169.50GiB path
>>          devid    2 size 1.82TiB used 1.62TiB path /dev/sdc
>>          devid    3 size 931.51GiB used 931.51GiB path /dev/sdd
>>          devid    4 size 931.51GiB used 931.51GiB path /dev/sde
>>          devid    6 size 1.82TiB used 1.62TiB path /dev/sdg
>>          devid    7 size 1.82TiB used 1.62TiB path /dev/sdh
>>          devid    8 size 931.51GiB used 931.51GiB path /dev/sdi
>>          devid    9 size 1.82TiB used 1.62TiB path /dev/sdf
>>          devid    10 size 1.82TiB used 1.01TiB path /dev/sdb
>>
>> ~ # btrfs device delete missing /pool
>>
>> ~ # journalctl -l | tail
>>
>> Feb 06 12:25:43 store03 kernel: btrfs: relocating block group
>> 10590585618432 flags 68
>> ...
>> Feb 06 12:47:23 store03 kernel:  [<c025ebd2>] kthread+0x92/0xa0
>> Feb 06 12:47:23 store03 kernel:  [<c06ece67>]
>> ret_from_kernel_thread+0x1b/0x28
>> Feb 06 12:47:23 store03 kernel:  [<c025eb40>]
>> kthread_create_on_node+0xd0/0xd0
>> Feb 06 12:47:23 store03 kernel: DWARF2 unwinder stuck at kthread+0x0/0xa0
>> Feb 06 12:47:23 store03 kernel: Feb 06 12:47:23 store03 kernel: Leftover
>> inexact backtrace:
>> Feb 06 12:47:23 store03 kernel: ---[ end trace c47f82d03f79250d ]---
>> Feb 06 12:47:23 store03 kernel: ------------[ cut here ]------------
>> Feb 06 12:47:23 store03 kernel: WARNING: CPU: 0 PID: 3028 at
>> /home/abuild/rpmbuild/BUILD/kernel-pae-3.11.6/linux-3.11/fs/btrfs/disk-io.c:482
>>
>> btree_csum_one_bio.isra.48+0x93/0x110 [btrfs]()
>> Feb 06 12:47:23 store03 kernel: Modules linked in: bonding hwmon_vid
>> btrfs raid6_pq zlib_deflate xor libcrc32c joydev hid_generic iTCO_wdt
>> iTCO_vendor_support coretemp pcspkr serio_raw i2c_i801 ata_generic
>> lpc_ich mfd_core usbhid mvsas libsas scsi_transport_sas e1000e ptp
>> pps_core shpchp mperf sg dm_mod autofs4 ata_piix uhci_hcd ehci_pci
>> ehci_hcd usbcore usb_common i915 fan thermal processor drm_kms_helper
>> drm i2c_algo_bit button video thermal_sys scsi_dh_hp_sw scsi_dh_emc
>> scsi_dh_rdac scsi_dh_alua scsi_dh
>> Feb 06 12:47:23 store03 kernel: CPU: 0 PID: 3028 Comm: btrfs-worker-2
>> Tainted: G        W    3.11.6-4-pae #1
>> Feb 06 12:47:23 store03 kernel: Hardware name: PhoenixAward 945GM/945GM,
>> BIOS 6.00 PG 08/13/2008
>> Feb 06 12:47:23 store03 kernel:  00000009 c06e075a 00000000 c0242c5e
>> c085dbc8 00000000 00000bd4 f8a06e34
>> Feb 06 12:47:23 store03 kernel:  000001e2 f8985503 f8985503 00000002
>> f5c60304 f2e606d8 c14ca4f0 c0242d1b
>> Feb 06 12:47:23 store03 kernel:  00000009 00000000 f8985503 ef93d4a0
>> f5c60304 e62a6c00 16c1f682 f46fe86c
>> Feb 06 12:47:23 store03 kernel: Call Trace:
>> Feb 06 12:47:23 store03 kernel:  [<c0204ef9>]
>> try_stack_unwind+0x179/0x190
>> Feb 06 12:47:23 store03 kernel:  [<c0203e17>] dump_trace+0x47/0xf0
>> Feb 06 12:47:23 store03 kernel:  [<c0204f4f>]
>> show_trace_log_lvl+0x3f/0x50
>> Feb 06 12:47:23 store03 kernel:  [<c0203f10>]
>> show_stack_log_lvl+0x50/0xd0
>> Feb 06 12:47:23 store03 kernel:  [<c0204f9f>] show_stack+0x1f/0x40
>> Feb 06 12:47:23 store03 kernel:  [<c06e075a>] dump_stack+0x3e/0x4e
>> Feb 06 12:47:23 store03 kernel:  [<c0242c5e>]
>> warn_slowpath_common+0x7e/0xa0
>> Feb 06 12:47:23 store03 kernel:  [<c0242d1b>]
>> warn_slowpath_null+0x1b/0x20
>> Feb 06 12:47:23 store03 kernel:  [<f8985503>]
>> btree_csum_one_bio.isra.48+0x93/0x110 [btrfs]
>> Feb 06 12:47:23 store03 kernel:  [<f898261f>]
>> run_one_async_start+0x2f/0x40 [btrfs]
>> Feb 06 12:47:23 store03 kernel:  [<f89bdcb7>] worker_loop+0x107/0x470
>> [btrfs]
>> Feb 06 12:47:23 store03 kernel:  [<c025ebd2>] kthread+0x92/0xa0
>> Feb 06 12:47:23 store03 kernel:  [<c06ece67>]
>> ret_from_kernel_thread+0x1b/0x28
>> Feb 06 12:47:23 store03 kernel:  [<c025eb40>]
>> kthread_create_on_node+0xd0/0xd0
>> Feb 06 12:47:23 store03 kernel: DWARF2 unwinder stuck at kthread+0x0/0xa0
>> Feb 06 12:47:23 store03 kernel: Feb 06 12:47:23 store03 kernel: Leftover
>> inexact backtrace:
>> Feb 06 12:47:23 store03 kernel: ---[ end trace c47f82d03f79250e ]---
>> Feb 06 12:47:23 store03 kernel: ------------[ cut here ]------------
>> ...
>>
>> kernel soon locks up, any advice on how to proceed?
>> any other info needed?
>> -- 
>> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
>> the body of a message to majord...@vger.kernel.org
>> More majordomo info at  http://vger.kernel.org/majordomo-info.html
>>
--
To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to