Those are the last useful log outputs before the server locks up. Digging in /var/log/messages, you can see it stopped logging at 12:47, and I hard reset at 15:07.
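The length of the logging gap can be read straight off the syslog timestamps. Here is a minimal Python sketch, assuming each line starts with an ISO 8601 timestamp (as rsyslog writes them in /var/log/messages here) and that the NUL padding left by the hard reset may precede the first line after reboot. `largest_gap` is a hypothetical helper name, and the three sample lines are copied from the log in this message.

```python
from datetime import datetime, timedelta


def largest_gap(lines):
    """Return (gap, last_before, first_after) for the widest pause between
    consecutive timestamped lines. Lines without a leading ISO 8601
    timestamp are skipped."""
    prev = None
    best = (timedelta(0), None, None)
    for line in lines:
        # Strip NUL padding that an unclean shutdown can leave in the file.
        stamp = line.lstrip("\x00").split(" ", 1)[0]
        try:
            ts = datetime.fromisoformat(stamp)
        except ValueError:
            continue
        if prev is not None and ts - prev > best[0]:
            best = (ts - prev, prev, ts)
        prev = ts
    return best


sample = [
    '2014-02-06T12:47:47.590922+11:00 store03 kernel: [ 4619.769555] [<f898261f>] run_one_async_start+0x2f/0x40 [btrfs]',
    '2014-02-06T12:47:47.590924+11:00 store03 kernel: [ 4619.769630] [<f89bdcb7>] worker_loop+0x107/0x470 [btrfs]',
    # Three NUL bytes stand in for the longer run in the real file.
    "\x00" * 3 + '2014-02-06T15:07:05.120258+11:00 store03 rsyslogd: [origin software="rsyslogd" swVersion="7.4.7" x-pid="418" x-info="http://www.rsyslog.com"] start',
]

gap, last_before, first_after = largest_gap(sample)
print(gap)  # 2:19:17.529334
```

On the full file, something like `largest_gap(open("/var/log/messages", errors="replace"))` should work the same way, since the helper only looks at the first whitespace-delimited field of each line.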
Maybe I should have specified hard lock-up instead of panic.

2014-02-06T12:47:47.590784+11:00 store03 kernel: [ 4619.769346] ------------[ cut here ]------------
2014-02-06T12:47:47.590785+11:00 store03 kernel: [ 4619.769369] WARNING: CPU: 0 PID: 3005 at /home/abuild/rpmbuild/BUILD/kernel-pae-3.11.6/linux-3.11/fs/btrfs/disk-io.c:482 btree_csum_one_bio.isra.48+0x93/0x110 [btrfs]()
2014-02-06T12:47:47.590893+11:00 store03 kernel: [ 4619.769399] Modules linked in: bonding hwmon_vid btrfs raid6_pq zlib_deflate xor libcrc32c joydev hid_generic iTCO_wdt iTCO_vendor_support coretemp pcspkr serio_raw i2c_i801 ata_generic lpc_ich mfd_core usbhid mvsas libsas scsi_transport_sas e1000e ptp pps_core shpchp mperf sg dm_mod autofs4 ata_piix uhci_hcd ehci_pci ehci_hcd usbcore usb_common i915 fan thermal processor drm_kms_helper drm i2c_algo_bit button video thermal_sys scsi_dh_hp_sw scsi_dh_emc scsi_dh_rdac scsi_dh_alua scsi_dh
2014-02-06T12:47:47.590896+11:00 store03 kernel: [ 4619.769402] CPU: 0 PID: 3005 Comm: btrfs-worker-1 Tainted: G W 3.11.6-4-pae #1
2014-02-06T12:47:47.590898+11:00 store03 kernel: [ 4619.769403] Hardware name: PhoenixAward 945GM/945GM, BIOS 6.00 PG 08/13/2008
2014-02-06T12:47:47.590899+11:00 store03 kernel: [ 4619.769407] 00000009 c06e075a 00000000 c0242c5e c085dbc8 00000000 00000bbd f8a06e34
2014-02-06T12:47:47.590901+11:00 store03 kernel: [ 4619.769411] 000001e2 f8985503 f8985503 0000000d f5abaa5c d7c20a5c f1c97070 c0242d1b
2014-02-06T12:47:47.590903+11:00 store03 kernel: [ 4619.769415] 00000009 00000000 f8985503 e85afce0 f5abaa5c e62a6c00 16c205ed c6b5476c
2014-02-06T12:47:47.590905+11:00 store03 kernel: [ 4619.769415] Call Trace:
2014-02-06T12:47:47.590906+11:00 store03 kernel: [ 4619.769424] [<c0204ef9>] try_stack_unwind+0x179/0x190
2014-02-06T12:47:47.590908+11:00 store03 kernel: [ 4619.769430] [<c0203e17>] dump_trace+0x47/0xf0
2014-02-06T12:47:47.590910+11:00 store03 kernel: [ 4619.769434] [<c0204f4f>] show_trace_log_lvl+0x3f/0x50
2014-02-06T12:47:47.590911+11:00 store03 kernel: [ 4619.769437] [<c0203f10>] show_stack_log_lvl+0x50/0xd0
2014-02-06T12:47:47.590913+11:00 store03 kernel: [ 4619.769441] [<c0204f9f>] show_stack+0x1f/0x40
2014-02-06T12:47:47.590915+11:00 store03 kernel: [ 4619.769445] [<c06e075a>] dump_stack+0x3e/0x4e
2014-02-06T12:47:47.590917+11:00 store03 kernel: [ 4619.769450] [<c0242c5e>] warn_slowpath_common+0x7e/0xa0
2014-02-06T12:47:47.590918+11:00 store03 kernel: [ 4619.769454] [<c0242d1b>] warn_slowpath_null+0x1b/0x20
2014-02-06T12:47:47.590920+11:00 store03 kernel: [ 4619.769472] [<f8985503>] btree_csum_one_bio.isra.48+0x93/0x110 [btrfs]
2014-02-06T12:47:47.590922+11:00 store03 kernel: [ 4619.769555] [<f898261f>] run_one_async_start+0x2f/0x40 [btrfs]
2014-02-06T12:47:47.590924+11:00 store03 kernel: [ 4619.769630] [<f89bdcb7>] worker_loop+0x107/0x470 [btrfs]
^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@2014-02-06T15:07:05.120258+11:00 store03 rsyslogd: [origin software="rsyslogd" swVersion="7.4.7" x-pid="418" x-info="http://www.rsyslog.com"] start
2014-02-06T15:07:05.127408+11:00 store03 kernel: [ 0.000000] Initializing cgroup subsys cpuset

As I now can't mount (open_ctree failed), should I be mounting with -o recovery?

On 06/02/14 15:35, Anand Jain wrote:
>
>
> your test case is same as in the patch below
> and the panic was due to null bdev (which matches
> in your logs).
>
> [RFC PATCH] btrfs: fix null pointer deference at
> btrfs_sysfs_add_one+0x105
>
>
> But in your logs below, there isn't a panic right ?
> wrong cut and paste ? or what did I miss?
>
>
> Thanks, Anand
>
>
>
> On 02/06/14 11:40 AM, Thermionix wrote:
>> openSUSE 13.1 i686 8 device raid 10
>> when replacing a failed disk (new device is added)
>>
>> ~ # uname -r
>> 3.11.6-4-pae
>>
>> ~ # btrfs --version
>> Btrfs v3.12+20131125
>>
>> ~ # mount -o degraded /pool
>>
>> ~ # journalctl | tail
>>
>> Feb 06 12:22:51 store03 kernel: device label pool devid 4 transid 55050
>> /dev/sde
>> Feb 06 12:22:53 store03 kernel: btrfs: allowing degraded mounts
>> Feb 06 12:22:53 store03 kernel: btrfs: disk space caching is enabled
>> Feb 06 12:22:53 store03 kernel: btrfs: bdev (null) errs: wr 353, rd 1,
>> flush 17, corrupt 0, gen 0
>> Feb 06 12:23:16 store03 kernel: BTRFS debug (device sde): unlinked 1
>> orphans
>>
>> ~ # btrfs filesystem show /dev/disk/by-label/pool
>> Label: pool uuid: 3e6ba20f-a4d0-40e4-88e7-a31c4930bcfe
>> Total devices 9 FS bytes used 5.19TiB
>> devid 1 size 1.36TiB used 169.50GiB path
>> devid 2 size 1.82TiB used 1.62TiB path /dev/sdc
>> devid 3 size 931.51GiB used 931.51GiB path /dev/sdd
>> devid 4 size 931.51GiB used 931.51GiB path /dev/sde
>> devid 6 size 1.82TiB used 1.62TiB path /dev/sdg
>> devid 7 size 1.82TiB used 1.62TiB path /dev/sdh
>> devid 8 size 931.51GiB used 931.51GiB path /dev/sdi
>> devid 9 size 1.82TiB used 1.62TiB path /dev/sdf
>> devid 10 size 1.82TiB used 1.01TiB path /dev/sdb
>>
>> ~ # btrfs device delete missing /pool
>>
>> ~ # journalctl -l | tail
>>
>> Feb 06 12:25:43 store03 kernel: btrfs: relocating block group
>> 10590585618432 flags 68
>> ...
>> Feb 06 12:47:23 store03 kernel: [<c025ebd2>] kthread+0x92/0xa0
>> Feb 06 12:47:23 store03 kernel: [<c06ece67>]
>> ret_from_kernel_thread+0x1b/0x28
>> Feb 06 12:47:23 store03 kernel: [<c025eb40>]
>> kthread_create_on_node+0xd0/0xd0
>> Feb 06 12:47:23 store03 kernel: DWARF2 unwinder stuck at kthread+0x0/0xa0
>> Feb 06 12:47:23 store03 kernel: Feb 06 12:47:23 store03 kernel: Leftover
>> inexact backtrace:
>> Feb 06 12:47:23 store03 kernel: ---[ end trace c47f82d03f79250d ]---
>> Feb 06 12:47:23 store03 kernel: ------------[ cut here ]------------
>> Feb 06 12:47:23 store03 kernel: WARNING: CPU: 0 PID: 3028 at
>> /home/abuild/rpmbuild/BUILD/kernel-pae-3.11.6/linux-3.11/fs/btrfs/disk-io.c:482
>>
>> btree_csum_one_bio.isra.48+0x93/0x110 [btrfs]()
>> Feb 06 12:47:23 store03 kernel: Modules linked in: bonding hwmon_vid
>> btrfs raid6_pq zlib_deflate xor libcrc32c joydev hid_generic iTCO_wdt
>> iTCO_vendor_support coretemp pcspkr serio_raw i2c_i801 ata_generic
>> lpc_ich mfd_core usbhid mvsas libsas scsi_transport_sas e1000e ptp
>> pps_core shpchp mperf sg dm_mod autofs4 ata_piix uhci_hcd ehci_pci
>> ehci_hcd usbcore usb_common i915 fan thermal processor drm_kms_helper
>> drm i2c_algo_bit button video thermal_sys scsi_dh_hp_sw scsi_dh_emc
>> scsi_dh_rdac scsi_dh_alua scsi_dh
>> Feb 06 12:47:23 store03 kernel: CPU: 0 PID: 3028 Comm: btrfs-worker-2
>> Tainted: G W 3.11.6-4-pae #1
>> Feb 06 12:47:23 store03 kernel: Hardware name: PhoenixAward 945GM/945GM,
>> BIOS 6.00 PG 08/13/2008
>> Feb 06 12:47:23 store03 kernel: 00000009 c06e075a 00000000 c0242c5e
>> c085dbc8 00000000 00000bd4 f8a06e34
>> Feb 06 12:47:23 store03 kernel: 000001e2 f8985503 f8985503 00000002
>> f5c60304 f2e606d8 c14ca4f0 c0242d1b
>> Feb 06 12:47:23 store03 kernel: 00000009 00000000 f8985503 ef93d4a0
>> f5c60304 e62a6c00 16c1f682 f46fe86c
>> Feb 06 12:47:23 store03 kernel: Call Trace:
>> Feb 06 12:47:23 store03 kernel: [<c0204ef9>]
>> try_stack_unwind+0x179/0x190
>> Feb 06 12:47:23 store03 kernel: [<c0203e17>] dump_trace+0x47/0xf0
>> Feb 06 12:47:23 store03 kernel: [<c0204f4f>]
>> show_trace_log_lvl+0x3f/0x50
>> Feb 06 12:47:23 store03 kernel: [<c0203f10>]
>> show_stack_log_lvl+0x50/0xd0
>> Feb 06 12:47:23 store03 kernel: [<c0204f9f>] show_stack+0x1f/0x40
>> Feb 06 12:47:23 store03 kernel: [<c06e075a>] dump_stack+0x3e/0x4e
>> Feb 06 12:47:23 store03 kernel: [<c0242c5e>]
>> warn_slowpath_common+0x7e/0xa0
>> Feb 06 12:47:23 store03 kernel: [<c0242d1b>]
>> warn_slowpath_null+0x1b/0x20
>> Feb 06 12:47:23 store03 kernel: [<f8985503>]
>> btree_csum_one_bio.isra.48+0x93/0x110 [btrfs]
>> Feb 06 12:47:23 store03 kernel: [<f898261f>]
>> run_one_async_start+0x2f/0x40 [btrfs]
>> Feb 06 12:47:23 store03 kernel: [<f89bdcb7>] worker_loop+0x107/0x470
>> [btrfs]
>> Feb 06 12:47:23 store03 kernel: [<c025ebd2>] kthread+0x92/0xa0
>> Feb 06 12:47:23 store03 kernel: [<c06ece67>]
>> ret_from_kernel_thread+0x1b/0x28
>> Feb 06 12:47:23 store03 kernel: [<c025eb40>]
>> kthread_create_on_node+0xd0/0xd0
>> Feb 06 12:47:23 store03 kernel: DWARF2 unwinder stuck at kthread+0x0/0xa0
>> Feb 06 12:47:23 store03 kernel: Feb 06 12:47:23 store03 kernel: Leftover
>> inexact backtrace:
>> Feb 06 12:47:23 store03 kernel: ---[ end trace c47f82d03f79250e ]---
>> Feb 06 12:47:23 store03 kernel: ------------[ cut here ]------------
>> ...
>>
>> kernel soon locks up, any advice on how to proceed?
>> any other info needed?
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
>> the body of a message to majord...@vger.kernel.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html