Re: BTRFS Kernel Panic

2013-04-14 Thread Alex Robbins

I was able to copy almost everything from dd images to a new set
of filesystems.  The only thing that was excluded from the transfer
was /var/cache/apt, because something in there was causing the problem.
I performed the transfer by booting into a second system.  Interestingly,
when the problem arose during the transfer, it was less serious: the
program that triggered it died and the kernel printed a call stack, but
apparently everything else continued to work.  I don't know why it was
less serious on the transfer system.  The affected filesystem was not the
/ of either system; it was /var, and so no more important to the first
system than it was to the system performing the transfer.
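
For anyone repeating this: the message above does not say which tool did
the copy.  One way to copy a tree while leaving out a single directory is a
tar pipe.  A minimal sketch, assuming GNU tar; copy_tree and the /mnt paths
in the usage note are made-up names for illustration only:

```shell
#!/bin/sh
# copy_tree SRC DST SKIP
# Copy everything under SRC into DST, except the path SKIP (relative to SRC).
# Assumes GNU tar, where --exclude="./SKIP" matches member names archived from ".".
copy_tree() {
    src=$1
    dst=$2
    skip=$3
    mkdir -p "$dst" || return 1
    # Stream an archive of SRC to stdout and unpack it in DST.
    # -p on extraction keeps the permissions recorded in the archive.
    (cd "$src" && tar -cf - --exclude="./$skip" .) | (cd "$dst" && tar -xpf -)
}
```

With the old image loop-mounted read-only at a hypothetical /mnt/old-var,
this would be called as: copy_tree /mnt/old-var /mnt/new-var cache/apt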


In any case, all appears to be well; I can run both aptitude update and
updatedb, and it hasn't crashed since.

Thank you for your help.



--
To UNSUBSCRIBE, email to debian-user-requ...@lists.debian.org 
with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org

Archive: http://lists.debian.org/516b5c2b.10...@gmail.com



Re: BTRFS Kernel Panic

2013-04-12 Thread Martin Steigerwald
On Thursday 11 April 2013 23:23:30 Roger Leigh wrote:
> You could also try upgrading to a newer kernel e.g. 3.8.x.  I've
> done this myself due to btrfs issues with older kernels; you might
> need to hand-build it though since Debian doesn't yet have it.
> See kernel-package.

Debian has it in experimental:

martin@merkaba:~> rmadison linux-image-3.8-trunk-amd64
 linux-image-3.8-trunk-amd64 | 3.8.5-1~experimental.1 | experimental | amd64, i386

;)

I also recommend running current kernels with BTRFS.
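
A quick way to check whether a machine is already on 3.8 or later is to
compare version strings.  A rough sketch, assuming a sort that supports -V
(GNU coreutils does); kernel_at_least is a hypothetical helper name:

```shell
#!/bin/sh
# kernel_at_least WANT [HAVE]
# Succeeds if HAVE (default: the running kernel from uname -r) is at least WANT.
kernel_at_least() {
    want=$1
    have=${2:-$(uname -r)}
    # Drop the distribution suffix, e.g. 3.2.0-4-amd64 -> 3.2.0
    have=${have%%-*}
    # Version-sort both strings; if WANT sorts first (or they are equal),
    # the kernel in HAVE is new enough.
    lowest=$(printf '%s\n%s\n' "$want" "$have" | sort -V | head -n 1)
    [ "$lowest" = "$want" ]
}
```

For example, "kernel_at_least 3.8 && echo ok" tests the running kernel.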

-- 
Martin 'Helios' Steigerwald - http://www.Lichtvoll.de
GPG: 03B0 0D6C 0040 0710 4AFA  B82F 991B EAAC A599 84C7


-- 
Archive: http://lists.debian.org/29696065.TvIoNG5Q5r@merkaba



Re: BTRFS Kernel Panic

2013-04-11 Thread Roger Leigh
On Thu, Apr 11, 2013 at 03:50:19PM -0500, Alex Robbins wrote:
> So it looks like btrfs really is still experimental.
> 
> I have / and /var on two separate btrfs partitions and I consistently get
> a kernel panic when I run "aptitude update" (see the end).  I also managed
> to get the same results by using find and dd to read every file in /var.
> 
> Although my oops output does have RIP: btrfs_num_copies, I do not think that
> I need to use btrfs-zero-log because (1) I am using a 3.2 series kernel, (2)
> my panic callstack does not have a function that begins with "replay_one_",
> and (3) I am able to mount the partitions (and boot) just fine.  It is only
> when some process such as aptitude or updatedb (indexing for the locate
> command) tries to read something in the filesystem.

I'd suggest that you first:

1) take a full image of the raw devices containing these filesystems
   with dd in case of further damage
2) try to back up the filesystems with tar

If you get a backup, you could then reformat the partitions and
restore the backups.
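
The two steps above could be sketched as follows.  This is an illustration
rather than a tested recovery procedure: image_device and backup_tree are
hypothetical helper names, and the device and output paths in the usage
note are placeholders.

```shell
#!/bin/sh
# image_device SRC IMG
# Raw byte-for-byte copy of a block device (or file) SRC into the file IMG.
# Best run from a rescue system with the filesystem on SRC unmounted.
image_device() {
    dd if="$1" of="$2" bs=4M
}

# backup_tree DIR ARCHIVE
# File-level backup of DIR into the tar archive ARCHIVE, keeping permissions.
# ARCHIVE should live outside DIR so the archive does not include itself.
backup_tree() {
    tar -C "$1" -cpf "$2" .
}
```

Usage would look like "image_device /dev/sdXN /mnt/backup/var.img" followed
by "backup_tree /var /mnt/backup/var.tar", with /dev/sdXN standing in for
the real partition.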

I'd definitely recommend looking at the changelog for newer
versions of the Linux kernel to see if this has been fixed.  It
also might be worth contacting the btrfs developers' mailing
list to see if they have seen this issue before--the BUG alone
might be useful.

You could also try upgrading to a newer kernel e.g. 3.8.x.  I've
done this myself due to btrfs issues with older kernels; you might
need to hand-build it though since Debian doesn't yet have it.
See kernel-package.

[I've previously lost a large raid-1 btrfs filesystem--both copies
completely trashed after a SATA glitch, including the good copy
which was corrupted entirely by btrfs...  Currently using it on an
SSD for testing purposes as my rootfs, but don't use it for user
data, which is on ext4 on LVM RAID.]


Regards,
Roger

-- 
  .''`.  Roger Leigh
 : :' :  Debian GNU/Linux    http://people.debian.org/~rleigh/
 `. `'   schroot and sbuild  http://alioth.debian.org/projects/buildd-tools
   `-    GPG Public Key      F33D 281D 470A B443 6756 147C 07B3 C8BC 4083 E800


-- 
Archive: http://lists.debian.org/20130411222330.ga23...@codelibre.net



BTRFS Kernel Panic

2013-04-11 Thread Alex Robbins

So it looks like btrfs really is still experimental.

I have / and /var on two separate btrfs partitions and I consistently get
a kernel panic when I run "aptitude update" (see the end).  I also managed
to get the same results by using find and dd to read every file in /var.
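
The exact command is not given here; a read-everything pass of this kind
can be sketched as below.  read_all_files is a hypothetical name and the
1M block size is an arbitrary choice.

```shell
#!/bin/sh
# read_all_files DIR
# Read every regular file under DIR and discard the data, which forces the
# filesystem to fetch each file's blocks.  -xdev keeps find on one filesystem.
# Prints a line for each file that dd could not read completely.
read_all_files() {
    find "$1" -xdev -type f -exec sh -c '
        for f do
            dd if="$f" of=/dev/null bs=1M 2>/dev/null || echo "unreadable: $f"
        done' sh {} +
}
```

On a healthy tree, "read_all_files /var" prints nothing.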

Although my oops output does have RIP: btrfs_num_copies, I do not think that
I need to use btrfs-zero-log because (1) I am using a 3.2 series kernel, (2)
my panic callstack does not have a function that begins with "replay_one_",
and (3) I am able to mount the partitions (and boot) just fine.  The panic
occurs only when some process such as aptitude or updatedb (indexing for
the locate command) tries to read something in the filesystem.

I tried scrubbing the filesystems, but no errors were found (and somehow no
panics occurred).  Finally, I can't really see what's going on with an strace
or anything because the panic happens before anything gets written to the
disk (even if I do an emergency sync and remount-ro with sysrq before
rebooting), and I *really* don't want to install netconsole because I am
worried that any package management will cause a panic during an
installation, which could make my situation much worse.

I have run out of ideas and am looking for suggestions.

The panic from /var/log/kern.log:
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.632188] parent transid verify failed on 18446277369678266402 wanted 18446277756225214326 found 0
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.632203] parent transid verify failed on 18446277369678266402 wanted 18446277756225214326 found 0
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.632278] [ cut here ]
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.632331] kernel BUG at /build/buildd-linux_3.2.39-2-amd64-G5_nN0/linux-3.2.39/fs/btrfs/volumes.c:2860!
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.632412] invalid opcode:  [#1] SMP
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.632458] CPU 0
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.632478] Modules linked in: hidp cryptd aes_x86_64 aes_generic nbd ip6table_filter ip6_tables iptable_filter ip_tables ebtable_nat ebtables x_tables parport_pc ppdev lp parport rfcomm bnep cpufreq_userspace cpufreq_stats cpufreq_powersave cpufreq_conservative pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) binfmt_misc uinput fuse nfsd nfs nfs_acl auth_rpcgss fscache lockd sunrpc loop dm_crypt sg uvcvideo videodev v4l2_compat_ioctl32 media btusb bluetooth crc16 joydev ata_generic acpi_cpufreq i915 mperf r592 dell_laptop ata_piix coretemp drm_kms_helper dcdbas drm i2c_algo_bit memstick snd_hda_codec_idt snd_hda_codec_hdmi pcspkr arc4 firewire_ohci firewire_core crc_itu_t iwl3945 iwl_legacy snd_hda_intel mac80211 snd_hda_codec sky2 snd_hwdep snd_pcm snd_page_alloc psmouse serio_raw cfg80211 rfkill r852 sm_common nand nand_ecc nand_ids mtd dell_wmi snd_seq snd_seq_device sparse_keymap snd_timer sdhci_pci sdhci snd iTCO_wdt iTCO_vendor_support wmi soundcore processor mm
Apr  7 17:39:29 alexdell2-wheezy kernel: c_core i2c_i801 video battery ac i2c_core evdev power_supply button btrfs crc32c libcrc32c zlib_deflate dm_mod usb_storage usbhid hid sl811_hcd ohci_hcd sr_mod cdrom sd_mod crc_t10dif thermal ahci libahci libata thermal_sys uhci_hcd scsi_mod ehci_hcd usbcore usb_common
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.633954]
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.633972] Pid: 6448, comm: aptitude Tainted: G   O 3.2.0-4-amd64 #1 Debian 3.2.39-2 Dell Inc. Inspiron 1525   /0WP007
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634090] RIP: 0010:[]  [] btrfs_num_copies+0x3f/0x89 [btrfs]
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634196] RSP: 0018:88004dd1d9b8  EFLAGS: 00010246
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634244] RAX:  RBX: fffe57890022 RCX: 0001
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634304] RDX:  RSI: fffe57890022 RDI: 880115079110
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634363] RBP: 880115079110 R08:  R09:
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634424] R10: 88007972a6d8 R11: 88007972a6d8 R12:
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634484] R13:  R14: fffe57e2fffe5776 R15: 880116877820
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634545] FS:  7f2b8c238760() GS:88011fc0() knlGS:
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634615] CS:  0010 DS:  ES:  CR0: 80050033
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634664] CR2: 7f2b8a913f50 CR3: 4dd1a000 CR4: 06f0
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634724] DR0:  DR1:  DR2:
Apr  7 17:39:29 alexdell2-wheezy kernel: [ 7290.634784] DR3:  DR6: 000