Bug#498505: mdadm: Bug still not fixed

2008-11-03 Thread martin f krafft
also sprach Yauhen Kharuzhy <[EMAIL PROTECTED]> [2008.11.03.1235 +0100]:
> The problem is still not fixed in 2.6.7.1-1. I started reshaping a RAID5
> array and rebooted the machine. After that, mdadm --assemble segfaults.
> While the machine was powered off, some drives were moved to other SATA
> connectors, so the device names in Linux changed accordingly (/dev/sdg
> became /dev/sdd).

Does the problem go away if you pass the superblock version on the
command line, as suggested by Steinar, the original reporter of bug
498505?

-- 
 .''`.   martin f. krafft <[EMAIL PROTECTED]>
: :'  :  proud Debian developer, author, administrator, and user
`. `'`   http://people.debian.org/~madduck - http://debiansystem.info
  `-  Debian - when you have better things to do than fixing systems




Bug#498505: mdadm: Bug still not fixed

2008-11-03 Thread Yauhen Kharuzhy
Package: mdadm
Version: 2.6.7.1-1
Followup-For: Bug #498505


The problem is still not fixed in 2.6.7.1-1. I started reshaping a RAID5
array and rebooted the machine. After that, mdadm --assemble segfaults.
While the machine was powered off, some drives were moved to other SATA
connectors, so the device names in Linux changed accordingly (/dev/sdg
became /dev/sdd).

# gdb mdadm
GNU gdb 6.8-debian
Copyright (C) 2008 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later 
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.  Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-linux-gnu"...
(gdb) set args --assemble /dev/md0 /dev/sdd2 /dev/sde2 /dev/sdf2 /dev/sdh2 missing
(gdb) run
Starting program: /tmp/mdadm-2.6.7.1/mdadm --assemble /dev/md0 /dev/sdd2 /dev/sde2 /dev/sdf2 /dev/sdh2 missing

Program received signal SIGSEGV, Segmentation fault.
0x0041c733 in match_home0 (st=0x637430, homehost=0x7fff52e0f620 "gamayun") at super0.c:326
326 return (memcmp(&sb->set_uuid2, hash, 8)==0);
(gdb) bt
#0  0x0041c733 in match_home0 (st=0x637430, homehost=0x7fff52e0f620 "gamayun") at super0.c:326
#1  0x0040d42b in Assemble (st=0x637430, mddev=0x7fff52e10e84 "/dev/md0", mdfd=5, ident=0x7fff52e0f730, devlist=0x637030, backup_file=0x0, readonly=0, runstop=0, update=0x0, homehost=0x7fff52e0f620 "gamayun", verbose=0, force=0) at Assemble.c:338
#2  0x004055a5 in main (argc=8, argv=0x7fff52e0fa88) at mdadm.c:1027
(gdb) quit

[EMAIL PROTECTED]:/tmp/mdadm-2.6.7.1# mdadm --misc -E /dev/sdd2
/dev/sdd2:
          Magic : a92b4efc
        Version : 00.91.00
           UUID : c70787d0:6a2b1ffc:7cd3e4dc:cd9c5167
  Creation Time : Sat Nov  1 15:01:30 2008
     Raid Level : raid5
  Used Dev Size : 478608384 (456.44 GiB 490.09 GB)
     Array Size : 1914433536 (1825.75 GiB 1960.38 GB)
   Raid Devices : 5
  Total Devices : 4
Preferred Minor : 0

  Reshape pos'n : 94130432 (89.77 GiB 96.39 GB)
  Delta Devices : 1 (4->5)

    Update Time : Mon Nov  3 10:56:31 2008
          State : clean
 Active Devices : 4
Working Devices : 4
 Failed Devices : 1
  Spare Devices : 0
       Checksum : deacb78b - correct
         Events : 60660

         Layout : left-symmetric
     Chunk Size : 64K

      Number   Major   Minor   RaidDevice State
this     1       8       98        1      active sync

   0     0       8       66        0      active sync   /dev/sde2
   1     1       8       98        1      active sync
   2     2       8      114        2      active sync   /dev/sdh2
   3     3       8       82        3      active sync   /dev/sdf2
   4     4       0        0        4      faulty removed
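
Editorial note: the member examined here is recorded in the superblock as major 8, minor 98 (RaidDevice 1), and no device name is printed next to it. That is consistent with the report that /dev/sdg became /dev/sdd: under the conventional Linux numbering for SCSI/SATA disks (major 8, 16 minors per disk), minor 98 is the second partition of the seventh disk, i.e. sdg2. The sketch below only illustrates that arithmetic; `sd_name` is a hypothetical helper, not an mdadm function, and it covers only major 8.

```c
#include <assert.h>
#include <stdio.h>

/* Major 8 covers the first 16 SCSI/SATA disks (sda..sdp). Each disk owns
 * 16 consecutive minors: minor % 16 == 0 is the whole disk, 1..15 are
 * its partitions. */
#define SD_MAJOR           8
#define SD_MINORS_PER_DISK 16

/* Hypothetical helper (not part of mdadm): write the conventional name
 * for (major 8, minor) into buf. Returns 0 on success, -1 if the pair is
 * outside the range handled here. */
static int sd_name(int major, int minor, char *buf, size_t len)
{
	assert(buf != NULL);
	if (major != SD_MAJOR || minor < 0 || minor > 255)
		return -1;

	char disk = (char)('a' + minor / SD_MINORS_PER_DISK);
	int part = minor % SD_MINORS_PER_DISK;

	if (part == 0)
		snprintf(buf, len, "sd%c", disk);
	else
		snprintf(buf, len, "sd%c%d", disk, part);
	return 0;
}
```

Calling `sd_name(8, 98, buf, sizeof buf)` fills `buf` with `sdg2`; minors 66, 114 and 82 map to `sde2`, `sdh2` and `sdf2`, matching the names shown in the table.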

[EMAIL PROTECTED]:/tmp/mdadm-2.6.7.1# mdadm --misc -E /dev/sde2
/dev/sde2:
          Magic : a92b4efc
        Version : 00.91.00
           UUID : c70787d0:6a2b1ffc:7cd3e4dc:cd9c5167
  Creation Time : Sat Nov  1 15:01:30 2008
     Raid Level : raid5
  Used Dev Size : 478608384 (456.44 GiB 490.09 GB)
     Array Size : 1914433536 (1825.75 GiB 1960.38 GB)
   Raid Devices : 5
  Total Devices : 4
Preferred Minor : 0

  Reshape pos'n : 94130432 (89.77 GiB 96.39 GB)
  Delta Devices : 1 (4->5)

    Update Time : Mon Nov  3 10:56:31 2008
          State : clean
 Active Devices : 4
Working Devices : 4
 Failed Devices : 1
  Spare Devices : 0
       Checksum : deacb769 - correct
         Events : 60660

         Layout : left-symmetric
     Chunk Size : 64K

      Number   Major   Minor   RaidDevice State
this     0       8       66        0      active sync   /dev/sde2

   0     0       8       66        0      active sync   /dev/sde2
   1     1       8       98        1      active sync
   2     2       8      114        2      active sync   /dev/sdh2
   3     3       8       82        3      active sync   /dev/sdf2
   4     4       0        0        4      faulty removed
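
Editorial note: the sizes reported above are consistent with the superblock already describing the post-reshape, five-device layout. For RAID5, one member's worth of space holds parity, so the usable size is (Raid Devices - 1) times Used Dev Size, and 4 x 478608384 KiB gives the reported Array Size of 1914433536 KiB. A minimal sketch of that check follows; `raid5_array_kib` is a hypothetical helper, not mdadm code.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical helper (not mdadm code): usable capacity of a RAID5 array
 * in KiB, given the member count and the per-member size in KiB. One
 * member's worth of space is consumed by parity. */
static uint64_t raid5_array_kib(unsigned raid_devices, uint64_t used_dev_kib)
{
	assert(raid_devices >= 2);
	return (uint64_t)(raid_devices - 1) * used_dev_kib;
}
```

With the values above, `raid5_array_kib(5, 478608384)` yields 1914433536, while the same members in the original four-device layout would give 1435825152.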
[EMAIL PROTECTED]:/tmp/mdadm-2.6.7.1# mdadm --misc -E /dev/sdf2
/dev/sdf2:
          Magic : a92b4efc
        Version : 00.91.00
           UUID : c70787d0:6a2b1ffc:7cd3e4dc:cd9c5167
  Creation Time : Sat Nov  1 15:01:30 2008
     Raid Level : raid5
  Used Dev Size : 478608384 (456.44 GiB 490.09 GB)
     Array Size : 1914433536 (1825.75 GiB 1960.38 GB)
   Raid Devices : 5
  Total Devices : 4
Preferred Minor : 0

  Reshape pos'n : 94130432 (89.77 GiB 96.39 GB)
  Delta Devices : 1 (4->5)

    Update Time : Mon Nov  3 10:56:31 2008
          State : clean
 Active Devices : 4
Working Devices : 4
 Failed Devices : 1
  Spare Devices : 0
       Checksum : deacb77f - correct
         Events : 60660

         Layout : left-symmetric
     Chunk Size : 64K

      Number   Major   Minor   RaidDevice State
this     3       8       82        3      active sync   /dev/sdf2

   0     0       8       66        0      active sync   /dev/sde2
   1     1       8       98        1      active sync
   2     2  

Bug#498505: mdadm: Bug still not fixed

2008-11-05 Thread Neil Brown
On Monday November 3, [EMAIL PROTECTED] wrote:
> Package: mdadm
> Version: 2.6.7.1-1
> Followup-For: Bug #498505
> 
> 
> The problem is still not fixed in 2.6.7.1-1. I started reshaping a RAID5
> array and rebooted the machine. After that, mdadm --assemble segfaults.
> While the machine was powered off, some drives were moved to other SATA
> connectors, so the device names in Linux changed accordingly (/dev/sdg
> became /dev/sdd).

This bug is fixed by commit 56f8add211a840faaed325bd16483b55da544e93
which is scheduled to be in 2.6.8, but was not included in 2.6.7.1.

I include it below.

NeilBrown

From 56f8add211a840faaed325bd16483b55da544e93 Mon Sep 17 00:00:00 2001
From: Neil Brown <[EMAIL PROTECTED]>
Date: Thu, 19 Jun 2008 16:30:36 +1000
Subject: [PATCH] Fix an error when assembling arrays that are in the middle of a reshape.

It is important that dup_super always returns an 'st' with the same
->ss and ->minor_version as the st that was passed.
This wasn't happening for 0.91 metadata (i.e. in the middle of a reshape).
---
 super0.c |    5 ++++-
 1 files changed, 4 insertions(+), 1 deletions(-)

diff --git a/super0.c b/super0.c
index 7e81482..8e4c568 100644
--- a/super0.c
+++ b/super0.c
@@ -849,12 +849,15 @@ static struct supertype *match_metadata_desc0(char *arg)
 	st->sb = NULL;
 	if (strcmp(arg, "0") == 0 ||
 	    strcmp(arg, "0.90") == 0 ||
-	    strcmp(arg, "0.91") == 0 ||
 	    strcmp(arg, "default") == 0 ||
 	    strcmp(arg, "") == 0 /* no metadata */
 		)
 		return st;
 
+	st->minor_version = 91; /* reshape in progress */
+	if (strcmp(arg, "0.91") == 0) /* For dup_super support */
+		return st;
+
 	st->minor_version = 9; /* flag for 'byte-swapped' */
 	if (strcmp(arg, "0.swap")==0 ||
 	    strcmp(arg, "0.9") == 0) /* For dup_super support */
-- 
1.5.6.5
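
Editorial note: the effect of the patch can be seen by comparing which minor version each metadata name maps to before and after it. The sketch below is a simplified stand-in, not mdadm code: it returns only the `minor_version` value instead of a `struct supertype`, returns -1 for names that are not handled, and assumes that `minor_version` is initialised to 90 just above the lines shown in the hunk (that initialisation is not visible in the diff).

```c
#include <assert.h>
#include <string.h>

/* Simplified stand-ins for the name matching in match_metadata_desc0().
 * Each returns the minor_version the resulting 'st' would carry, or -1
 * if the name is not recognised here. The value 90 for the default case
 * is an assumption; it is set outside the hunk shown in the patch. */

/* Before the patch: "0.91" is accepted together with the 0.90 names, so
 * the returned st carries minor version 90. */
static int minor_before(const char *arg)
{
	assert(arg != NULL);
	if (strcmp(arg, "0") == 0 || strcmp(arg, "0.90") == 0 ||
	    strcmp(arg, "0.91") == 0 || strcmp(arg, "default") == 0 ||
	    strcmp(arg, "") == 0)
		return 90;
	if (strcmp(arg, "0.swap") == 0 || strcmp(arg, "0.9") == 0)
		return 9;	/* flag for 'byte-swapped' */
	return -1;
}

/* After the patch: "0.91" gets its own branch, so a duplicate of a 0.91
 * (mid-reshape) st keeps minor version 91. */
static int minor_after(const char *arg)
{
	assert(arg != NULL);
	if (strcmp(arg, "0") == 0 || strcmp(arg, "0.90") == 0 ||
	    strcmp(arg, "default") == 0 || strcmp(arg, "") == 0)
		return 90;
	if (strcmp(arg, "0.91") == 0)
		return 91;	/* reshape in progress */
	if (strcmp(arg, "0.swap") == 0 || strcmp(arg, "0.9") == 0)
		return 9;	/* flag for 'byte-swapped' */
	return -1;
}
```

Under these assumptions `minor_before("0.91")` gives 90 while `minor_after("0.91")` gives 91, which is the mismatch the commit message describes for dup_super on an array in the middle of a reshape.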







Bug#498505: mdadm: Bug still not fixed

2008-11-08 Thread martin f krafft
also sprach Neil Brown <[EMAIL PROTECTED]> [2008.11.06.0657 +0100]:
> This bug is fixed by commit 56f8add211a840faaed325bd16483b55da544e93
> which is scheduled to be in 2.6.8, but was not included in 2.6.7.1.

Would you release 2.6.7.2 with it, or is that too much trouble for
you?


