Re: [CentOS] boot problem after disk change on raid1

2011-09-08 Thread Bünyamin İzzet
On Thu, Sep 8, 2011 at 8:11 AM, Devin Reade g...@gno.org wrote:

> Bünyamin İzzet bunyamin.iz...@gmail.com wrote:
>
> > On Wed, Sep 7, 2011 at 11:38 PM, Devin Reade g...@gno.org wrote:
> >
> > > [snip]
> > > # grub
> > > grub> device (hd0) /dev/sdb
> > > grub> root (hd0,0)
> > > grub> setup (hd0)
> > > grub> quit
> >
> > It still does not boot. I could not see the error message, because it is a
> > dedicated server and I am not sitting at the monitor of the server. So I
> > typed the lines in grub.conf manually to see the error (I'm not sure if
> > that is the right way to see the error).
>
> If you mean that you typed the lines I gave above into grub.conf, then
> that was not what was intended (and I doubt that it would work).  My
> intent was that you get the system booted and running normally (perhaps
> via the rescue disk), and after that execute 'grub' interactively
> and issue those commands.
>
> Devin


As you said, I booted the system via the rescue disk, executed grub, and
issued those commands. Then I rebooted the system, but it did not boot.

Then I searched Google for a way to log grub errors, but could not find
anything useful (maybe I did not look hard enough). So, in the rescue system,
I ran grub and typed the commands in grub.conf (results are below) to see
which error occurs.



# grub
Probing devices to guess BIOS drives. This may take a long time.


GNU GRUB version 0.97 (640K lower / 3072K upper memory)

[ Minimal BASH-like line editing is supported. For the first word, TAB
lists possible command completions. Anywhere else TAB lists the possible
completions of a device/filename.]
grub> root (hd1,0)
root (hd1,0)
 Filesystem type is ext2fs, partition type 0xfd
grub> kernel /vmlinuz-2.6.18-238.19.1.el5 ro root=/dev/md1 vga=0x317
kernel /vmlinuz-2.6.18-238.19.1.el5 ro root=/dev/md1 vga=0x317
   [Linux-bzImage, setup=0x1e00, size=0x1fe01c]
grub> initrd /initrd-2.6.18-238.19.1.el5.img
initrd /initrd-2.6.18-238.19.1.el5.img

Error 28: Selected item cannot fit into memory
grub> quit
quit
___
CentOS mailing list
CentOS@centos.org
http://lists.centos.org/mailman/listinfo/centos


Re: [CentOS] boot problem after disk change on raid1

2011-09-08 Thread Bünyamin İzzet
The system works just fine now. I think there was a disk check or something
at boot time that took 15-20 minutes (I reboot the system only once or twice
a year), and that made me think that the system was not booting.
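
If the delay was a periodic ext2/ext3 filesystem check (an assumption; the
message only says "a disk check or something"), the mount counters printed by
`tune2fs -l` indicate whether such a check is due. A minimal sketch follows;
`fsck_due` is a hypothetical helper, it reads `tune2fs -l` output on stdin,
and it ignores the separate time-based check interval.

```shell
# Hypothetical helper: reads `tune2fs -l <device>` output on stdin and
# succeeds (exit 0) when the mount count has reached the maximum mount
# count, which is roughly when a count-based fsck runs at boot.
fsck_due() {
    awk -F: '
        # "+ 0" forces a numeric value and skips the leading padding.
        /^Mount count:/         { count = $2 + 0 }
        /^Maximum mount count:/ { max = $2 + 0 }
        END {
            # A maximum of -1 or 0 means count-based checks are disabled.
            if (max > 0 && count >= max) exit 0
            exit 1
        }'
}

# Usage (as root), e.g. for the root array from this thread:
#   tune2fs -l /dev/md1 | fsck_due && echo "fsck is due at next boot"
```

Running this before a planned reboot of a remote machine gives some warning
that the boot may take much longer than usual.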

Thanks for your help.

Bunyamin.



[CentOS] boot problem after disk change on raid1

2011-09-07 Thread Bünyamin İzzet
Hello,

I have two disks, sda and sdb. One of them failed, so I replaced the broken
disk with a working one. I started the server in rescue mode, created the
partition table, and added all the partitions to the software RAID.



*I added the partitions to the RAID and rebooted.*

# mdadm /dev/md0 --add /dev/sdb1
# mdadm /dev/md1 --add /dev/sdb2
# mdadm /dev/md2 --add /dev/sdb3
# mdadm /dev/md3 --add /dev/sdb4
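
The partition-table and re-add steps can be scripted. The sketch below only
prints the commands (a dry run) so that they can be reviewed first.
`print_readd_plan` is a hypothetical helper; the mapping of md0..md3 to
partitions 1..4 is taken from the layout in this message, and copying the
table with `sfdisk -d` assumes MBR partition tables on both disks.

```shell
# Hypothetical dry-run helper: prints, but does not execute, the commands
# to clone the partition table from the surviving disk to the replacement
# and to re-add each partition to its array.
# Assumes mdN is built from partition N+1 on both disks, as in this thread.
print_readd_plan() {
    good=$1   # surviving disk, e.g. /dev/sda
    new=$2    # replacement disk, e.g. /dev/sdb
    count=$3  # number of RAID partitions, e.g. 4

    echo "sfdisk -d $good | sfdisk $new"
    i=1
    while [ "$i" -le "$count" ]; do
        echo "mdadm /dev/md$((i - 1)) --add $new$i"
        i=$((i + 1))
    done
}

# Example: print_readd_plan /dev/sda /dev/sdb 4
```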



*After the reboot, the server did not boot, so I did the following:*

# mount /dev/md1 /mnt/rescue
# mount /dev/md0 /mnt/rescue/boot
# mount -o bind /dev /mnt/rescue/dev
# mount -o bind /proc /mnt/rescue/proc
# mount -o bind /dev/shm /mnt/rescue/dev/shm
# mount -o bind /sys /mnt/rescue/sys
# chroot /mnt/rescue



*I checked the device.map*
# cat /boot/grub/device.map
(hd0)   /dev/sda
(hd1)   /dev/sdb



*Then I installed grub.*
# grub
Probing devices to guess BIOS drives. This may take a long time.


GNU GRUB  version 0.97  (640K lower / 3072K upper memory)

 [ Minimal BASH-like line editing is supported.  For the first word, TAB
   lists possible command completions.  Anywhere else TAB lists the possible
   completions of a device/filename.]
grub> root (hd0,0)
root (hd0,0)
 Filesystem type is ext2fs, partition type 0xfd
grub> setup (hd0)
setup (hd0)
 Checking if /boot/grub/stage1 exists... yes
 Checking if /boot/grub/stage2 exists... yes
 Checking if /boot/grub/e2fs_stage1_5 exists... yes
 Running embed /boot/grub/e2fs_stage1_5 (hd0)...  15 sectors are embedded.
succeeded
 Running install /boot/grub/stage1 (hd0) (hd0)1+15 p (hd0,0)/boot/grub/stage2 /boot/grub/grub.conf... succeeded
Done.
grub> root (hd1,0)
root (hd1,0)
 Filesystem type is ext2fs, partition type 0xfd
grub> setup (hd1)
setup (hd1)
 Checking if /boot/grub/stage1 exists... yes
 Checking if /boot/grub/stage2 exists... yes
 Checking if /boot/grub/e2fs_stage1_5 exists... yes
 Running embed /boot/grub/e2fs_stage1_5 (hd1)...  15 sectors are embedded.
succeeded
 Running install /boot/grub/stage1 (hd1) (hd1)1+15 p (hd1,0)/boot/grub/stage2 /boot/grub/grub.conf... succeeded
Done.
grub> quit
quit


*But it still does not boot. What should I do at this point? What do you
suggest?*



*Disk information*

# fdisk -l /dev/sd[ab]

Disk /dev/sda: 1500.3 GB, 1500301910016 bytes
255 heads, 63 sectors/track, 182401 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0x00090dd1

   Device Boot      Start         End      Blocks   Id  System
/dev/sda1   *           1         131     1052226   fd  Linux raid autodetect
/dev/sda2             132        5353    41945715   fd  Linux raid autodetect
/dev/sda3            5354       96733   734009850   fd  Linux raid autodetect
/dev/sda4           96734      182401   688128210   fd  Linux raid autodetect

Disk /dev/sdb: 1500.3 GB, 1500301910016 bytes
255 heads, 63 sectors/track, 182401 cylinders
Units = cylinders of 16065 * 512 = 8225280 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disk identifier: 0xf5c3aa6a

   Device Boot      Start         End      Blocks   Id  System
/dev/sdb1   *           1         131     1052226   fd  Linux raid autodetect
/dev/sdb2             132        5353    41945715   fd  Linux raid autodetect
/dev/sdb3            5354       96733   734009850   fd  Linux raid autodetect
/dev/sdb4           96734      182401   688128210   fd  Linux raid autodetect



*Raid info*

# cat /proc/mdstat
Personalities : [raid1]
md3 : active raid1 sda4[0] sdb4[2]
      688128128 blocks [2/1] [U_]
        resync=DELAYED

md2 : active raid1 sda3[0] sdb3[2]
      734009728 blocks [2/1] [U_]
      [>....................]  recovery =  1.8% (13651648/734009728) finish=133.5min speed=89915K/sec

md1 : active raid1 sdb2[2] sda2[0]
      41945600 blocks [2/1] [U_]
        resync=DELAYED

md0 : active raid1 sda1[0] sdb1[1]
      1052160 blocks [2/2] [UU]

unused devices: <none>
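
In the output above, md1, md2 and md3 still show [2/1] [U_], meaning only one
of the two members is in sync, while md0 is complete. A minimal sketch for
listing such arrays; `degraded_arrays` is a hypothetical helper, not part of
mdadm, and it relies only on the mdstat layout shown here.

```shell
# Hypothetical helper: prints the name of every md array whose status
# field contains "_" (a missing or not-yet-synced member), reading
# mdstat-formatted text from the file given as $1 (default /proc/mdstat).
degraded_arrays() {
    awk '
        /^md[0-9]+ :/ { name = $1 }
        # The status line looks like: "688128128 blocks [2/1] [U_]"
        / blocks / && $NF ~ /^\[[U_]*_[U_]*\]$/ { print name }
    ' "${1:-/proc/mdstat}"
}

# Usage: degraded_arrays            # inspect the running system
#        degraded_arrays saved.txt  # inspect a saved copy of /proc/mdstat
```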



*Mount information*

*/dev/md0 is mounted on /boot*
*/dev/md1 is mounted on /*
*/dev/md2 and /dev/md3 are for backups and some personal data*



*Here is the grub.conf*

# cat /mnt/rescue/boot/grub/grub.conf

timeout 5
default 0

fallback 1

title CentOS (2.6.18-238.19.1.el5)
root (hd1,0)
kernel /vmlinuz-2.6.18-238.19.1.el5 ro root=/dev/md1 vga=0x317
initrd /initrd-2.6.18-238.19.1.el5.img

title CentOS (2.6.18-238.9.1.el5)
root (hd0,0)
kernel /vmlinuz-2.6.18-238.9.1.el5 ro root=/dev/md1 vga=0x317
initrd /initrd-2.6.18-238.9.1.el5.img

title CentOS (2.6.18-194.26.1.el5)
root (hd0,0)
kernel /vmlinuz-2.6.18-194.26.1.el5 ro root=/dev/md1 vga=0x317
initrd /initrd-2.6.18-194.26.1.el5.img

title CentOS (2.6.18-194.3.1.el5)
root (hd0,0)
kernel /vmlinuz-2.6.18-194.3.1.el5 ro root=/dev/md1 vga=0x317
initrd /initrd-2.6.18-194.3.1.el5.img

title CentOS (2.6.18-164.15.1.el5)
root (hd0,0)
kernel /vmlinuz-2.6.18-164.15.1.el5 ro root=/dev/md1 vga=0x317
initrd 

Re: [CentOS] boot problem after disk change on raid1

2011-09-07 Thread Devin Reade
--On Wednesday, September 07, 2011 08:22:47 PM +0300 Bünyamin İzzet
bunyamin.iz...@gmail.com wrote:

> Hello,
>
> I have two disks, sda and sdb. One of them failed, so I replaced the broken
> disk with a working one. I started the server in rescue mode, created the
> partition table, and added all the partitions to the software RAID.

Sounds like the 'no boot record' issue.  Here's an extract from one
of my server change logs, for setting things up *before* they go bad:

- made sure we have boot blocks on both disks, based on information
  at http://grub.enbug.org/MirroringRAID
  modified /boot/grub/device.map from:
        (hd0) /dev/sda
        (hd1) /dev/sdb
  to:
        (hd0) /dev/sda
        (hd0) /dev/sdb
  and then:
        # grub
        grub> device (hd0) /dev/sdb
        grub> root (hd0,0)
        grub> setup (hd0)
        grub> quit

With the above in place you can boot from either disk as long as 
your BIOS will let you select the boot device.  I've had older
systems where the BIOS didn't have that option, in which case
the 'in event of breakage' procedure would be to take the working
slave disk and put it on the controller cable that originally had
the flawed master disk.
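
The grub session in the extract above can also be fed to the GRUB legacy
shell non-interactively through its `--batch` option. A minimal sketch;
`grub_mbr_script` is a hypothetical helper that only prints the commands, so
the output can be inspected before it is piped into grub.

```shell
# Hypothetical helper: prints the GRUB legacy commands that install the
# boot loader on the given disk while addressing it as (hd0), matching
# the session above. Assumes /boot is the first partition on that disk.
grub_mbr_script() {
    disk=$1   # e.g. /dev/sdb
    cat <<EOF
device (hd0) $disk
root (hd0,0)
setup (hd0)
quit
EOF
}

# Usage (as root, from the running or chrooted system):
#   grub_mbr_script /dev/sdb | grub --batch
```

Mapping the second disk to (hd0) before running setup is the point of the
procedure: if the first disk is gone, the BIOS presents the survivor as the
first drive.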

> # cat /boot/grub/device.map
> (hd0)   /dev/sda
> (hd1)   /dev/sdb

Not quite right (need hd0 on both)

Devin



Re: [CentOS] boot problem after disk change on raid1

2011-09-07 Thread Devin Reade
--On Wednesday, September 07, 2011 02:38:14 PM -0600 Devin Reade
g...@gno.org wrote:

> Sounds like the 'no boot record' issue.  Here's an extract from one
> of my server change logs, for setting things up *before* they go bad:

I should add that, with a tested boot-from-alternate disk in place
I've been able to avoid the boot into rescue mode procedure, thus
minimizing down time.  Ignoring the case of hot-swap devices and
hot spares:

   - shut down system
   - remove faulty disk
   - add in replacement disk
   - reboot to normal run level (3 or 5).  RAID is now running in
     degraded mode, but you're back in operation
   - do disk partitioning, MBR init, and reconstruction while in service
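
Before relying on the alternate disk, it may help to confirm that its first
sector carries a boot record at all. A rough check, assuming GRUB legacy
(0.97, as used in this thread), whose stage1 embeds the string "GRUB";
`mbr_has_grub` is a hypothetical helper and the result is only a heuristic.

```shell
# Hypothetical helper: succeeds when the first 512-byte sector of the
# given disk (or image file) ends with the 0x55 0xAA boot signature and
# contains the string "GRUB", which GRUB legacy's stage1 embeds.
# This is a heuristic, not proof that the disk will actually boot.
mbr_has_grub() {
    sig=$(dd if="$1" bs=1 skip=510 count=2 2>/dev/null | od -An -tx1 | tr -d ' \n')
    [ "$sig" = "55aa" ] || return 1
    dd if="$1" bs=512 count=1 2>/dev/null | LC_ALL=C grep -q GRUB
}

# Usage (as root):
#   for d in /dev/sda /dev/sdb; do
#       mbr_has_grub "$d" && echo "$d: GRUB boot sector" || echo "$d: no GRUB boot sector"
#   done
```

The only real test remains an actual boot from the second disk, as described
above.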

Devin



Re: [CentOS] boot problem after disk change on raid1

2011-09-07 Thread Bünyamin İzzet
On Wed, Sep 7, 2011 at 11:38 PM, Devin Reade g...@gno.org wrote:

> --On Wednesday, September 07, 2011 08:22:47 PM +0300 Bünyamin İzzet
> bunyamin.iz...@gmail.com wrote:
>
> - made sure we have boot blocks on both disks, based on information
>   at http://grub.enbug.org/MirroringRAID
>   modified /boot/grub/device.map from:
>         (hd0) /dev/sda
>         (hd1) /dev/sdb
>   to:
>         (hd0) /dev/sda
>         (hd0) /dev/sdb

I have changed the device.map as you said.

>   and then:
>         # grub
>         grub> device (hd0) /dev/sdb
>         grub> root (hd0,0)
>         grub> setup (hd0)
>         grub> quit

And that is done too.


It still does not boot. I could not see the error message, because it is a
dedicated server and I am not sitting at the monitor of the server. So I
typed the lines in grub.conf manually to see the error (I'm not sure if that
is the right way to see the error).


# grub
Probing devices to guess BIOS drives. This may take a long time.


GNU GRUB  version 0.97  (640K lower / 3072K upper memory)

 [ Minimal BASH-like line editing is supported.  For the first word, TAB
   lists possible command completions.  Anywhere else TAB lists the possible
   completions of a device/filename.]
grub> root (hd1,0)
root (hd1,0)
 Filesystem type is ext2fs, partition type 0xfd
grub> kernel /vmlinuz-2.6.18-238.19.1.el5 ro root=/dev/md1 vga=0x317
kernel /vmlinuz-2.6.18-238.19.1.el5 ro root=/dev/md1 vga=0x317
   [Linux-bzImage, setup=0x1e00, size=0x1fe01c]
grub> initrd /initrd-2.6.18-238.19.1.el5.img
initrd /initrd-2.6.18-238.19.1.el5.img

Error 28: Selected item cannot fit into memory
grub> quit
quit


If that is not the right way to see the error message from boot time, is
there any way to log grub errors or boot errors?

Thanks,
Bunyamin.


Re: [CentOS] boot problem after disk change on raid1

2011-09-07 Thread Devin Reade
Bünyamin İzzet bunyamin.iz...@gmail.com wrote:

> On Wed, Sep 7, 2011 at 11:38 PM, Devin Reade g...@gno.org wrote:
>
> > [snip]
> > # grub
> > grub> device (hd0) /dev/sdb
> > grub> root (hd0,0)
> > grub> setup (hd0)
> > grub> quit
>
> It still does not boot. I could not see the error message, because it is a
> dedicated server and I am not sitting at the monitor of the server. So I
> typed the lines in grub.conf manually to see the error (I'm not sure if that
> is the right way to see the error).

If you mean that you typed the lines I gave above into grub.conf, then
that was not what was intended (and I doubt that it would work).  My 
intent was that you get the system booted and running normally (perhaps
via the rescue disk), and after that execute 'grub' interactively
and issue those commands.

Devin
