Re: n2100, failed raid 1 disk: cannot shutdown

2008-05-20 Thread Michele Bini

Lennart Sorensen wrote:

What's the point of raid1 if the machine crashes and looses current data
if a disk fails?  That's what most likely will happen when you loose
part of swap.


As I said I simply didn't think about it: as a sysadmin I'm a rookie.


[...]
There is no way you can shutdown with swap in use that can't be
accessed.


Ok, so I'll go for the power cord...

Thank you all,
Michele


--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Re: n2100, failed raid 1 disk: cannot shutdown

2008-05-20 Thread Lennart Sorensen
On Sat, May 17, 2008 at 05:35:58PM +0200, Michele Bini wrote:
> Uhm, you are obviously right: let's say that when I partitioned the 
> disks I didn't have swap space resiliency at the front of my thoughts.

What's the point of raid1 if the machine crashes and looses current data
if a disk fails?  That's what most likely will happen when you loose
part of swap.

> The swap partition on the dead disk actually had some swap in use (48 
> blocks, per /proc/swaps) but swapoff seems to be capable to stop using 
> it or at least it doesn't complain and the dead partition disappears 
> from /proc/swaps.
> 
> But still no reboot. Do you think it is really messed up (I don't 
> understand how the process(es) those 48 pages belonged to can survive 
> without them) and there is no hope of a clean shutdown?

There is no way you can shutdown with swap in use that can't be
accessed.

-- 
Len Sorensen


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Re: n2100, failed raid 1 disk: cannot shutdown

2008-05-18 Thread Tobias Frost
Before that, sync and try to force mount them readonly Just to make
sure.

On Sat, 2008-05-17 at 11:07 +0200, Joachim Beckers wrote:
> Can you power off just that disk? i.e. by unplugging it's power  
> connector. That should definitely stop the kernel from trying to  
> access it, right? :-)
> 


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Re: n2100, failed raid 1 disk: cannot shutdown

2008-05-17 Thread Michele Bini

Paul Brook wrote:

I have the following configuration: a thecus n2100 with two disks
configured as raid1 (actually there are three partitions on each disk: a
non-mirrored swap partition


Rookie mistake. If either drive goes down you suddenly loose swap, and the 
whole machine is foobared.


Uhm, you are obviously right: let's say that when I partitioned the 
disks I didn't have swap space resiliency at the front of my thoughts.


The swap partition on the dead disk actually had some swap in use (48 
blocks, per /proc/swaps) but swapoff seems to be capable to stop using 
it or at least it doesn't complain and the dead partition disappears 
from /proc/swaps.


But still no reboot. Do you think it is really messed up (I don't 
understand how the process(es) those 48 pages belonged to can survive 
without them) and there is no hope of a clean shutdown?


linux will do load ballancing over multiple swap 
partitions, however it does not provide any redundancy.


Paul





--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Re: n2100, failed raid 1 disk: cannot shutdown

2008-05-17 Thread Paul Brook
> I have the following configuration: a thecus n2100 with two disks
> configured as raid1 (actually there are three partitions on each disk: a
> non-mirrored swap partition

Rookie mistake. If either drive goes down you suddenly loose swap, and the 
whole machine is foobared.  linux will do load ballancing over multiple swap 
partitions, however it does not provide any redundancy.

Paul


-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Re: n2100, failed raid 1 disk: cannot shutdown

2008-05-17 Thread Joachim Beckers
Can you power off just that disk? i.e. by unplugging it's power  
connector. That should definitely stop the kernel from trying to  
access it, right? :-)


On 17 May 2008, at 11:02, Michele Bini wrote:


Michele Bini wrote:
[...] But then I tried to shutdown (via shutdown -r now) and I  
cannot.


I forgot to say that the failed disk (/dev/sda) is completely dead  
(probably on-board electronics problem, as accessing it gives i/o  
errors on each and every block and the hd light is stuck on).


Also, blindly guessing, I tried and removed the dead disk from the  
array:


# mdadm /dev/md0 -r /dev/sda1
# mdadm /dev/md1 -r /dev/sda3

but nothing changed.


--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]




--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]



Re: n2100, failed raid 1 disk: cannot shutdown

2008-05-17 Thread Michele Bini

Michele Bini wrote:
[...] 
But then I tried to shutdown (via shutdown -r now) and I cannot.


I forgot to say that the failed disk (/dev/sda) is completely dead 
(probably on-board electronics problem, as accessing it gives i/o errors 
on each and every block and the hd light is stuck on).


Also, blindly guessing, I tried and removed the dead disk from the array:

# mdadm /dev/md0 -r /dev/sda1
# mdadm /dev/md1 -r /dev/sda3

but nothing changed.


--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]